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PREFACE 


This edition, like its predecessors, is written from the viewpoint of the applied mathe- 
matician, whose interest in differential equations may sometimes be quite theoretical, 
sometimes intensely practical, and often somewhere in between. We have sought to 
combine a sound and accurate (but not abstract) exposition of the elementary theory 
of differential equations with considerable material on methods of solution, analysis, 
and approximation that have proved useful in a wide variety of applications. 

The book is written primarily for undergraduate students of mathematics, science, 
or engineering, who typically take a course on differential equations during their first 
or second year of study. The main prerequisite for reading the book is a working 
knowledge of calculus, gained from a normal two- or three-semester course sequence 
or its equivalent. Some familiarity with matrices will also be helpful in the chapters 
on systems of differential equations. 

To be widely useful a textbook must be adaptable to a variety of instructional 
strategies. This implies at least two things. First, instructors should have maximum 
flexibility to choose both the particular topics that they wish to cover and also the 
order in which they want to cover them. Second, the book should be useful to students 
having access to a wide range of technological capability. 

With respect to content, we provide this flexibility by making sure that, so far as 
possible, individual chapters are independent of each other. Thus, after the basic 
parts of the first three chapters are completed (roughly Sections 1.1 through 1.3, 
2.1 through 2.5, and 3.1 through 3.5), the selection of additional topics, and the order 
and depth in which they are covered, is at the discretion of the instructor. Chapters 
4 through 11 are essentially independent of each other, except that Chapter 7 should 
precede Chapter 9 and that Chapter 10 should precede Chapter 11. This means that 
there are multiple pathways through the book and many different combinations have 
been used effectively with earlier editions. 
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With respect to technology, we note repeatedly in the text that computers are 
extremely useful for investigating differential equations and their solutions, and many 
of the problems are best approached with computational assistance. Nevertheless, 
the book is adaptable to courses having various levels of computer involvement, 
ranging from little or none to intensive. The text is independent of any particular 
hardware platform or software package. 

Many problems are marked with the symbol to indicate that we consider them 
to be technologically intensive. Computers have at least three important uses in a dif- 
ferential equations course. The first is simply to crunch numbers, thereby generating 
accurate numerical approximations to solutions. The second is to carry out symbolic 
manipulations that would be tedious and time-consuming to do by hand. Finally, 
and perhaps most important of all, is the ability to translate the results of numerical 
or symbolic computations into graphical form, so that the behavior of solutions can 
be easily visualized. The marked problems typically involve one or more of these 
features. Naturally, the designation of a problem as technologically intensive is a 
somewhat subjective judgment, and the is intended only as a guide. Many of the 
marked problems can be solved, at least in part, without computational help, and a 
computer can also be used effectively on many of the unmarked problems. 

From a student’s point of view, the problems that are assigned as homework and 
that appear on examinations drive the course. We believe that the most outstanding 
feature of this book is the number, and above all the variety and range, of the problems 
that it contains. Many problems are entirely straightforward, but many others are 
more challenging, and some are fairly open-ended, and can serve as the basis for 
independent student projects. There are far more problems than any instructor can 
use in any given course, and this provides instructors with a multitude of choices in 
tailoring their course to meet their own goals and the needs of their students. 

The motivation for solving many differential equations is the desire to learn some- 
thing about an underlying physical process that the equation is believed to model. 
It is basic to the importance of differential equations that even the simplest equa- 
tions correspond to useful physical models, such as exponential growth and decay, 
spring-mass systems, or electrical circuits. Gaining an understanding of a complex 
natural process is usually accomplished by combining or building upon simpler and 
more basic models. Thus a thorough knowledge of these basic models, the equations 
that describe them, and their solutions, is the first and indispensable step toward the 
solution of more complex and realistic problems. We describe the modeling process 
in detail in Sections 1.1, 1.2, and 2.3. Careful constructions of models appear also in 
Sections 2.5, 3.7, and in the appendices to Chapter 10. Differential equations result- 
ing from the modeling process appear frequently throughout the book, especially in 
the problem sets. 

The main reason for including fairly extensive material on applications and math- 
ematical modeling in a book on differential equations is to persuade students that 
mathematical modeling often leads to differential equations, and that differential 
equations are part of an investigation of problems in a wide variety of other fields. 
We also emphasize the transportability of mathematical knowledge: once you master 
a particular solution method, you can use it in any field of application in which an 
appropriate differential equation arises. Once these points are convincingly made, 
we believe that it is unnecessary to provide specific applications of every method 
of solution or type of equation that we consider. This helps to keep this book to 
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a reasonable size, and in any case, there is only a limited time in most differential 
equations courses to discuss modeling and applications. 

Nonroutine problems often require the use of a variety of tools, both analytical 
and numerical. Paper and pencil methods must often be combined with effective 
use of a computer. Quantitative results and graphs, often produced by a computer, 
serve to illustrate and clarify conclusions that may be obscured by complicated ana- 
lytical expressions. On the other hand, the implementation of an efficient numerical 
procedure typically rests on a good deal of preliminary analysis — to determine the 
qualitative features of the solution as a guide to computation, to investigate limit- 
ing or special cases, or to discover which ranges of the variables or parameters may 
require or merit special attention. Thus, a student should come to realize that investi- 
gating a difficult problem may well require both analysis and computation; that good 
judgment may be required to determine which tool is best-suited for a particular task; 
and that results can often be presented in a variety of forms. 

We believe that it is important for students to understand that (except perhaps in 
courses on differential equations) the goal of solving a differential equation is seldom 
simply to obtain the solution. Rather, one is interested in the solution in order to 
obtain insight into the behavior of the process that the equation purports to model. 
In other words, the solution is not an end in itself. Thus, we have included a great 
many problems, as well as some examples in the text, that call for conclusions to be 
drawn about the solution. Sometimes this takes the form of asking for the value of the 
independent variable at which the solution has a certain property, or to determine the 
long term behavior of the solution. Other problems ask for the effect of variations in 
a parameter, or for the determination of a critical value of a parameter at which the 
solution experiences a substantial change. Such problems are typical of those that 
arise in the applications of differential equations, and, depending on the goals of the 
course, an instructor has the option of assigning few or many of these problems. 

Readers familiar with the preceding edition will observe that the general structure 
of the book is unchanged. The revisions that we have made in this edition have several 
goals: to streamline the presentation in a few places, to make the presentation more 
visual by adding some new figures, and to improve the exposition by including several 
new or improved examples. More specifically, the most important changes are the 
following: 

1. We have removed the discussion of linear dependence and independence from Chapter 3 
(Second Order Linear Equations), where it is difficult to explain their importance, and 
introduced these concepts later in Chapter 4 (Higher Order Linear Equations) and in 
Chapter 7 (Linear Systems), where they appear more naturally. This results in a smaller 
block of theoretical material at the beginning of Chapter 3. Since not all courses cover 
Chapter 4, we also avoid using the words “linear dependence” and “linear independence” 
in Chapters 5 (Power Series Solutions) and 6 (Laplace Transforms). 

2. Sections 5.4 (Regular Singular Points) and 5.5 (Euler Equations) from the eighth edition 
have now been combined into a single section, with Euler equations appearing first. They 
are then used as the prototype of equations having regular singular points, resulting in a 
somewhat briefer and more compact presentation. 

3. Chapter 9 (Nonlinear Autonomous Systems) has several modifications. The concept of 
basins of attraction now appears earlier (in Section 9.2). This section also includes two 
new examples and three new figures with the goal of providing visual evidence that near 
a critical point nonlinear systems (usually) behave very much like linear systems. Such 
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systems are now referred to as “locally linear.” In the following sections the Jacobian 
matrix is used more systematically to construct these linear approximations. There are 
also about 25 new problems in this chapter. 

4. In response to suggestions from several users, we have begun the discussion of forced 
linear oscillators in Section 3.8 with an example, rather than a general presentation. 

5. There are eight new problems on Euler equations (making a total of fifteen) in Sections 
3.3 and 3.4. This will enable instructors to cover this topic, if they wish, even if Chapter 5 
is not to be used. Euler equations appear in the text in Section 5.4. 

6. There are several revisions in Chapter 6 clarifying the integration of piecewise continuous 
functions, the essential uniqueness of the Laplace transform, and the use of the delta 
function. In addition, there is a new example, a new figure, and six new problems on the 
use of the unit step function to represent more complicated step functions. 

7. The list of 32 miscellaneous problems at the end of Chapter 2 has been substantially revised 
and includes nine new problems. This list now better reflects the contents of Chapter 2. 

The authors have found differential equations to be a never ending source of 
interesting, and sometimes surprising, results and phenomena. We hope that users 
of this book, both students and instructors, will share our enthusiasm for the subject. 

William E. Boyce 
Grafton, New York 
June 26, 2008 
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Supplemental Resources for Instructors and Students 


A passcode for access to ODE Architect is included with every copy of the text. 
ODE Architect is a prize-winning NSF-sponsored learning software package, which 
is Windows-compatible. A solver tool enables the user to build models with ordinary 
differential equations and study them in an interactive point-and-click environment. 
The Architect includes an interactive library of more than one hundred model differ- 
ential equation systems with graphs of solutions. Additionally, fourteen interactive 
multimedia modules provide a range of models and phenomena, from a golf game 
to chaos. 

An Instructor’s Solutions Manual, ISBN 978-0-470-42473-5, includes solutions for 
all problems not contained in the Student Solutions Manual. 

A Student Solutions Manual, ISBN 978-0-470-38335-3, includes solutions for se- 
lected problems in the text. 

A Book Companion Site, www.wiley.com/college/boyce, provides a wealth of re- 
sources for students and instructors, including: 

• PowerPoint slides of important definitions, examples, and theorems from the 
book, as well as graphics for presentation in lectures or for study and note taking. 

• Chapter Review Sheets enable students to test their knowledge of key concepts. 
For further review, diagnostic feedback is provided that refers to pertinent sec- 
tions in the text. 

• ODE Architect: Getting Started. This guide introduces students and professors 
to ODE Architect’s simulations and multimedia. 

• Additional problems for use with Mathematica, Maple, and Matlab, providing 
opportunities for further exploration of important concepts using these com- 
puter algebra and numerical analysis packages. 

A series of supplemental guidebooks, also published by John Wiley & Sons, can be 
used with Boyce/DiPrima in order to incorporate computing technologies into the 
course. These books emphasize numerical methods and graphical analysis, showing 
how these methods enable us to interpret solutions of ODEs in the real world. Sep- 
arate guidebooks cover each of the three major mathematical software formats, but 
the ODE subject matter is the same in each. 

• Hunt, Lipsman, Osborn, Rosenberg, Differential Equations with Matlab , 2nd 
ed„ © 2005, ISBN 978-0-471-71812-3 

• Hunt, Lardy, Lipsman, Osborn, Rosenberg, Differential Equations with Maple , 
3rd ed„ © 2009, ISBN 978-0-471-77317-7 

• Hunt, Outing, Lipsman, Osborn, Rosenberg, Differential Equations with Math- 
ematica , 3rd ed., © 2009, ISBN 978-0-471-77316-0 


WileyPLUS 

WileyPLUS, Wiley’s digital-learning environment, is loaded with all of the supple- 
ments above, and also features: 

• the E-book, which is an exact version of the print text, but also features hyper- 
links to questions, definitions, and supplements for quicker and easier support. 
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• Guided Online (GO) Exercises, which prompt students to build solutions step- 
by-step. Rather than simply grading an exercise answer as wrong, GO problems 
show students precisely where they are making a mistake. 

• homework management tools, which easily enable instructors to assign and grade 
questions, as well as gauge student comprehension. 

• QuickStart pre-designed reading and homework assignments. Use them as-is 
or customize them to fit the needs of your classroom. 
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CHAPTER 


1 


Introduction 


In this chapter we give perspective to your study of differential equations in several 
different ways. First, we use two problems to illustrate some of the basic ideas that 
we will return to, and elaborate upon, frequently throughout the remainder of the 
book. Later, to provide organizational structure for the book, we indicate several 
ways of classifying differential equations. Finally, we outline some of the major trends 
in the historical development of the subject and mention a few of the outstanding 
mathematicians who have contributed to it. The study of differential equations has 
attracted the attention of many of the world’s greatest mathematicians during the 
past three centuries. Nevertheless, it remains a dynamic field of inquiry today, with 
many interesting open questions. 


1.1 Some Basic Mathematical Models; Direction Fields 


Before embarking on a serious study of differential equations (for example, by read- 
ing this book or major portions of it), you should have some idea of the possible 
benefits to be gained by doing so. For some students the intrinsic interest of the 
subject itself is enough motivation, but for most it is the likelihood of important 
applications to other fields that makes the undertaking worthwhile. 

Many of the principles, or laws, underlying the behavior of the natural world are 
statements or relations involving rates at which things happen. When expressed 
in mathematical terms, the relations are equations and the rates are derivatives. 
Equations containing derivatives are differential equations. Therefore, to understand 
and to investigate problems involving the motion of fluids, the flow of current in 
electric circuits, the dissipation of heat in solid objects, the propagation and detection 
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Chapter 1. Introduction 


EXAMPLE 

1 

A Falling 
Object 


of seismic waves, or the increase or decrease of populations, among many others, it 
is necessary to know something about differential equations. 

A differential equation that describes some physical process is often called a math- 
ematical model of the process, and many such models are discussed throughout this 
book. In this section we begin with two models leading to equations that are easy to 
solve. It is noteworthy that even the simplest differential equations provide useful 
models of important physical processes. 


Suppose that an object is falling in the atmosphere near sea level. Formulate a differential 
equation that describes the motion. 

We begin by introducing letters to represent various quantities that may be of interest in this 
problem. The motion takes place during a certain time interval, so let us use t to denote time. 
Also, let us use v to represent the velocity of the falling object. The velocity will presumably 
change with time, so we think of v as a function of t; in other words, t is the independent 
variable and v is the dependent variable. The choice of units of measurement is somewhat 
arbitrary, and there is nothing in the statement of the problem to suggest appropriate units, 
so we are free to make any choice that seems reasonable. To be specific, let us measure time 
t in seconds and velocity v in meters/second. Further, we will assume that v is positive in the 
downward direction — that is, when the object is falling. 

The physical law that governs the motion of objects is Newton’s second law, which states 
that the mass of the object times its acceleration is equal to the net force on the object. In 
mathematical terms this law is expressed by the equation 

F — ma , (1) 

where m is the mass of the object, a is its acceleration, and F is the net force exerted on the 
object. To keep our units consistent, we will measure m in kilograms, a in meters/second 2 , and 
F in newtons. Of course, a is related to v by a = dv/dt , so we can rewrite Eq. (1) in the form 

F = m(dv/dt). (2) 

Next, consider the forces that act on the object as it falls. Gravity exerts a force equal to the 
weight of the object, or mg , where g is the acceleration due to gravity. In the units we have 
chosen, g has been determined experimentally to be approximately equal to 9.8 m/s 2 near 
the earth’s surface. There is also a force due to air resistance, or drag, that is more difficult 
to model. This is not the place for an extended discussion of the drag force; suffice it to say 
that it is often assumed that the drag is proportional to the velocity, and we will make that 
assumption here. Thus the drag force has the magnitude yv, where y is a constant called the 
drag coefficient. The numerical value of the drag coefficient varies widely from one object 
to another; smooth streamlined objects have much smaller drag coefficients than rough blunt 
ones. The physical units for y are mass/time, or kg/s for this problem; if these units seem 
peculiar, remember that yv must have the units of force, namely, kg-m/s 2 . 

In writing an expression for the net force F, we need to remember that gravity always acts 
in the downward (positive) direction, whereas drag acts in the upward (negative) direction, as 


shown in Figure 1.1.1. Thus 

F = mg — yv 

(3) 

and Eq. (2) then becomes 

dv 

(4) 


m-—= mg — yv. 
dt 


Equation (4) is a mathematical model of an object falling in the atmosphere near sea level. 
Note that the model contains the three constants m, g , and y. The constants m and y depend 
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very much on the particular object that is falling, and they are usually different for different 
objects. It is common to refer to them as parameters, since they may take on a range of values 
during the course of an experiment. On the other hand, g is a physical constant, whose value 
is the same for all objects. 


yv 

Qm 

mg 

FIGURE 1.1.1 Free-body diagram of the forces on a falling object. 


To solve Eq. (4), we need to find a function v = v(t) that satisfies the equation. It 
is not hard to do this, and we will show you how in the next section. For the present, 
however, let us see what we can learn about solutions without actually finding any 
of them. Our task is simplified slightly if we assign numerical values to m and y, but 
the procedure is the same regardless of which values we choose. So, let us suppose 
that m = 10 kg and y = 2 kg/s. Then Eq. (4) can be rewritten as 



v 

5' 


( 5 ) 


EXAMPLE 

2 


A Falling 
Object 
(continued) 


Investigate the behavior of solutions of Eq. (5) without solving the differential equation. 

We will proceed by looking at Eq. (5) from a geometrical viewpoint. Suppose that the 
velocity v has a certain given value. Then, by evaluating the right side of Eq. (5), we can find 
the corresponding value of dv/dt. For instance, if v — 40, then dv/dt = 1.8. This means that 
the slope of a solution v = v{t) has the value 1.8 at any point where v = 40. We can display this 
information graphically in the tn-plane by drawing short line segments with slope 1.8 at several 
points on the line v = 40. Similarly, if v = 50, then dv/dt = —0.2, so we draw line segments 
with slope —0.2 at several points on the line v = 50. We obtain Figure 1.1.2 by proceeding in 
the same way with other values of v. Figure 1.1.2 is an example of what is called a direction 
field or sometimes a slope field. 

Remember that a solution of Eq. (5) is a function v = v{t) whose graph is a curve in the 
/v-plane. The importance of Figure 1.1.2 is that each line segment is a tangent line to one of 
these solution curves. Thus, even though we have not found any solutions, and no graphs of 
solutions appear in the figure, we can nonetheless draw some qualitative conclusions about 
the behavior of solutions. For instance, if v is less than a certain critical value, then all the line 
segments have positive slopes, and the speed of the falling object increases as it falls. On the 
other hand, if v is greater than the critical value, then the line segments have negative slopes, 
and the falling object slows down as it falls. What is this critical value of v that separates objects 
whose speed is increasing from those whose speed is decreasing? Referring again to Eq. (5), 
we ask what value of v will cause dv/dt to be zero. The answer is v = (5) (9.8) = 49 m/s. 

In fact, the constant function v(t) = 49 is a solution of Eq. (5). To verify this statement, 
substitute v(t) = 49 into Eq. (5) and observe that each side of the equation is zero. Because 
it does not change with time, the solution v{t) = 49 is called an equilibrium solution. It is the 
solution that corresponds to a perfect balance between gravity and drag. In Figure 1.1.3 we 
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show the equilibrium solution v(t ) = 49 superimposed on the direction Held. From this figure 
we can draw another conclusion, namely, that all other solutions seem to be converging to the 
equilibrium solution as t increases. 




FIGURE 1.1.3 Direction field and equilibrium solution for Eq. (5). 


The approach illustrated in Example 2 can be applied equally well to the more 
general Eq. (4), where the parameters m and y are unspecified positive numbers. The 
results are essentially identical to those of Example 2. The equilibrium solution of 
Eq. (4) is v(t) = mg/y. Solutions below the equilibrium solution increase with time, 
those above it decrease with time, and all other solutions approach the equilibrium 
solution as t becomes large. 
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Direction Fields. Direction fields are valuable tools in studying the solutions of differ- 
ential equations of the form 

( 6 ) 

where / is a given function of the two variables t and y, sometimes referred to as the 
rate function. A direction field for equations of the form (6) can be constructed by 
evaluating / at each point of a rectangular grid. At each point of the grid, a short line 
segment is drawn whose slope is the value of/ at that point. Thus each line segment 
is tangent to the graph of the solution passing through that point. A direction field 
drawn on a fairly fine grid gives a good picture of the overall behavior of solutions of 
a differential equation. Usually a grid consisting of a few hundred points is sufficient. 
The construction of a direction field is often a useful first step in the investigation of 
a differential equation. 

Two observations are worth particular mention. First, in constructing a direction 
field, we do not have to solve Eq. (6), but merely to evaluate the given function /(t,y) 
many times. Thus direction fields can be readily constructed even for equations that 
may be quite difficult to solve. Second, repeated evaluation of a given function is a 
task for which a computer is well suited, and you should usually use a computer to 
draw a direction field. All the direction fields shown in this book, such as the one in 
Figure 1.1.2, were computer-generated. 


Field Mice and Owls. Now let us look at another, quite different example. Consider a 
population of field mice who inhabit a certain rural area. In the absence of preda- 
tors we assume that the mouse population increases at a rate proportional to the 
current population. This assumption is not a well-established physical law (as New- 
ton’s law of motion is in Example 1), but it is a common initial hypothesis 1 in a 
study of population growth. If we denote time by t and the mouse population by 
p(t), then the assumption about population growth can be expressed by the equa- 
tion 



( 7 ) 


where the proportionality factor r is called the rate constant or growth rate. To be 
specific, suppose that time is measured in months and that the rate constant r has the 
value 0.5/month. Then each term in Eq. (7) has the units of mice/month. 

Now let us add to the problem by supposing that several owls live in the same 
neighborhood and that they kill 15 field mice per day. To incorporate this information 
into the model, we must add another term to the differential equation (7), so that it 
becomes 

^ = 0.5 p - 450. (8) 


Observe that the predation term is —450 rather than —15 because time is measured 
in months and the monthly predation rate is needed. 


1 A better model of population growth is discussed in Section 2.5. 
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EXAMPLE 

3 


Investigate the solutions of Eq. (8) graphically. 

A direction field for Eq. (8) is shown in Figure 1.1.4. For sufficiently large values of p it can 
be seen from the figure, or directly from Eq. (8) itself, that dp/dt is positive, so that solutions 
increase. On the other hand, if p is small, than dp / dt is negative and solutions decrease. Again, 
the critical value of p that separates solutions that increase from those that decrease is the 
value of p for which dp/dt is zero. By setting dp/dt equal to zero in Eq. (8) and then solving 
for p, we find the equilibrium solution p(t) = 900 for which the growth term and the predation 
term in Eq. (8) are exactly balanced. The equilibrium solution is also shown in Figure 1.1.4. 



FIGURE 1.1.4 Direction field and equilibrium solution for Eq. (8). 


Comparing Examples 2 and 3, we note that in both cases the equilibrium solution 
separates increasing from decreasing solutions. In Example 2 other solutions con- 
verge to, or are attracted by, the equilibrium solution, so that after the object falls far 
enough, an observer will see it moving at very nearly the equilibrium velocity. On 
the other hand, in Example 3 other solutions diverge from, or are repelled by, the 
equilibrium solution. Solutions behave very differently depending on whether they 
start above or below the equilibrium solution. As time passes, an observer might see 
populations either much larger or much smaller than the equilibrium population, but 
the equilibrium solution itself will not, in practice, be observed. In both problems, 
however, the equilibrium solution is very important in understanding how solutions 
of the given differential equation behave. 

A more general version of Eq. (8) is 

where the growth rate r and the predation rate k are unspecified. Solutions of this 
more general equation are very similar to those of Eq. (8). The equilibrium solution 
of Eq. (9) is p{t) = k/r. Solutions above the equilibrium solution increase, while 
those below it decrease. 

You should keep in mind that both of the models discussed in this section have 
their limitations. The model (5) of the falling object is valid only as long as the 
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object is falling freely, without encountering any obstacles. The population model 
(8) eventually predicts negative numbers of mice (if p < 900) or enormously large 
numbers (if p > 900). Both these predictions are unrealistic, so this model becomes 
unacceptable after a fairly short time interval. 

Constructing Mathematical Models. In applying differential equations to any of the nu- 
merous fields in which they are useful, it is necessary first to formulate the appropriate 
differential equation that describes, or models, the problem being investigated. In 
this section we have looked at two examples of this modeling process, one drawn 
from physics and the other from ecology. In constructing future mathematical mod- 
els yourself, you should recognize that each problem is different, and that successful 
modeling is not a skill that can be reduced to the observance of a set of prescribed 
rules. Indeed, constructing a satisfactory model is sometimes the most difficult part 
of the problem. Nevertheless, it may be helpful to list some steps that are often part 
of the process: 

1. Identify the independent and dependent variables and assign letters to represent them. 
Often the independent variable is time. 

2. Choose the units of measurement for each variable. In a sense the choice of units is 
arbitrary, but some choices may be much more convenient than others. For example, we 
chose to measure time in seconds for the falling-object problem and in months for the 
population problem. 

3. Articulate the basic principle that underlies or governs the problem you are investigating. 
This may be a widely recognized physical law, such as Newton’s law of motion, or it may be 
a more speculative assumption that may be based on your own experience or observations. 
In any case, this step is likely not to be a purely mathematical one, but will require you to 
be familiar with the field in which the problem originates. 

4. Express the principle or law in step 3 in terms of the variables you chose in step 1. This 
may be easier said than done. It may require the introduction of physical constants or 
parameters (such as the drag coefficient in Example 1 ) and the determination of appro- 
priate values for them. Or it may involve the use of auxiliary or intermediate variables 
that must then be related to the primary variables. 

5. Make sure that each term in your equation has the same physical units. If this is not the 
case, then your equation is wrong and you should seek to repair it. If the units agree, then 
your equation at least is dimensionally consistent, although it may have other shortcomings 
that this test does not reveal. 

6. In the problems considered here, the result of step 4 is a single differential equation, which 
constitutes the desired mathematical model. Keep in mind, though, that in more complex 
problems the resulting mathematical model may be much more complicated, perhaps 
involving a system of several differential equations, for example. 


PROBLEMS In each of Problems 1 through 6 draw a direction field for the given differential equation. 

Based on the direction field, determine the behavior of y as t — >■ oo. If this behavior depends 
on the initial value of y at t = 0, describe the dependency. 


41 

i. 

y = 3 - 2y 

41 

to 

II 

1 

u> 

41 

3. 

y' = 3 + 2y 

41 

4. y = -1 - 2 y 

41 

5. 

y' = l + 2y 

41 

6. y' = y + 2 
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In each of Problems 7 through 10 write down a differential equation of the form dy/dt = ay + b 
whose solutions have the required behavior as t — y oo. 

7. All solutions approach y = 3. 8. All solutions approach y = 2/3. 

9. All other solutions diverge from y = 2. 10. All other solutions diverge from y = 1/3. 

In each of Problems 11 through 14 draw a direction field for the given differential equation. 
Based on the direction field, determine the behavior of y as t — > oo. If this behavior depends 
on the initial value of y at t = 0, describe this dependency. Note that in these problems the 
equations are not of the form / = ay + b, and the behavior of their solutions is somewhat 
more complicated than for the equations in the text. 

= fll2.? = -y(.5-y) 

13. y=f 14. y = y(y - 2) 2 

Consider the following list of differential equations, some of which produced the direction 
fields shown in Figures 1.1.5 through 1.1.10. In each of Problems 15 through 20 identify the 
differential equation that corresponds to the given direction field. 


(a) / = 2y - 1 

(b) y = 2 + y 

(c) y =y-2 

(d) / = y(y + 3) 

(e) / = y(y - 3) 

(f) y = l + 2y 

(g) y' = —2 — y 

(h) y = y(3 - y) 

(i) y' — i — 2y 

0) y = 2 - y 


15. The direction field of Figure 1.1.5. 

16. The direction field of Figure 1.1.6. 

17. The direction field of Figure 1.1.7. 

18. The direction field of Figure 1.1.8. 

19. The direction field of Figure 1.1.9. 

20. The direction field of Figure 1.1.10. 

21. A pond initially contains 1,000,000 gal of water and an unknown amount of an undesirable 
chemical. Water containing 0.01 g of this chemical per gallon flows into the pond at a rate 
of 300 gal/h. The mixture flows out at the same rate, so the amount of water in the pond 
remains constant. Assume that the chemical is uniformly distributed throughout the pond. 

(a) Write a differential equation for the amount of chemical in the pond at any time. 

(b) How much of the chemical will be in the pond after a very long time? Does this 
limiting amount depend on the amount that was present initially? 

22. A spherical raindrop evaporates at a rate proportional to its surface area. Write a differ- 
ential equation for the volume of the raindrop as a function of time. 

23. Newton’s law of cooling states that the temperature of an object changes at a rate propor- 
tional to the difference between the temperature of the object itself and the temperature 
of its surroundings (the ambient air temperature in most cases). Suppose that the ambient 
temperature is 70°F and that the rate constant is 0.05 (min) -1 . Write a differential equa- 
tion for the temperature of the object at any time. Note that the differential equation is the 
same whether the temperature of the object is above or below the ambient temperature. 

24. A certain drug is being administered intravenously to a hospital patient. Fluid containing 
5 mg/cm 3 of the drug enters the patient’s bloodstream at a rate of 100 cm 3 /h. The drug 
is absorbed by body tissues or otherwise leaves the bloodstream at a rate proportional to 
the amount present, with a rate constant of 0.4 (h) -1 . 

(a) Assuming that the drug is always uniformly distributed throughout the bloodstream, 
write a differential equation for the amount of the drug that is present in the bloodstream 
at any time. 

(b) How much of the drug is present in the bloodstream after a long time? 
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FIGURE 1.1.5 Direction field for 
Problem 15. 



FIGURE 1.1.7 Direction field for 
Problem 17. 
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FIGURE 1.1.8 Direction field for 
Problem 18. 



FIGURE 1.1.9 Direction field for 
Problem 19. 



FIGURE 1.1.10 Direction field for 
Problem 20. 


25. For small, slowly falling objects, the assumption made in the text that the drag force is 
proportional to the velocity is a good one. For larger, more rapidly falling objects, it is 
more accurate to assume that the drag force is proportional to the square of the velocity. 2 

(a) Write a differential equation for the velocity of a falling object of mass m if the drag 
force is proportional to the square of the velocity. 

(b) Determine the limiting velocity after a long time. 


2 See Lyle N. Long and Howard Weiss, “The Velocity Dependence of Aerodynamic Drag: A Primer for 
Mathematicians,” American Mathematical Monthly 106 (1999), 2, pp. 127-135. 
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(c) If m = 10 kg, find the drag coefficient so that the limiting velocity is 49 m/s. 

(d) Using the data in part (c), draw a direction field and compare it with Figure 1.1.3. 


In each of Problems 26 through 33 draw a direction field for the given differential equation. 
Based on the direction field, determine the behavior of y as t — > oo. If this behavior depends 
on the initial value of y at t = 0, describe this dependency. Note that the right sides of these 
equations depend on t as well as y; therefore their solutions can exhibit more complicated 
behavior than those in the text. 


26. y = -2 + t-y 
28. y = e~* + y 
30. / = 3 sin t + 1 + y 
32. y = ~(2t + y)/2y 


27. / = te~ 2 ' - 2 y 
29. y' = t + 2y 
31. y' = 2t — 1 — y 2 

33 . y=\y-y-\t 2 


1.2 Solutions of Some Differential Equations 


In the preceding section we derived the differential equations 


and 


dv 

m— = mg — yv 
dt 



— k. 


a) 

( 2 ) 


Equation (1) models a falling object and Eq. (2) a population of field mice preyed 
on by owls. Both these equations are of the general form 

^ =ay-b, (3) 

where a and b are given constants. We were able to draw some important qualitative 
conclusions about the behavior of solutions of Eqs. (1) and (2) by considering the 
associated direction fields. To answer questions of a quantitative nature, however, 
we need to find the solutions themselves, and we now investigate how to do that. 


EXAMPLE 

1 

Field Mice 
and Owls 
(continued) 


Consider the equation 

— = 0.5p - 450, 
dt 


(4) 


which describes the interaction of certain populations of field mice and owls [see Eq. (8) of 
Section 1.1]. Find solutions of this equation. 

To solve Eq. (4), we need to find functions p(t) that, when substituted into the equation, 
reduce it to an obvious identity. Here is one way to proceed. First, rewrite Eq. (4) in the form 

dp _ p - 900 

dt 2 ’ ^ ; 


dp/dt 1 
p — 900 = 2' 


or, if p y 900, 


( 6 ) 
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By the chain rule the left side of Eq. (6) is the derivative of In | p — 900| with respect to f, so we 
have 

^ In |p — 900|= X ~. (7) 

Then, by integrating both sides of Eq. (7), we obtain 

In |p - 900 1 = r - + C, (8) 

where C is an arbitrary constant of integration. Therefore, by taking the exponential of both 
sides of Eq. (8), we find that 

|p - 900 1 = e (,/2)+c = e c e' /2 , (9) 


or 


p - 900 = ±e c e f/2 , 


(10) 


and finally 


p = 900 + ce‘ /2 . 


( 11 ) 


where c = ±e c is also an arbitrary (nonzero) constant. Note that the constant function p = 900 
is also a solution of Eq. (5) and that it is contained in the expression (11) if we allow c to take 
the value zero. Graphs of Eq. (11) for several values of c are shown in Figure 1.2.1. 



Note that they have the character inferred from the direction field in Figure 1.1.4. For 
instance, solutions lying on either side of the equilibrium solution p = 900 tend to diverge 
from that solution. 


In Example 1 we found infinitely many solutions of the differential equation (4), 
corresponding to the infinitely many values that the arbitrary constant c in Eq. (11) 
might have. This is typical of what happens when you solve a differential equa- 
tion. The solution process involves an integration, which brings with it an arbitrary 
constant, whose possible values generate an infinite family of solutions. 
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Frequently, we want to focus our attention on a single member of the infinite family 
of solutions by specifying the value of the arbitrary constant. Most often, we do this 
indirectly by specifying instead a point that must lie on the graph of the solution. For 
example, to determine the constant c in Eq. (11), we could require that the population 
have a given value at a certain time, such as the value 850 at time t = 0. In other 
words, the graph of the solution must pass through the point (0,850). Symbolically, 
we can express this condition as 

p( 0) = 850. (12) 

Then, substituting t = 0 and p = 850 into Eq. (11), we obtain 

850 = 900 + c. 


Hence c = —50, and by inserting this value in Eq. (11), we obtain the desired solution, 
namely, 

p = 900-50e r/2 . (13) 


The additional condition (12) that we used to determine c is an example of an initial 
condition. The differential equation (4) together with the initial condition (12) form 

an initial value problem. 

Now consider the more general problem consisting of the differential equation (3) 


and the initial condition 


dy , 

d, =ay - b 


y(0) = y 0 , 


(14) 


where yo is an arbitrary initial value. We can solve this problem by the same method 
as in Example 1. If a ^ 0 and y / b/a, then we can rewrite Eq. (3) as 

dy/dt 


y ~ (b/a) 

By integrating both sides, we find that 

In | y — (b/a) \ — at + C, 


(15) 

(16) 


where C is arbitrary. Then, taking the exponential of both sides of Eq. (16) and 
solving for y, we obtain 

y = (b/a) + ce at , (17) 

where c = ±e c is also arbitrary. Observe that c = 0 corresponds to the equilibrium 
solution y = b/a. Finally, the initial condition (14) requires that c — yo — (b/a), so 
the solution of the initial value problem (3), (14) is 

y = (b/a) + [yo - (b/a)]e at . (18) 


For a ^ 0 the expression (17) contains all possible solutions of Eq. (3) and is called 
the general solution . 3 The geometrical representation of the general solution (17) is 
an infinite family of curves called integral curves. Each integral curve is associated 
with a particular value of c and is the graph of the solution corresponding to that 


3 If a = 0, then the solution of Eq. (3) is not given by Eq. (17). We leave it to you to find the general 
solution in this case. 
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EXAMPLE 

2 

A Falling 
Object 
(continued) 


value of c. Satisfying an initial condition amounts to identifying the integral curve 
that passes through the given initial point. 

To relate the solution (18) to Eq. (2), which models the field mouse population, 
we need only replace a by the growth rate r and b by the predation rate k. Then the 
solution (18) becomes 

P = (k/r) + [po - (k/r)]e n , (19) 

where po is the initial population of field mice. The solution (19) confirms the con- 
clusions reached on the basis of the direction field and Example 1. If po = k/r. then 
from Eq. (19) it follows that p = k/r for all f; this is the constant, or equilibrium, 
solution. If po / k/r. then the behavior of the solution depends on the sign of the 
coefficient p 0 — {k/r) of the exponential term in Eq. (19). If po > k/r. then p grows 
exponentially with time ?; if po < k/r. then p decreases and eventually becomes zero, 
corresponding to extinction of the field mouse population. Negative values of p. 
while possible for the expression (19), make no sense in the context of this particular 
problem. 

To put the falling-object equation (1) in the form (3), we must identify a with —y/m 
and b with —g. Making these substitutions in the solution (18), we obtain 

v = (, mg/y ) + [v 0 - (/ ng/y)]e~ yt/m , (20) 

where vq is the initial velocity. Again, this solution confirms the conclusions reached 
in Section 1.1 on the basis of a direction field. There is an equilibrium, or constant, 
solution v = mg/y . and all other solutions tend to approach this equilibrium solution. 
The speed of convergence to the equilibrium solution is determined by the exponent 
—y/m. Thus, for a given mass m. the velocity approaches the equilibrium value more 
rapidly as the drag coefficient y increases. 


Suppose that, as in Example 2 of Section 1.1, we consider a falling object of mass m — 10 kg 
and drag coefficient y = 2 kg/s. Then the equation of motion (1) becomes 


$=9.8-i 
dt 5 


( 21 ) 


Suppose this object is dropped from a height of 300 m. Find its velocity at any time t. How 
long will it take to fall to the ground, and how fast will it be moving at the time of impact? 

The first step is to state an appropriate initial condition for Eq. (21). The word “dropped” in 
the statement of the problem suggests that the initial velocity is zero, so we will use the initial 
condition 

u(0) = 0. (22) 


The solution of Eq. (21) can be found by substituting the values of the coefficients into the 
solution (20), but we will proceed instead to solve Eq. (21) directly. First, rewrite the equation 
as 


dv/dt 1 
v — 49 = _ 5’ 


(23) 


By integrating both sides, we obtain 

In |v — 49| = ~ t - + C, (24) 

and then the general solution of Eq. (21) is 


- 1/5 


v = 49 4- ce 


(25) 
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where c is arbitrary. To determine c, we substitute t = 0 and v = 0 from the initial condition 
(22) into Eq. (25), with the result that c = —49. Then the solution of the initial value problem 
(21), (22) is 

v = 49(1 - e" ,/5 ). (26) 

Equation (26) gives the velocity of the falling object at any positive time (before it hits the 
ground, of course). 

Graphs of the solution (25) for several values of c are shown in Figure 1.2.2, with the solution 
(26) shown by the heavy curve. It is evident that all solutions tend to approach the equilibrium 
solution v = 49. This confirms the conclusions we reached in Section 1.1 on the basis of the 
direction fields in Figures 1.1.2 and 1.1.3. 



FIGURE 1.2.2 Graphs of the solution (25) for several values of c. 


To find the velocity of the object when it hits the ground, we need to know the time at 
which impact occurs. In other words, we need to determine how long it takes the object to fall 
300 m. To do this, we note that the distance x the object has fallen is related to its velocity v 
by the equation v = dx/dt, or 

d y 

— = 49(1 - e“ </5 ). (27) 

dt 

Consequently, by integrating both sides of Eq. (27), we have 

x = 49 1 + 245e- f/s + c, (28) 

where c is an arbitrary constant of integration. The object starts to fall when t = 0, so we know 
that x = 0 when t = 0. From Eq. (28) it follows that c = —245, so the distance the object has 
fallen at time t is given by 

x = 49 1 + 245e~ ,/s - 245. (29) 

Let T be the time at which the object hits the ground; thenx = 300 when t = T. By substituting 
these values in Eq. (29), we obtain the equation 


49T + 245e -r/s — 545 = 0. 


(30) 
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The value of T satisfying Eq. (30) can be approximated by a numerical process * * * 4 using a scientific 
calculator or computer, with the result that T = 10.51 s. At this time, the corresponding 
velocity v T is found from Eq. (26) to be v t = 43.01 m/s. The point (10.51,43.01) is also shown 
in Figure 1.2.2. 

Further Remarks on Mathematical Modeling. Up to this point we have related our discus- 
sion of differential equations to mathematical models of a falling object and of a 
hypothetical relation between field mice and owls. The derivation of these models 
may have been plausible, and possibly even convincing, but you should remember 
that the ultimate test of any mathematical model is whether its predictions agree 
with observations or experimental results. We have no actual observations or exper- 
imental results to use for comparison purposes here, but there are several sources of 
possible discrepancies. 

In the case of the falling object, the underlying physical principle (Newton’s law of 
motion) is well established and widely applicable. However, the assumption that the 
drag force is proportional to the velocity is less certain. Even if this assumption is 
correct, the determination of the drag coefficient y by direct measurement presents 
difficulties. Indeed, sometimes one finds the drag coefficient indirectly — for example, 
by measuring the time of fall from a given height and then calculating the value of y 
that predicts this observed time. 

The model of the field mouse population is subject to various uncertainties. The 
determination of the growth rate r and the predation rate k depends on observa- 
tions of actual populations, which may be subject to considerable variation. The 
assumption that r and k are constants may also be questionable. For example, a 
constant predation rate becomes harder to sustain as the field mouse population be- 
comes smaller. Further, the model predicts that a population above the equilibrium 
value will grow exponentially larger and larger. This seems at variance with the be- 
havior of actual populations; see the further discussion of population dynamics in 
Section 2.5. 

If the differences between actual observations and a mathematical model’s pre- 
dictions are too great, then you need to consider refining the model, making more 
careful observations, or perhaps both. There is almost always a tradeoff between ac- 
curacy and simplicity. Both are desirable, but a gain in one usually involves a loss in 
the other. However, even if a mathematical model is incomplete or somewhat inac- 
curate, it may nevertheless be useful in explaining qualitative features of the problem 
under investigation. It may also give satisfactory results under some circumstances 
but not others. Thus you should always use good judgment and common sense in 
constructing mathematical models and in using their predictions. 


PROBLEMS 1. Solve each of the following initial value problems and plot the solutions for several values 

of yo- Then describe in a few words how the solutions resemble, and differ from, each 

other. 

(a) dy/dt = -y + 5, y(0) = y 0 


4 A computer algebra system provides this capability; many calculators also have built-in routines for 
solving such equations. 
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(b) dy/dt = -2 y + 5, y(0) = y 0 

(c) dy/dt = -2 y + 10, y( 0) = y 0 

2. Follow the instructions for Problem 1 for the following initial value problems: 

(a) dy/dt =y- 5, v(0) = y Q 

(b) dy/dt = 2y-5, y( 0) = y 0 

(c) dy/dt = 2 y- 10, v(0) = y 0 

3. Consider the differential equation 

dy/dt = —ay + b , 

where both a and b are positive numbers. 

(a) Solve the differential equation. 

(b) Sketch the solution for several different initial conditions. 

(c) Describe how the solutions change under each of the following conditions: 

i. a increases. 

ii. b increases. 

iii. Both a and b increase, but the ratio b/a remains the same. 

4. Consider the differential equation dy/dt = ay — b. 

(a) Find the equilibrium solution y e . 

(b) Let Y(t) = y — y e : thus Y (t) is the deviation from the equilibrium solution. Find the 
differential equation satisfied by Y (t). 

5. Undetermined Coefficients. Flere is an alternative way to solve the equation 

dy/dt = ay — b. (i) 

(a) Solve the simpler equation 

dy/dt = ay. (ii) 

Call the solution yi(t). 

(b) Observe that the only difference between Eqs. (i) and (ii) is the constant — b in Eq. (i). 
Therefore it may seem reasonable to assume that the solutions of these two equations 
also differ only by a constant. Test this assumption by trying to find a constant k such that 
y = yi(t) + k is a solution of Eq. (i). 

(c) Compare your solution from part (b) with the solution given in the text in Eq. (17). 
Note: This method can also be used in some cases in which the constant b is replaced by a 
function g(f). It depends on whether you can guess the general form that the solution is 
likely to take. This method is described in detail in Section 3.5 in connection with second 
order equations. 

6. Use the method of Problem 5 to solve the equation 

dy/dt = —ay + b. 

7. The field mouse population in Example 1 satisfies the differential equation 

dp/dt = 0.5 p — 450. 

(a) Find the time at which the population becomes extinct if p( 0) = 850. 

(b) Find the time of extinction if p( 0) = po , where 0 < po < 900. 

(c) Find the initial population p 0 if the population is to become extinct in 1 year. 
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8. Consider a population p of field mice that grows at a rate proportional to the current 
population, so that dp/dt = rp. 

(a) Find the rate constant r if the population doubles in 30 days. 

(b) Find r if the population doubles in N days. 

9. The falling object in Example 2 satisfies the initial value problem 

dv/dt — 9.8 — (u/5), u(0) = 0. 

(a) Find the time that must elapse for the object to reach 98% of its limiting velocity. 

(b) Flow far does the object fall in the time found in part (a)? 

10. Modify Example 2 so that the falling object experiences no air resistance. 

(a) Write down the modified initial value problem. 

(b) Determine how long it takes the object to reach the ground. 

(c) Determine its velocity at the time of impact. 

11. Consider the falling object of mass 10 kg in Example 2, but assume now that the drag force 
is proportional to the square of the velocity. 

(a) If the limiting velocity is 49 m/s (the same as in Example 2), show that the equation 
of motion can be written as 

dv/dt = [(49) 2 - t> 2 ]/245. 

Also see Problem 25 of Section 1.1. 

(b) If u(0) = 0, find an expression for v{t) at any time. 

(c) Plot your solution from part (b) and the solution (26) from Example 2 on the same 
axes. 

(d) Based on your plots in part (c), compare the effect of a quadratic drag force with that 
of a linear drag force. 

(e) Find the distance x(t) that the object falls in time t. 

(f) Find the time T it takes the object to fall 300 m. 

12. A radioactive material, such as the isotope thorium-234, disintegrates at a rate proportional 
to the amount currently present. If(2(t)istheamountpresentattimet,thendQ/dt = —rQ, 
where r > 0 is the decay rate. 

(a) If 100 mg of thorium-234 decays to 82.04 mg in 1 week, determine the decay rate r. 

(b) Find an expression for the amount of thorium-234 present at any time t. 

(c) Find the time required for the thorium-234 to decay to one-half its original amount. 

13. The half-life of a radioactive material is the time required for an amount of this material 
to decay to one-half its original value. Show that for any radioactive material that decays 
according to the equation Q' — —rQ, the half-life r and the decay rate r satisfy the equation 
rx = In 2. 

14. Radium-226 has a half-life of 1620 years. Find the time period during which a given 
amount of this material is reduced by one-quarter. 

15. According to Newton’s law of cooling (see Problem 23 of Section 1.1), the temperature 
u(t) of an object satisfies the differential equation 

du/dt = —k(u — T), 

where T is the constant ambient temperature and k is a positive constant. Suppose that 
the initial temperature of the object is u( 0) = u 0 . 

(a) Find the temperature of the object at any time. 
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(b) Let r be the time at which the initial temperature difference uo — T has been reduced 
by half. Find the relation between k and r. 

16. Suppose that a building loses heat in accordance with Newton’s law of cooling (see Prob- 
lem 15) and that the rate constant k has the value 0.15 h -1 . Assume that the interior 
temperature is 70°F when the heating system fails. If the external temperature is 10°F, 
how long will it take for the interior temperature to fall to 32°F? 

17. Consider an electric circuit containing a capacitor, resistor, and battery; see Figure 1.2.3. 
The charge Q(t) on the capacitor satisfies the equation 5 


dQ Q 

R l + h v ' 


where R is the resistance, C is the capacitance, and V is the constant voltage supplied by 
the battery. 

(a) If Q( 0) = 0, find Q(t) at any time t, and sketch the graph of Q versus t. 

(b) Find the limiting value Q L that Q(t) approaches after a long time. 

(c) Suppose that Q(h) = <2l and that at time t = t\ the battery is removed and the circuit 
closed again. Find Q(t) for t > t\ and sketch its graph. 


1 / 


FIGURE 1.2.3 


R 



C 


The electric circuit of Problem 17. 


18. A pond containing 1,000,000 gal of water is initially free of a certain undesirable chemical 
(see Problem 21 of Section 1.1). Water containing 0.01 g/gal of the chemical flows into the 
pond at a rate of 300 gal/h, and water also flows out of the pond at the same rate. Assume 
that the chemical is uniformly distributed throughout the pond. 

(a) Let <2(0 be the amount of the chemical in the pond at time t. Write down an initial 
value problem for <2(0- 

(b) Solve the problem in part (a) for Q(t). How much chemical is in the pond after 1 
year? 

(c) At the end of 1 year the source of the chemical in the pond is removed; thereafter 
pure water flows into the pond, and the mixture flows out at the same rate as before. Write 
down the initial value problem that describes this new situation. 

(d) Solve the initial value problem in part (c). How much chemical remains in the pond 
after 1 additional year (2 years from the beginning of the problem)? 

(e) How long does it take for <2(0 to be reduced to 10 g? 

(f) Plot Q(t) versus t for 3 years. 

19. Your swimming pool containing 60,000 gal of water has been contaminated by 5 kg of 
a nontoxic dye that leaves a swimmer’s skin an unattractive green. The pool’s filtering 
system can take water from the pool, remove the dye, and return the water to the pool at 
a flow rate of 200 gal/min. 


5 This equation results from Kirchhoff' s laws, which are discussed in Section 3.7. 


1.3 Classification of Differential Equations 


19 


(a) Write down the initial value problem for the filtering process; let q(t) be the amount 
of dye in the pool at any time t. 

(b) Solve the problem in part (a). 

(c) You have invited several dozen friends to a pool party that is scheduled to begin in 4 h. 
You have also determined that the effect of the dye is imperceptible if its concentration 
is less than 0.02 g/gal. Is your filtering system capable of reducing the dye concentration 
to this level within 4 h? 

(d) Find the time T at which the concentration of dye first reaches the value 0.02 g/gal. 

(e) Find the flow rate that is sufficient to achieve the concentration 0.02 g/gal within 4 h. 


1.3 Classification of Differential Equations 


The main purpose of this book is to discuss some of the properties of solutions of 
differential equations, and to present some of the methods that have proved effective 
in finding solutions or, in some cases, approximating them. To provide a framework 
for our presentation, we describe here several useful ways of classifying differential 
equations. 


Ordinary and Partial Differential Equations. One important classification is based on 
whether the unknown function depends on a single independent variable or on sev- 
eral independent variables. In the first case, only ordinary derivatives appear in the 
differential equation, and it is said to be an ordinary differential equation. In the 
second case, the derivatives are partial derivatives, and the equation is called a partial 
differential equation. 

All the differential equations discussed in the preceding two sections are ordinary 
differential equations. Another example of an ordinary differential equation is 


, d 2 Q(t) 
dt 2 


+ R 


dQ(t) 
dt + 


1 

C 


2(0 = E(t), 


(l) 


for the charge 2(0 on a capacitor in a circuit with capacitance C, resistance R, and 
inductance L; this equation is derived in Section 3.7. Typical examples of partial 
differential equations are the heat conduction equation 


and the wave equation 


a 


2 3 2 u(x,t) 

dx 2 

d u(x, t) 

dt 

( 2 ) 

d 2 u(x, t) 

d 2 u{x , t ) 

( 3 ) 

dx 2 

dt 2 


Here, a 2 and a 2 are certain physical constants. Note that in both Eqs. (2) and (3) the 
dependent variable u depends on the two independent variables x and t. The heat 
conduction equation describes the conduction of heat in a solid body, and the wave 
equation arises in a variety of problems involving wave motion in solids or fluids. 


Systems of Differential Equations. Another classification of differential equations de- 
pends on the number of unknown functions that are involved. If there is a single 
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function to be determined, then one equation is sufficient. However, if there are two 
or more unknown functions, then a system of equations is required. For example, the 
Lotka-Volterra, or predator-prey, equations are important in ecological modeling. 
They have the form 


dx/dt = ax — axy 
dy/dt — —cy + yxy, 


(4) 


where x(t) and y(t) are the respective populations of the prey and predator species. 
The constants a,a,c, and y are based on empirical observations and depend on the 
particular species being studied. Systems of equations are discussed in Chapters 7 
and 9; in particular, the Lotka-Volterra equations are examined in Section 9.5. In 
some areas of application it is not unusual to encounter very large systems containing 
hundreds, or even many thousands, of equations. 


Order. The order of a differential equation is the order of the highest derivative that 
appears in the equation. The equations in the preceding sections are all first order 
equations, whereas Eq. (1) is a second order equation. Equations (2) and (3) are 
second order partial differential equations. More generally, the equation 

F[t,u(t),u'(t ), . . . ,n ( ” , (t)] = 0 (5) 


is an ordinary differential equation of the nth order. Equation (5) expresses a re- 
lation between the independent variable t and the values of the function u and its 
first n derivatives u', u " , . . . , u (n) . It is convenient and customary in differential equa- 
tions to write y for «(/), with y',y ", . . . ,y (M) standing for u'(t), u"(t ), . . . , u (n \t). Thus 
Eq. (5) is written as 

F(t,y, y M ) = 0. (6) 


For example, 


/" + 2e‘y" + yy' = t 4 


(7) 


is a third order differential equation for y = u(t). Occasionally, other letters will be 
used instead of t and y for the independent and dependent variables; the meaning 
should be clear from the context. 

We assume that it is always possible to solve a given ordinary differential equation 
for the highest derivative, obtaining 

y (n) =f(t,y,/, (8) 

We study only equations of the form (8). This is mainly to avoid the ambiguity 
that may arise because a single equation of the form (6) may correspond to several 
equations of the form (8). For example, the equation 

C/) 2 + ty' + 4y = 0 (9) 


leads to the two equations 

, —t + s/t 1 - 16 y 


—t — sjt 1 — 16y 


2 


or y' = 


2 


( 10 ) 
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Linear and Nonlinear Equations. A crucial classification of differential equations is 
whether they are linear or nonlinear. The ordinary differential equation 

F(t,y,y',...,yW) = 0 

is said to be linear if F is a linear function of the variables y,y', . . . ,y <n> ', a similar 
definition applies to partial differential equations. Thus the general linear ordinary 
differential equation of order n is 

a 0 (t)y w + ni(/)y (,1_1) H f a n (t)y = g(t). (11) 

Most of the equations you have seen thus far in this book are linear; examples are 
the equations in Sections 1.1 and 1.2 describing the falling object and the field mouse 
population. Similarly, in this section, Eq. (1) is a linear ordinary differential equation 
and Eqs. (2) and (3) are linear partial differential equations. An equation that is not 
of the form (11) is a nonlinear equation. Equation (7) is nonlinear because of the 
term yy’ . Similarly, each equation in the system (4) is nonlinear because of the terms 
that involve the product xy. 

A simple physical problem that leads to a nonlinear differential equation is the 
oscillating pendulum. The angle d that an oscillating pendulum of length L makes 
with the vertical direction (see Figure 1.3.1) satisfies the equation 

(jin p. 

^ + 1^ = 0. ,12, 

whose derivation is outlined in Problems 29 through 31. The presence of the term 
involving sind makes Eq. (12) nonlinear. 



mg 

FIGURE 1.3.1 An oscillating pendulum. 


The mathematical theory and methods for solving linear equations are highly de- 
veloped. In contrast, for nonlinear equations the theory is more complicated, and 
methods of solution are less satisfactory. In view of this, it is fortunate that many 
significant problems lead to linear ordinary differential equations or can be approx- 
imated by linear equations. For example, for the pendulum, if the angle 6 is small, 
then sind = 6 and Eq. (12) can be approximated by the linear equation 
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This process of approximating a nonlinear equation by a linear one is called lineariza- 
tion; it is an extremely valuable way to deal with nonlinear equations. Nevertheless, 
there are many physical phenomena that simply cannot be represented adequately 
by linear equations. To study these phenomena, it is essential to deal with nonlinear 
equations. 

In an elementary text it is natural to emphasize the simpler and more straight- 
forward parts of the subject. Therefore the greater part of this book is devoted to 
linear equations and various methods for solving them. However, Chapters 8 and 9, 
as well as parts of Chapter 2, are concerned with nonlinear equations. Whenever it 
is appropriate, we point out why nonlinear equations are, in general, more difficult 
and why many of the techniques that are useful in solving linear equations cannot 
be applied to nonlinear equations. 

Solutions. A solution of the ordinary differential equation (8) on the interval 
a < t < ft is a function <p such that </>' ,<p" , </> (n) exist and satisfy 

<p (n \t) (i4) 

for every t in a < t < ft. Unless stated otherwise, we assume that the function / 
of Eq. (8) is a real-valued function, and we are interested in obtaining real-valued 
solutions y = 0(f). 

Recall that in Section 1.2 we found solutions of certain equations by a process of 
direct integration. For instance, we found that the equation 

= 0.5p — 450 (15) 

dt 

has the solution 

p — 900 + ce t/2 , (16) 

where c is an arbitrary constant. It is often not so easy to find solutions of differential 
equations. However, if you find a function that you think may be a solution of a given 
equation, it is usually relatively easy to determine whether the function is actually a 
solution simply by substituting the function into the equation. For example, in this 
way it is easy to show that the function yq (f) = cos t is a solution of 

y" + y = o (17) 

for all t. To confirm this, observe that yj (t ) = — sin t and y'[ (t) = — cos f; then it follows 
that y"(t) + yi(f) = 0. In the same way you can easily show that yq (0 = sin t is also 
a solution of Eq. (17). Of course, this does not constitute a satisfactory way to solve 
most differential equations, because there are far too many possible functions for you 
to have a good chance of finding the correct one by a random choice. Nevertheless, 
you should realize that you can verify whether any proposed solution is correct by 
substituting it into the differential equation. This can be a very useful check; it is one 
that you should make a habit of considering. 

Some Important Questions. Although for the equations (15) and (17) we are able to 
verify that certain simple functions are solutions, in general we do not have such 
solutions readily available. Thus a fundamental question is the following: Does an 
equation of the form (8) always have a solution? The answer is “No.” Merely writing 
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down an equation of the form (8) does not necessarily mean that there is a function 
y — (p(t) that satisfies it. So, how can we tell whether some particular equation has a 
solution? This is the question of existence of a solution, and it is answered by theorems 
stating that under certain restrictions on the function/ in Eq. (8), the equation always 
has solutions. This is not a purely mathematical concern for at least two reasons. If 
a problem has no solution, we would prefer to know that fact before investing time 
and effort in a vain attempt to solve the problem. Further, if a sensible physical 
problem is modeled mathematically as a differential equation, then the equation 
should have a solution. If it does not, then presumably there is something wrong 
with the formulation. In this sense an engineer or scientist has some check on the 
validity of the mathematical model. 

If we assume that a given differential equation has at least one solution, the ques- 
tion arises as to how many solutions it has, and what additional conditions must be 
specified to single out a particular solution. This is the question of uniqueness. In 
general, solutions of differential equations contain one or more arbitrary constants 
of integration, as does the solution (16) of Eq. (15). Equation (16) represents an in- 
finity of functions corresponding to the infinity of possible choices of the constant c. 
As we saw in Section 1.2, if p is specified at some time t, this condition will determine 
a value for c; even so, we have not yet ruled out the possibility that there may be 
other solutions of Eq. (15) that also have the prescribed value of p at the prescribed 
time t. The issue of uniqueness also has practical implications. If we are fortunate 
enough to find a solution of a given problem, and if we know that the problem has a 
unique solution, then we can be sure that we have completely solved the problem. If 
there may be other solutions, then perhaps we should continue to search for them. 

A third important question is: Given a differential equation of the form (8), can 
we actually determine a solution, and if so, how? Note that if we find a solution of the 
given equation, we have at the same time answered the question of the existence of 
a solution. However, without knowledge of existence theory we might, for example, 
use a computer to find a numerical approximation to a “solution” that does not exist. 
On the other hand, even though we may know that a solution exists, it may be that the 
solution is not expressible in terms of the usual elementary functions — polynomial, 
trigonometric, exponential, logarithmic, and hyperbolic functions. Unfortunately, 
this is the situation for most differential equations. Thus, we discuss both elemen- 
tary methods that can be used to obtain exact solutions of certain relatively simple 
problems, and also methods of a more general nature that can be used to find ap- 
proximations to solutions of more difficult problems. 

Computer Use in Differential Equations. A computer can be an extremely valuable tool 
in the study of differential equations. For many years computers have been used 
to execute numerical algorithms, such as those described in Chapter 8, to construct 
numerical approximations to solutions of differential equations. These algorithms 
have been refined to an extremely high level of generality and efficiency. A few lines 
of computer code, written in a high-level programming language and executed (often 
within a few seconds) on a relatively inexpensive computer, suffice to approximate to 
a high degree of accuracy the solutions of a wide range of differential equations. More 
sophisticated routines are also readily available. These routines combine the ability 
to handle very large and complicated systems with numerous diagnostic features that 
alert the user to possible problems as they are encountered. 
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The usual output from a numerical algorithm is a table of numbers, listing selected 
values of the independent variable and the corresponding values of the dependent 
variable. With appropriate software it is easy to display the solution of a differential 
equation graphically, whether the solution has been obtained numerically or as the 
result of an analytical procedure of some kind. Such a graphical display is often 
much more illuminating and helpful in understanding and interpreting the solution 
of a differential equation than a table of numbers or a complicated analytical for- 
mula. There are on the market several well-crafted and relatively inexpensive special- 
purpose software packages for the graphical investigation of differential equations. 
The widespread availability of personal computers has brought powerful computa- 
tional and graphical capability within the reach of individual students. You should 
consider, in the light of your own circumstances, how best to take advantage of the 
available computing resources. You will surely find it enlightening to do so. 

Another aspect of computer use that is very relevant to the study of differential 
equations is the availability of extremely powerful and general software packages 
that can perform a wide variety of mathematical operations. Among these are Maple, 
Mathematica, and MATLAB, each of which can be used on various kinds of personal 
computers or workstations. All three of these packages can execute extensive nu- 
merical computations and have versatile graphical facilities. Maple and Mathematica 
also have very extensive analytical capabilities. For example, they can perform the 
analytical steps involved in solving many differential equations, often in response to 
a single command. Anyone who expects to deal with differential equations in more 
than a superficial way should become familiar with at least one of these products and 
explore the ways in which it can be used. 

For you, the student, these computing resources have an effect on how you should 
study differential equations. To become confident in using differential equations, it 
is essential to understand how the solution methods work, and this understanding is 
achieved, in part, by working out a sufficient number of examples in detail. Flowever, 
eventually you should plan to delegate as many as possible of the routine (often 
repetitive) details to a computer, while you focus on the proper formulation of the 
problem and on the interpretation of the solution. Our viewpoint is that you should 
always try to use the best methods and tools available for each task. In particular, 
you should strive to combine numerical, graphical, and analytical methods so as to 
attain maximum understanding of the behavior of the solution and of the underlying 
process that the problem models. You should also remember that some tasks can 
best be done with pencil and paper, while others require a calculator or computer. 
Good judgment is often needed in selecting a judicious combination. 

PROBLEMS 

In each of Problems 1 through 6 determine the order of the given differential equation; also 
state whether the equation is linear or nonlinear. 

+ = 2. , 1 + ^+4+,- e' 

dt 2 dt ' - dt 2 dt 

+ ^ + ^ + 4 . $ + ,/ = o 

dr dt 3 dt 2 dt dt 

„ d 2 y . . „ d 3 y dy ~ , 

5. —r + sin (t + y) = sin t 6. + t — + (cos 2 t)y = t 3 

dt A dr dt 


6 . 
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In each of Problems 7 through 14 verify that each given function is a solution of the differ- 
ential equation. 

7. y" - y = 0; yi(f) = e', y 2 (t) = cosh t 

8. y" + 2/ - 3y = 0; yi(t) = e" 3 ', y 2 (0 = e' 

9. ty -y = t 2 ; y = 3t+t 2 

10. y"" + 4y"' + 3 y = t; yi(f) = t/3, y 2 {t) = e~‘ + t/3 

11. 2t 2 y" + 3ty' - y = 0, t> 0; y r (t) = t 1 / 2 , y 2 (0 = f _1 

12. f 2 y" + 5iy' + 4y = 0, t > 0; y l {t) = t~ 2 , y 2 (t) = t~ 2 \nt 

13. y" + y = seer, 0<t<n/2\ y = (cos f ) In cos f + f sin t 

14. y' — 2fy = 1; y = e' 2 f e~ sl ds + e' 2 

Jo 

In each of Problems 15 through 18 determine the values of r for which the given differential 
equation has solutions of the form y = e rt . 

15. y + 2y = 0 16. y" - y = 0 

17. y" + y - 6y = 0 18. y'" - 3y" + 2y' = 0 

In each of Problems 19 and 20 determine the values of r for which the given differential 
equation has solutions of the form y = f for t > 0. 

19. t 2 y" + 4ty' + 2y = 0 20. t 2 y" - 4?y' + 4y = 0 

In each of Problems 21 through 24 determine the order of the given partial differential equa- 
tion; also state whether the equation is linear or nonlinear. Partial derivatives are denoted by 
subscripts. 

21. U xx + llyy + U zz = 0 22. ll xx + llyy + Ull x + Lilly + ll = 0 

23. U xx xx 4“ 2 1 l xx yy I - Hyyyy ~~ 0 24. ll( 4- llll x = 1 4“ llxx 

In each of Problems 25 through 28 verify that each given function is a solution of the given 
partial differential equation. 

25. u xx 4- Uyy — 0; u\(x,y) = cos. x coshy, u 2 (x,y) = ln(x 2 4- y 2 ) 

26. a 2 u xx = u t \ ui(x,t) = e~“ 2 ' sinx, u 2 (x,t) = e~“ 2 ^ t sinAjc, X a real constant 

27. a 2 u xx = u tt ; u\ (x, t) = sin Xx sin Xat, u 2 (x, t) = sin(x — at), X a real constant 

28. a 2 u xx = lit', ii = (tt / f) 1/2 e _j:2/4 “ 2f , t > 0 

29. Follow the steps indicated here to derive the equation of motion of a pendulum, Eq. (12) 
in the text. Assume that the rod is rigid and weightless, that the mass is a point mass, and 
that there is no friction or drag anywhere in the system. 

(a) Assume that the mass is in an arbitrary displaced position, indicated by the angle 9. 
Draw a free-body diagram showing the forces acting on the mass. 

(b) Apply Newton’s law of motion in the direction tangential to the circular arc on which 
the mass moves. Then the tensile force in the rod does not enter the equation. Observe 
that you need to find the component of the gravitational force in the tangential direc- 
tion. Observe also that the linear acceleration, as opposed to the angular acceleration, is 
Ld 2 9/dt 2 , where L is the length of the rod. 

(c) Simplify the result from part (b) to obtain Eq. (12) in the text. 

30. Another way to derive the pendulum equation (12) is based on the principle of conserva- 
tion of energy. 

(a) Show that the kinetic energy T of the pendulum in motion is 
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(b) Show that the potential energy V of the pendulum, relative to its rest position, is 

V = mgL (1 — cost?). 

(c) By the principle of conservation of energy, the total energy E = T + V is constant. 
Calculate dE/dt , set it equal to zero, and show that the resulting equation reduces to 
Eq. (12). 

31. A third derivation of the pendulum equation depends on the principle of angular momen- 
tum: the rate of change of angular momentum about any point is equal to the net external 
moment about the same point. 

(a) Show that the angular momentum M, or moment of momentum, about the point of 
support is given by M = mL 2 dd/dt. 

(b) Set dM/dt equal to the moment of the gravitational force, and show that the resulting 
equation reduces to Eq. (12). Note that positive moments are counterclockwise. 


1.4 Historical Remarks 


Without knowing something about differential equations and methods of solving 
them, it is difficult to appreciate the history of this important branch of mathematics. 
Further, the development of differential equations is intimately interwoven with the 
general development of mathematics and cannot be separated from it. Nevertheless, 
to provide some historical perspective, we indicate here some of the major trends in 
the history of the subject and identify the most prominent early contributors. Other 
historical information is contained in footnotes scattered throughout the book and 
in the references listed at the end of the chapter. 

The subject of differential equations originated in the study of calculus by Isaac 
Newton (1642-1727) and Gottfried Wilhelm Leibniz (1646-1716) in the seventeenth 
century. Newton grew up in the English countryside, was educated at Trinity Col- 
lege, Cambridge, and became Lucasian Professor of Mathematics there in 1669. His 
epochal discoveries of calculus and of the fundamental laws of mechanics date from 
1665. They were circulated privately among his friends, but Newton was extremely 
sensitive to criticism and did not begin to publish his results until 1687 with the ap- 
pearance of his most famous book, Philosophiae Naturalis Principia Mathematica. 
Although Newton did relatively little work in differential equations as such, his devel- 
opment of the calculus and elucidation of the basic principles of mechanics provided 
a basis for their applications in the eighteenth century, most notably by Euler. New- 
ton classified first order differential equations according to the forms dy/dx = f(x), 
dy/dx = f(y), and dy/dx = f(x,y). For the latter equation he developed a method 
of solution using infinite series when f(x,y ) is a polynomial in x and y. Newton’s 
active research in mathematics ended in the early 1690s, except for the solution of 
occasional “challenge problems” and the revision and publication of results obtained 
much earlier. He was appointed Warden of the British Mint in 1696 and resigned his 
professorship a few years later. He was knighted in 1705 and, upon his death, was 
buried in Westminster Abbey. 

Leibniz was born in Leipzig and completed his doctorate in philosophy at the 
age of 20 at the University of Altdorf. Throughout his life he engaged in scholarly 
work in several different fields. He was mainly self-taught in mathematics, since 
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his interest in this subject developed when he was in his twenties. Leibniz arrived 
at the fundamental results of calculus independently, although a little later than 
Newton, but was the first to publish them, in 1684. Leibniz was very conscious 
of the power of good mathematical notation, and was responsible for the notation 
dy/dx for the derivative and for the integral sign. He discovered the method of 
separation of variables (Section 2.2) in 1691 , the reduction of homogeneous equations 
to separable ones (Section 2.2, Problem 30) in 1691, and the procedure for solving 
first order linear equations (Section 2.1) in 1694. He spent his life as ambassador 
and adviser to several German royal families, which permitted him to travel widely 
and to carry on an extensive correspondence with other mathematicians, especially 
the Bernoulli brothers. In the course of this correspondence many problems in 
differential equations were solved during the latter part of the seventeenth century. 

The brothers Jakob (1654-1705) and Johann (1667-1748) Bernoulli of Basel did 
much to develop methods of solving differential equations and to extend the range 
of their applications. Jakob became professor of mathematics at Basel in 1687, and 
Johann was appointed to the same position upon his brother’s death in 1705. Both 
men were quarrelsome, jealous, and frequently embroiled in disputes, especially with 
each other. Nevertheless, both also made significant contributions to several areas of 
mathematics. With the aid of calculus, they solved a number of problems in mechanics 
by formulating them as differential equations. For example, Jakob Bernoulli solved 
the differential equation y' = [a 3 /(b 2 y — n 3 )] 1/2 in 1690 and in the same paper first 
used the term “integral” in the modern sense. In 1694 Johann Bernoulli was able 
to solve the equation dy/dx = y/ax. One problem which both brothers solved, and 
which led to much friction between them, was the brachistochrone problem (see 
Problem 32 of Section 2.3). The brachistochrone problem was also solved by Leibniz, 
Newton, and the Marquis de L'Hospital. It is said, perhaps apocryphally, that Newton 
learned of the problem late in the afternoon of a tiring day at the Mint and solved it 
that evening after dinner. He published the solution anonymously, but upon seeing 
it, Johann Bernoulli exclaimed, “Ah, I know the lion by his paw.” 

Daniel Bernoulli (1700-1782), son of Johann, migrated to St. Petersburg as a young 
man to join the newly established St. Petersburg Academy but returned to Basel in 
1733 as professor of botany and, later, of physics. His interests were primarily in 
partial differential equations and their applications. For instance, it is his name that 
is associated with the Bernoulli equation in fluid mechanics. He was also the first 
to encounter the functions that a century later became known as Bessel functions 
(Section 5.7). 

The greatest mathematician of the eighteenth century, Leonhard Euler (1707- 
1783), grew up near Basel and was a student of Johann Bernoulli. He followed his 
friend Daniel Bernoulli to St. Petersburg in 1727. For the remainder of his life he 
was associated with the St. Petersburg Academy (1727-1741 and 1766-1783) and 
the Berlin Academy (1741-1766). Euler was the most prolific mathematician of 
all time; his collected works fill more than 70 large volumes. His interests ranged 
over all areas of mathematics and many fields of application. Even though he was 
blind during the last 17 years of his life, his work continued undiminished until the 
very day of his death. Of particular interest here is his formulation of problems 
in mechanics in mathematical language and his development of methods of solving 
these mathematical problems. Lagrange said of Euler’s work in mechanics, “The first 
great work in which analysis is applied to the science of movement.” Among other 
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things, Euler identified the condition for exactness of first order differential equations 
(Section 2.6) in 1734-35, developed the theory of integrating factors (Section 2.6) in 
the same paper, and gave the general solution of homogeneous linear equations with 
constant coefficients (Sections 3.1, 3.3, 3.4, and 4.2) in 1743. He extended the latter 
results to nonhomogeneous equations in 1750-51. Beginning about 1750, Euler made 
frequent use of power series (Chapter 5) in solving differential equations. He also 
proposed a numerical procedure (Sections 2.7 and 8.1) in 1768-69, made important 
contributions in partial differential equations, and gave the first systematic treatment 
of the calculus of variations. 

Joseph-Louis Lagrange (1736-1813) became professor of mathematics in his na- 
tive Turin at the age of 19. He succeeded Euler in the chair of mathematics at the 
Berlin Academy in 1766 and moved on to the Paris Academy in 1787. He is most fa- 
mous for his monumental work Mecanique analytique , published in 1788, an elegant 
and comprehensive treatise of Newtonian mechanics. With respect to elementary 
differential equations, Lagrange showed in 1762-65 that the general solution of an 
nth order linear homogeneous differential equation is a linear combination of n in- 
dependent solutions (Sections 3.2 and 4.1). Later, in 1774-75, he gave a complete 
development of the method of variation of parameters (Sections 3.6 and 4.4). La- 
grange is also known for fundamental work in partial differential equations and the 
calculus of variations. 

Pierre-Simon de Laplace (1749-1827) lived in Normandy as a boy but came to 
Paris in 1768 and quickly made his mark in scientific circles, winning election to the 
Academie des Sciences in 1773. He was preeminent in the field of celestial mechanics; 
his greatest work, Trade de mecanique celeste , was published in five volumes between 
1799 and 1825. Laplace’s equation is fundamental in many branches of mathematical 
physics, and Laplace studied it extensively in connection with gravitational attraction. 
The Laplace transform (Chapter 6) is also named for him, although its usefulness in 
solving differential equations was not recognized until much later. 

By the end of the eighteenth century many elementary methods of solving ordinary 
differential equations had been discovered. In the nineteenth century interest turned 
more toward the investigation of theoretical questions of existence and uniqueness 
and to the development of less elementary methods such as those based on power 
series expansions (see Chapter 5). These methods find their natural setting in the 
complex plane. Consequently, they benefitted from, and to some extent stimulated, 
the more or less simultaneous development of the theory of complex analytic func- 
tions. Partial differential equations also began to be studied intensively, as their 
crucial role in mathematical physics became clear. In this connection a number of 
functions, arising as solutions of certain ordinary differential equations, occurred re- 
peatedly and were studied exhaustively. Known collectively as higher transcendental 
functions, many of them are associated with the names of mathematicians, including 
Bessel, Legendre, Hermite, Chebyshev, and Hankel, among others. 

The numerous differential equations that resisted solution by analytical means led 
to the investigation of methods of numerical approximation (see Chapter 8). By 
1900 fairly effective numerical integration methods had been devised, but their im- 
plementation was severely restricted by the need to execute the computations by 
hand or with very primitive computing equipment. In the last 60 years the devel- 
opment of increasingly powerful and versatile computers has vastly enlarged the 
range of problems that can be investigated effectively by numerical methods. Ex- 
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tremely refined and robust numerical integrators were developed during the same 
period and are readily available. Versions appropriate for personal computers have 
brought the ability to solve a great many significant problems within the reach of 
individual students. 

Another characteristic of differential equations in the twentieth century was the 
creation of geometrical or topological methods, especially for nonlinear equations. 
The goal is to understand at least the qualitative behavior of solutions from a geo- 
metrical, as well as from an analytical, point of view. If more detailed information is 
needed, it can usually be obtained by using numerical approximations. An introduc- 
tion to geometrical methods appears in Chapter 9. 

Within the past few years these two trends have come together. Computers, and 
especially computer graphics, have given a new impetus to the study of systems of non- 
linear differential equations. Unexpected phenomena (Section 9.8), such as strange 
attractors, chaos, and fractals, have been discovered, are being intensively studied, 
and are leading to important new insights in a variety of applications. Although it is 
an old subject about which much is known, differential equations in the twenty-first 
century remains a fertile source of fascinating and important unsolved problems. 


Computer software for differential equations changes too fast for particulars to be given in a book such 
as this. A Google search for Maple, Mathematica, or MATLAB is a good way to begin if you need 
information about one of these computer algebra systems. 

For further reading in the history of mathematics, see books such as those listed below: 

Boyer, C. B., and Merzbach, U. C ,,A History of Mathematics (2nd ed.) (New York: Wiley, 1989). 

Kline, M„ Mathematical Thought from Ancient to Modern Times (New York: Oxford University Press, 
1972). 

A useful historical appendix on the early development of differential equations appears in 
Ince, E. L., Ordinary Differential Equations (London: Longmans, Green, 1927; New York: Dover, 1956). 

An encyclopedic source of information about the lives and achievements of mathematicians of the 
past is 

Gillespie, C. C., ed.. Dictionary of Scientific Biography (15 vols.) (New York: Scribner’s, 1971). 

Much historical information can be found on the Internet. One excellent site is 
www-history.mcs.st-and.ac.uk/Bioglndex.html 

created by John J. O’Connor and Edmund F. Robertson, Department of Mathematics and Statistics, 
University of St. Andrews, Scotland. 



CHAPTER 


First Order 
Differential Equations 


This chapter deals with differential equations of first order 

f=/«.y>, (i) 

where / is a given function of two variables. Any differentiable function y — cp{t) 
that satisfies this equation for all t in some interval is called a solution, and our 
object is to determine whether such functions exist and, if so, to develop methods 
for finding them. Unfortunately, for an arbitrary function /, there is no general 
method for solving the equation in terms of elementary functions. Instead, we will 
describe several methods, each of which is applicable to a certain subclass of first 
order equations. The most important of these are linear equations (Section 2.1), 
separable equations (Section 2.2), and exact equations (Section 2.6). Other sections 
of this chapter describe some of the important applications of first order differential 
equations, introduce the idea of approximating a solution by numerical computation, 
and discuss some theoretical questions related to the existence and uniqueness of 
solutions. The final section includes an example of chaotic solutions in the context of 
first order difference equations, which have some important points of similarity with 
differential equations and are simpler to investigate. 


2.1 Linear Equations; Method of Integrating Factors 

If the function/ in Eq. (1) depends linearly on the dependent variable y, then Eq. (1) 
is called a first order linear equation. In Sections 1.1 and 1.2 we discussed a re- 
stricted type of first order linear equation in which the coefficients are constants. 
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EXAMPLE 

1 


A typical example is 

^ = -ay + b, (2) 

where a and b are given constants. Recall that an equation of this form describes the 
motion of an object falling in the atmosphere. Now we want to consider the most 
general first order linear equation, which is obtained by replacing the coefficients a 
and b in Eq. (2) by arbitrary functions of t. We will usually write the general first 
order linear equation in the standard form 


~^+p(t)y = g(t), (3) 

where p and g are given functions of the independent variable t. 

Equation (2) can be solved by the straightforward integration method introduced 
in Section 1.2. That is, provided that a / 0 and y ^ b/a, we rewrite the equation as 


dy/dt 
y - (b/a) 


(4) 


Then, by integration we obtain 


In | y — {b/a) \ = —at + C, 


from which it follows that the general solution of Eq. (2) is 

y = (b/a) + ce~ at , 


(5) 


where c is an arbitrary constant. 

Unfortunately, this direct method of solution cannot be used to solve the general 
equation (3), so we need to use a different method of solution for it. We owe this 
method to Leibniz; it involves multiplying the differential equation (3) by a certain 
function p(t), chosen so that the resulting equation is readily integrable. The function 
p.(f) is called an integrating factor, and the main difficulty is to determine how to find 
it. We will introduce this method in a simple example, later showing how to extend 
it to other first order linear equations, including the general equation (3). 


Solve the differential equation 


dy_ 

dt 


-± + b= ¥' 3 - 


( 6 ) 


Plot several solutions, and find the particular solution whose graph contains the point (0, 1). 
The first step is to multiply Eq. (6) by a function /r(f), as yet undetermined; thus 


p(t) ^ + \d(t)y = j/r(f)e' /3 . 


(V) 


The question now is whether we can choose /r(f) so that the left side of Eq. (7) is recognizable 
as the derivative of some particular expression. If so, then we can integrate Eq. (7), even 
though we do not know the function y. To guide the choice of the integrating factor p(t), ask 
yourself where in calculus you have seen an expression containing a term such as p.(t)dy/dt. 
You are on the right track if this reminds you of the product rule for differentiation. Thus, let 
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us try to determine /x(t) so that the left side of Eq. (7) is the derivative of the expression 
Comparing the left side of Eq. (7) with the differentiation formula 


d dy da(t) 

= ^ + JL r y, 

we observe that they are identical, provided we choose /x(f) to satisfy 

d/j,(t) 


dt 


= 5 ^( 0 - 


( 8 ) 


( 9 ) 


Therefore our search for an integrating factor will be successful if we can find a solution of 
Eq. (9). Perhaps you can readily identify a function that satisfies Eq. (9): what well-known 
function from calculus has a derivative that is equal to one -half times the original function? 
More systematically, rewrite Eq. (9) as 



d[x(t)/dt j 

m ~ 2 ’ 

(10) 

which is equivalent to 

i = 2- 

(11) 

Then it follows that 

In |/x(t)l = \t + C, 

(12) 

or 

O 

II 

'S 

(13) 


The function fi(t) given by Eq. (13) is an integrating factor for Eq. (6). Since we do not need 
the most general integrating factor, we will choose c to be one in Eq. (13) and use /x(f) = e ,/2 . 
Now we return to Eq. (6), multiply it by the integrating factor e t/2 , and obtain 


e 1 ' 2 


< ty_ + V/2 v _ Vf/6 
dt +2 y ~ 2 ' 


(14) 


By the choice we have made of the integrating factor, the left side of Eq. (14) is the derivative 
of e' /2 y, so that Eq. (14) becomes 

±-y 2 y)=\eW. (15) 

By integrating both sides of Eq. (15), we obtain 

e ,,2 y = |e 5,/6 + c, (16) 


where c is an arbitrary constant. Finally, on solving Eq. (16) for y, we have the general solution 
of Eq. (6), namely, 


y= fe'/s + ce-*/ 2 . 


(17) 


To find the solution passing through the point (0, 1), we set t = 0 and y = 1 in Eq. (17), 
obtaining 1 = (3/5) + c. Thus c = 2/5, and the desired solution is 

y = |e f/3 + \e~‘ 12 . (18) 


Figure 2.1.1 includes the graphs of Eq. (17) for several values of c with a direction field in 
the background. The solution passing through (0, 1) is shown by the heavy curve. 
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FIGURE 2.1.1 Integral curves of/ + \y = (e' /3 . 


Let us now extend the method of integrating factors to equations of the form 


dy 

S -+»3' = S(0, 


(19) 


where a is a given constant and g(t) is a given function. Proceeding as in Example 1, 
we find that the integrating factor pit) must satisfy 


dp 

dt 


= ap. 


( 20 ) 


rather than Eq. (9). Thus the integrating factor is p(t) = e‘" . Multiplying Eq. (19) by 
p(t), we obtain 

e atd f + aeat y = e at g(t), 
at 

or 

j t (e at y) = e a ‘g{t). (21) 

By integrating both sides of Eq. (21), we find that 

e a ‘y = J e at g(t ) dt + c, (22) 

where c is an arbitrary constant. For many simple functions g(t) we can evaluate the 
integral in Eq. (22) and express the solution y in terms of elementary functions, as 
in Example 1. However, for more complicated functions g(t), it is necessary to leave 
the solution in integral form. In this case 

y = e~ at f e as g(s) ds + ce~ at . 

Jto 


( 23 ) 
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Note that in Eq. (23) we have used s to denote the integration variable to distinguish 
it from the independent variable t, and we have chosen some convenient value to as 
the lower limit of integration. 


EXAMPLE 

2 


e~ 2 ‘ — - 2e~ 2, y = 4 e - 2 ' - te~ 2 \ (25) 

dt 

or 

-r(e~ 2 'y) = 4e~ 2 ' — te~ 2 ‘ . (26) 

dt 

Then, by integrating both sides of this equation, we have 

e~ 2t y = —2e~ 2t + \t e~ 2t + \e~ 2t + c, 

where we have used integration by parts on the last term in Eq. (26). Thus the general solution 
of Eq. (24) is 

y = -l + \t + ce 21 . (27) 

A direction field and graphs of the solution (27) for several values of c are shown in Figure 
2.1.2. The behavior of the solution for large values of t is determined by the term ce 2 '. If c ^ 0, 
then the solution grows exponentially large in magnitude, with the same sign as c itself. Thus 
the solutions diverge as t becomes large. The boundary between solutions that ultimately grow 
positively from those that ultimately grow negatively occurs when c = 0. If we substitute c = 0 
into Eq. (27) and then set t = 0, we find that y = — 7/4 is the separation point on the y-axis. 
Note that, for this initial value, the solution is y = — ^ + If; it grows positively, but linearly 
rather than exponentially. 


Solve the differential equation 

§ - 2y = 4 - t (24) 

and plot the graphs of several solutions. Discuss the behavior of solutions as t -> oo. 

Equation (24) is of the form (19) with a — —2; therefore the integrating factor is /r(t) = e~ 2 '. 
Multiplying the differential equation (24) by /x(f), we obtain 



FIGURE 2.1.2 Integral curves of y' — 2y = 4 — t. 
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Now we return to the general first order linear equation (3) 

dy 

— + p(t)y = g(t), 

where p and g are given functions. To determine an appropriate integrating factor, 
we multiply Eq. (3) by an as yet undetermined function pit), obtaining 

+P(t)p(t)y = p(t)g(t). (28) 


Following the same line of development as in Example 1, we see that the left side 
of Eq. (28) is the derivative of the product pit)y, provided that p(t) satisfies the 
equation 

= p(t)ji(t). (29) 

dt 

If we assume temporarily that pit) is positive, then we have 


dpit)/dt 

d(t) 


= P(t), 


and consequently 



pit) dt + k. 


By choosing the arbitrary constant k to be zero, we obtain the simplest possible 
function for p, namely, 

pit) — exp / pit) dt. (30) 


Note that p (t) is positive for all t , as we assumed. Returning to Eq. (28), we have 


^[M(0y] = P-it)git). (31) 

Hence 

dit) y = J dit)git) dt + c, (32) 


where c is an arbitrary constant. Sometimes the integral in Eq.(32) can be evaluated 
in terms of elementary functions. However, in general this is not possible, so the 
general solution of Eq. (3) is 


1 

pit) 


f 

J tf) 


pis)g(s) ds + c 


(33) 


where again t 0 is some convenient lower limit of integration. Observe that Eq. (33) 
involves two integrations, one to obtain p(t) from Eq. (30) and the other to determine 
y from Eq. (33). 
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EXAMPLE 

3 


Solve the initial value problem 

ty' + 2y = 4t 2 , (34) 

y( 1) = 2. (35) 

In order to determine p(t ) and g(t) correctly, we must first rewrite Eq. (34) in the standard 
form (3). Thus we have 

/ + (2/t)y = At, (36) 

so p(t ) = lit and g(t) = At. To solve Eq. (36), we first compute the integrating factor p.(t): 

p.(t) — exp J y dt = e 2ln|(| = t 2 . 

On multiplying Eq. (36) by p(t) = t 2 , we obtain 

t 2 y' + 2ty = (t 2 y)' = At 3 , 

and therefore 

t 2 y = t A + C, 

where c is an arbitrary constant. It follows that 

y = t 2 + J ( 37 ) 

is the general solution of Eq. (34). Integral curves of Eq. (34) for several values of c are shown 
in Figure 2.1.3. To satisfy the initial condition (35), it is necessary to choose c = 1; thus 

y = t 2 +^, t > 0 (38) 

is the solution of the initial value problem (34), (35). This solution is shown by the heavy curve 
in Figure 2.1.3. Note that it becomes unbounded and is asymptotic to the positive y-axis as 
t -> 0 from the right. This is the effect of the infinite discontinuity in the coefficient p(t) at 
the origin. The function y = t 2 + (1/t 2 ) for t < 0 is not part of the solution of this initial value 
problem. 

This is the first example in which the solution fails to exist for some values of t. Again, this 
is due to the infinite discontinuity in p(t) at t = 0, which restricts the solution to the interval 

0 < t < oo. 
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Looking again at Figure 2.1.3, we see that some solutions (those for which c > 0) are asymp- 
totic to the positive y-axis as t — > 0 from the right, while other solutions (for which c < 0) 
are asymptotic to the negative y-axis. The solution for which c = 0, namely, y = f 2 , remains 
bounded and differentiable even at t = 0. If we generalize the initial condition (35) to 

y( 1) = yo, (39) 

then c = y 0 — 1 and the solution (38) becomes 

y = t 2 + y ° 1 , t>Oify 0 /l. (40) 

t 

As in Example 2, this is another instance where there is a critical initial value, namely, yo = 1, 
that separates solutions that behave in one way from others that behave quite differently. 


EXAMPLE 

4 


Solve the initial value problem 


2 y' + fy = 2, 
y (0) = l. 


(41) 

(42) 


To convert the differential equation (41) to the standard form (3), we must divide by two, 
obtaining 

y' + (f/2)y = 1. (43) 

Thus p{t) — t/ 2, and the integrating factor is fi(t) = exp(t 2 /4). Then multiply Eq. (43) by /r(f), 
so that 

e' 2/ V + V /4 y = e' 2/4 . (44) 

The left side of Eq. (44) is the derivative of e ,2/4 y, so by integrating both sides of Eq. (44), we 
obtain 

= + ,45) 

The integral on the right side of Eq. (45) cannot be evaluated in terms of the usual elementary 
functions, so we leave the integral unevaluated. However, by choosing the lower limit of 
integration as the initial point t = 0, we can replace Eq. (45) by 

e ,2 / 4 y = f // 4 ds + c, (46) 

Jo 

where c is an arbitrary constant. It then follows that the general solution y of Eq. (41) is given 
by 

y = e - ,2/4 f e® 2 / 4 ds + ce-' 2/4 . (47) 

Jo 

The initial condition (42) requires that c = 1. 

The main purpose of this example is to illustrate that sometimes the solution must be left in 
terms of an integral. This is usually at most a slight inconvenience, rather than a serious obsta- 
cle. For a given value of t the integral in Eq. (47) is a definite integral and can be approximated 
to any desired degree of accuracy by using readily available numerical integrators. By repeat- 
ing this process for many values of t and plotting the results, you can obtain a graph of a solu- 
tion. Alternatively, you can use a numerical approximation method, such as those discussed 
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in Chapter 8, that proceed directly from the differential equation and need no expression 
for the solution. Software packages such as Maple and Mathematica readily execute such 
procedures and produce graphs of solutions of differential equations. 



Figure 2.1.4 displays graphs of the solution (47) for several values of c. From the figure it 
may be plausible to conjecture that all solutions approach a limit as t — > oo. The limit can be 
found analytically (see Problem 32). 


PROBLEMS In each of Problems 1 through 12: 

(a) Draw a direction field for the given differential equation. 

(b) Based on an inspection of the direction field, describe how solutions behave for large t. 

(c) Find the general solution of the given differential equation, and use it to determine how 



solutions behave as t — > oo. 
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1. 

/ + 3y = 

= t + e~ 2t 

41 

2. 

y'-2y= t 2 e 2 ‘ 


41 

3. 

y' + y = 

te~‘ + 1 

41 

4. 

y' + (1 /t)y = 3 cos 2 1, 

41 

5. 

y'-2y-- 

= 3e‘ 

41 

6. 

ty' + 2_y = sin t. 

t > 0 

41 

7. 

y' + 2 ty 

= 2 te~‘ 2 

41 

8. 

(1 + t 2 )y' + 4 ty = 

(1 + t 2 ) 

41 

9. 

2y' + y - 

= 3 1 

41 

10. 

T 

II 

1 

Vs 

t > 0 

41 

11. 

II 

+ 

5 sin 2 1 

41 

12. 

2 y'+y= 3 1 2 



In each of Problems 13 through 20 find the solution of the given initial value problem. 

13. y'-y = 2te 2 ', y(Q) = 1 

14. / + 2y = te~ 2 \ y( 1) = 0 

15. ty' + 2y = t 2 - t + 1, y(l) = |, t>0 

16. y' + (2 /t)y = (cos t)/t 2 , y(n) = 0, t > 0 

17. y'-2y = e 2 ‘, y(0) = 2 
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18. ty' + 2 y — sinf, y(;r/ 2) = 1, f > 0 

19. f 3 / + 4 t 2 y = e _t , >>(-1) = 0, f < 0 

20. f/ + (f + l)y = f, y{ ln2) = 1, r > 0 

In each of Problems 21 through 23: 

(a) Draw a direction field for the given differential equation. How do solutions appear to 
behave as f becomes large? Does the behavior depend on the choice of the initial value a? 
Let «o be the value of a for which the transition from one type of behavior to another occurs. 
Estimate the value of a 0 . 

(b) Solve the initial value problem and find the critical value a 0 exactly. 

(c) Describe the behavior of the solution corresponding to the initial value a 0 . 

21. y — \y = 2 cos f, y( 0) = a 

22. 2y' — y = e' /3 , y(0) = a 

23. 3 y — 2 y = e _ir</2 , y(0) = a 

In each of Problems 24 through 26: 

(a) Draw a direction field for the given differential equation. How do solutions appear to 
behave as t — > 0? Does the behavior depend on the choice of the initial value a ? Let ao be 
the value of a for which the transition from one type of behavior to another occurs. Estimate 
the value of a 0 . 

(b) Solve the initial value problem and find the critical value a 0 exactly. 

(c) Describe the behavior of the solution corresponding to the initial value a 0 . 

24. ty' + (t + 1 )y = 2 te~‘, y( 1) — a, t > 0 

25. ty' + 2 y = (sinf)/t, y{— jr/2) = a, t < 0 

26. (sinf)/ + (cosf)y = e‘, y(l) = a, 0<t<n 

27. Consider the initial value problem 

/ + \y = 2 cos t, y(0) — — 1. 

Find the coordinates of the first local maximum point of the solution for t > 0. 

28. Consider the initial value problem 

y + §y = l- if, y( 0) = yo. 

Find the value of y 0 for which the solution touches, but does not cross, the f-axis. 

29. Consider the initial value problem 

/ + \y = 3 + 2cos2f, y(0) = 0. 

(a) Find the solution of this initial value problem and describe its behavior for large f. 

(b) Determine the value of f for which the solution first intersects the line y = 12. 

30. Find the value of y 0 for which the solution of the initial value problem 

y'-y = l + 3sinf, y(0)=y 0 

remains finite as f — > oo. 

31. Consider the initial value problem 


y — \y = 3f + 2d, y(0)=y 0 - 
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Find the value of vo that separates solutions that grow positively as t — > oo from those 
that grow negatively. How does the solution that corresponds to this critical value of yo 
behave as t -> oo? 

32. Show that all solutions of 2/ + ty = 2 [Eq. (41) of the text] approach a limit as t — > oo, 
and find the limiting value. 

Hint: Consider the general solution, Eq. (47), and use L’Hospital’s rule on the first term. 

33. Show that if a and X are positive constants, and b is any real number, then every solution 
of the equation 

/ + ay = be~ x ' 


has the property that y — > 0 as t — > oo. 

Hint: Consider the cases a = X and a ^ X separately. 

In each of Problems 34 through 37 construct a first order linear differential equation whose 
solutions have the required behavior as t oo. Then solve your equation and confirm that 
the solutions do indeed have the specified property. 

34. All solutions have the limit 3 as t — »■ oo. 

35. All solutions are asymptotic to the line y = 3 — fast— > oo. 

36. All solutions are asymptotic to the line y = 2t — 5 as t — > oo. 

37. All solutions approach the curve y = 4 — t 2 as t — > oo. 

38. Variation of Parameters. Consider the following method of solving the general linear 
equation of first order: 

/ +p(t)y = g(t). (i) 


(a) If git) = 0 for all t, show that the solution is 


y = A exp 



p{t)dt , 


(ii) 


where A is a constant. 

(b) If g(t) is not everywhere zero, assume that the solution of Eq. (i) is of the form 


y = A(t) exp 



pit) dt , 


(iii) 


where A is now a function of t. By substituting for y in the given differential equation, 
show that A(t) must satisfy the condition 


A’(t) =g(t) exp 


J pit) dt 


(iv) 


(c) Find Ait) from Eq. (iv). Then substitute for Ait) in Eq. (iii) and determine y. Verify 
that the solution obtained in this manner agrees with that of Eq. (33) in the text. This 
technique is known as the method of variation of parameters; it is discussed in detail in 
Section 3.6 in connection with second order linear equations. 


In each of Problems 39 through 42 use the method of Problem 38 to solve the given differential 
equation. 

39. y - 2y = t 2 e 2 ‘ 

41. ty + 2y = sin t , 


t > 0 


40. y’ + (1 /t)y = 3cos2f, 
42. 2/ + y = 3 1 2 


t > 0 
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2.2 Separable Equations 


In Sections 1.2 and 2.1 we used a process of direct integration to solve first order 
linear equations of the form 

%= ay+b - (1) 

where a and b are constants. We will now show that this process is actually applicable 
to a much larger class of equations. 

We will use x, rather than t, to denote the independent variable in this section for 
two reasons. In the first place, different letters are frequently used for the variables in 
a differential equation, and you should not become too accustomed to using a single 
pair. In particular, x often occurs as the independent variable. Further, we want to 
reserve t for another purpose later in the section. 

The general first order equation is 

d f=f(x,y). ( 2 ) 

ax 

Linear equations were considered in the preceding section, but if Eq. (2) is nonlinear, 
then there is no universally applicable method for solving the equation. Here, we 
consider a subclass of first order equations that can be solved by direct integration. 
To identify this class of equations, we first rewrite Eq. (2) in the form 

M(x,y)+N(x,y)^ = 0. (3) 

ax 

It is always possible to do this by setting M(x,y) = —f(x,y) andlVfdqy) = l.butthere 
may be other ways as well. If it happens that M is a function of x only and N is a 
function of y only, then Eq. (3) becomes 

M(x) + N(y) d f = 0. (4) 

ax 

Such an equation is said to be separable, because if it is written in the differential 
form 

M(x) dx + N{y) dy — 0, (5) 

then, if you wish, terms involving each variable may be placed on opposite sides of 
the equation. The differential form (5) is also more symmetric and tends to suppress 
the distinction between independent and dependent variables. 

A separable equation can be solved by integrating the functions M and N. We 
illustrate the process by an example and then discuss it in general for Eq. (4). 


Show that the equation 

EXAMPLE 

1 


dy x 2 
dx 1 — y 2 


( 6 ) 


is separable, and then find an equation for its integral curves. 


2.2 Separable Equations 


43 


If we write Eq. (6) as 


-x 2 + (i -y 2 )-f = °’ 

ax 


(V) 


then it has the form (4) and is therefore separable. Recall from calculus that if y is a function 
of x, then by the chain rule 

d „ d „ dv dy 

~rf(y) = -J- =/ O')/- 

ax ay ax ax 

For example, if f(y) = y — y 3 / 3, then 

d a o dy 

— {y-y 3 /3) = a-y 2 )-r.- 


dx 


dx 


Thus the second term in Eq. (7) is the derivative with respect to x of y — y 3 / 3, and the first 
term is the derivative of — x 3 /3. Thus Eq. (7) can be written as 


dx V 3 7 dx 


dx ' * + - V ’ ' ° 


Therefore by integrating, we obtain 


—x 3 + 3 y — y 3 = c, 


( 8 ) 


where c is an arbitrary constant. Equation (8) is an equation for the integral curves of Eq. (6). 
A direction field and several integral curves are shown in Figure 2.2.1. Any differentiable 
function y = <p(x) that satisfies Eq. (8) is a solution of Eq. (6). An equation of the integral 
curve passing through a particular point (x 0 ,yo) can be found by substituting x 0 and y Q for x 
and y, respectively, in Eq. (8) and determining the corresponding value of c. 



FIGURE 2.2.1 Direction field and integral curves of y' — x 2 /(l — y 2 ). 
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Essentially the same procedure can be followed for any separable equation. Re- 
turning to Eq. (4), let H \ and H 2 be any antiderivatives of M and N . respectively. 
Thus 

H[(x) = M(x), H' 2 (y) = N(y), (9) 


and Eq. (4) becomes 


According to the chain rule, 


H[(x) + H' 1 (y) d ^ = Q. 

dx 


« dy d dy d „ 

= ~ r H 2 (y ) - = -H 2 (y). 

ax dy dx dx 


Consequently, we can write Eq. (10) as 


— [H x (x) + H 2 (y)] = 0. 
dx 


By integrating Eq. (12), we obtain 


( 10 ) 


( 11 ) 


(12) 


H i(x) + H 2 {y) = c. 


(13) 


where c is an arbitrary constant. Any differentiable function y = cp(x) that satisfies 
Eq. (13) is a solution of Eq. (4); in other words, Eq. (13) defines the solution implic- 
itly rather than explicitly. In practice, Eq. (13) is usually obtained from Eq. (5) by 
integrating the first term with respect to v and the second term with respect to y. 

The differential equation (4), together with an initial condition 

y(xo) = yo , (14) 


form an initial value problem. To solve this initial value problem, we must determine 
the appropriate value for the constant c in Eq. (13). We do this by setting x = x 0 and 
y — y 0 in Eq. (13) with the result that 

c = Hi(x 0 ) + H 2 (y 0 ). (15) 


Substituting this value of c in Eq. (13) and noting that 


H\ (x) - //i (vo) = f M{s) ds , 

Jxo 


H 2 (y ) - H 2 (yo) = 



N(s) ds, 


we obtain 


f M(s) ds 

Jx 0 


■ f N(s) ds — 0. 

Jy o 


(16) 


Equation ( 16) is an implicit representation of the solution of the differential equation 
(4) that also satisfies the initial condition (14). You should bear in mind that, to 
determine an explicit formula for the solution, Eq. (16) must be solved for y as a 
function of x. Unfortunately, it is often impossible to do this analytically; in such 
cases you can resort to numerical methods to find approximate values of y for given 
values of x. 
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EXAMPLE 

2 


Solve the initial value problem 

dy 

dx 


3x 2 + 4x + 2 
2(y-l) 


y( 0) = -l, 


and determine the interval in which the solution exists. 
The differential equation can be written as 


2 (y — 1 ) dy = (3x 2 + 4x + 2) dx. 

Integrating the left side with respect to y and the right side with respect to x gives 

y 2 — 2y — x 3 + 2x 2 +2 x + c, 


(17) 


(18) 


where c is an arbitrary constant. To determine the solution satisfying the prescribed initial 
condition, we substitute x = 0 and y = — 1 in Eq. (18), obtaining c = 3. Hence the solution of 
the initial value problem is given implicitly by 

y 2 — 2 y = x 3 + 2x 2 + 2x + 3. (19) 

To obtain the solution explicitly, we must solve Eq. (19) for y in terms of x. That is a simple 
matter in this case, since Eq. (19) is quadratic in y, and we obtain 

y = 1 ± , Jx 3 + 2x 2 + 2x + 4. (20) 

Equation (20) gives two solutions of the differential equation, only one of which, however, 
satisfies the given initial condition. This is the solution corresponding to the minus sign in 
Eq. (20), so we finally obtain 

y = <j>{x) = 1 — V* 3 + 2x 2 + 2x + 4 (21) 

as the solution of the initial value problem (17). Note that if the plus sign is chosen by mistake 
in Eq. (20), then we obtain the solution of the same differential equation that satisfies the 
initial condition y(0) = 3. Finally, to determine the interval in which the solution (21) is valid, 
we must find the interval in which the quantity under the radical is positive. The only real zero 
of this expression is x = —2, so the desired interval is x > —2. The solution of the initial value 
problem and some other integral curves of the differential equation are shown in Figure 2.2.2. 



FIGURE 2.2.2 Integral curves of y' = (3x 2 + Ax + 2)/2(y — 1). 
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I 


EXAMPLE 

3 


Observe that the boundary of the interval of validity of the solution (21) is determined by the 
point (—2, 1) at which the tangent line is vertical. 


Solve the equation 


dy 4x — x 3 
dx 4 + y 3 


( 22 ) 


and draw graphs of several integral curves. Also find the solution passing through the point 
(0, 1) and determine its interval of validity. 

Rewriting Eq. (22) as 

(4 + y 3 ) dy = (4x - x 3 ) dx, 


integrating each side, multiplying by 4, and rearranging the terms, we obtain 

y 4 + 16 y + x 4 — 8x 2 = c, (23) 

where c is an arbitrary constant. Any differentiable function y = 4>(x) that satisfies Eq. (23) 
is a solution of the differential equation (22). Graphs of Eq. (23) for several values of c are 
shown in Figure 2.2.3. 

To find the particular solution passing through (0, 1), we set x = 0 and y = 1 in Eq. (23) with 
the result that c = 17. Thus the solution in question is given implicitly by 

/ + 16y + x 4 - 8x 2 = 17. (24) 


It is shown by the heavy curve in Figure 2.2.3. The interval of validity of this solution extends 
on either side of the initial point as long as the function remains differentiable. From the figure 



FIGURE 2.2.3 Integral curves of y' = (4x — x 3 )/{4 + y 3 ). The solution passing through (0, 1) 
is shown by the heavy curve. 
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we see that the interval ends when we reach points where the tangent line is vertical. It follows 
from the differential equation (22) that these are points where 4 + y 3 = 0, or 
y = (— 4) 1/3 = —1.5874. From Eq. (24) the corresponding values of x are x = ±3.3488. These 
points are marked on the graph in Figure 2.2.3. 


Note 1: Sometimes an equation of the form (2) 


Ay 

dx 


= fix, y) 


has a constant solution y = y 0 . Such a solution is usually easy to find because if 
f(x,yo) = 0 for some value yo and for all x, then the constant function y = yo is a 
solution of the differential equation (2). For example, the equation 


Ay (y — 3) cos x 
dx 1 + 2 y 2 


has the constant solution y = 3. Other solutions of this equation can be found by 
separating the variables and integrating. 

Note 2: The investigation of a first order nonlinear equation can sometimes be 
facilitated by regarding both x and y as functions of a third variable t. Then 


Ay Ay / At 
dx dx/dt 


(26) 


If the differential equation is 


Ay _ F(x,y) 
dx G(x.y) ’ 


(27) 


then, by comparing numerators and denominators in Eqs. (26) and (27), we obtain 
the system 

dx/dt = G(x,y), dy/dt = F(x,y). (28) 


At first sight it may seem unlikely that a problem will be simplified by replacing a 
single equation by a pair of equations, but, in fact, the system (28) may well be more 
amenable to investigation than the single equation (27). Chapter 9 is devoted to 
nonlinear systems of the form (28). 

Note 3: In Example 2 it was not difficult to solve explicitly for y as a function 
of x. However, this situation is exceptional, and often it will be better to leave the 
solution in implicit form, as in Examples I and 3. Thus, in the problems below and 
in other sections where nonlinear equations appear, the words “solve the following 
differential equation” mean to find the solution explicitly if it is convenient to do so, 
but otherwise to find an equation defining the solution implicitly. 


PROBLEMS In each of Problems 1 through 8 solve the given differential equation. 

1. y' = x 2 /y 2. / = x 2 /y( 1 + x 3 ) 

3. y' + y 2 sinx = 0 4. / = (3x 2 - l)/(3 + 2 y) 

5. y = (cos 2 x)(cos 2 2 y) 6. xy' = (1 — y 2 ) 1/2 
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41 

41 

41 

41 

41 

41 

41 

41 


41 


41 


41 


41 


41 


41 


dy x — e x dy x 2 

7. — = 8. — - = 

dx y + ey dx 1 + y A 

In each of Problems 9 through 20: 

(a) Find the solution of the given initial value problem in explicit form. 

(b) Plot the graph of the solution. 

(c) Determine (at least approximately) the interval in which the solution is defined. 

9. y' = (1 - 2 x)y 2 , y( 0) = -1/6 10. / = (1 - 2 x)/y, y(\) = -2 

11. x dx + ye~ x dy = 0, y(0) = 1 12. dr/d6 = r 2 /9 , r(l) = 2 

13. y = 2 x/(y + x 2 y), y(0) = -2 14. y' = xy 3 ( 1 + x 2 )” 1 ' 2 , y(0) = 1 

15. y = 2x/(l + 2 y), y{ 2) = 0 16. y' = x(x 2 + l)/4y 3 , y(0) = -1/V2 

17. y = (3x 2 - e x )/(2 y - 5), y(0) = 1 

18. y = (er x - e x )/{3 + 4 y), y(0) = 1 

19. sin 2x dx + cos 3y dy = 0, y(n/2) = n/3 

20. y 2 (l — x 2 ) 1/2 dy = arcsinx dx, y(0) = 1 


Some of the results requested in Problems 21 through 28 can be obtained either by solving 
the given equations analytically or by plotting numerically generated approximations to the 
solutions. Try to form an opinion as to the advantages and disadvantages of each approach. 

21. Solve the initial value problem 

y = (1 + 3x 2 )/(3y 2 - 6 y), y(0) = 1 


and determine the interval in which the solution is valid. 

Hint: To find the interval of definition, look for points where the integral curve has a 
vertical tangent. 

22. Solve the initial value problem 

y = 3x 2 /(3 y 2 - 4), y (1) = o 


and determine the interval in which the solution is valid. 

Hint: To find the interval of definition, look for points where the integral curve has a 
vertical tangent. 

23. Solve the initial value problem 

y' = 2y 2 + xy 2 , y (0) = 1 


and determine where the solution attains its minimum value. 

24. Solve the initial value problem 

/ = (2 - e*)/(3 + 2y), y(0) = 0 


and determine where the solution attains its maximum value. 

25. Solve the initial value problem 

y' = 2 cos 2x/(3 + 2y), y(0) = — 1 


and determine where the solution attains its maximum value. 

26. Solve the initial value problem 

V = 2(1 + x)(l + y 2 ), y(0) = 0 


and determine where the solution attains its minimum value. 
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27. Consider the initial value problem 

y' = ty( 4 - y)/3, y(0) = y 0 . 

(a) Determine how the behavior of the solution as t increases depends on the initial 
value yo. 

(b) Suppose that vo = 0.5. Find the time T at which the solution first reaches the value 3.98. 
4fl 28. Consider the initial value problem 

/ = O' (4 - y)/( 1 + 0. >(0) = y 0 > 0. 


(a) Determine how the solution behaves as t — > oo. 

(b) If y 0 = 2, find the time T at which the solution first reaches the value 3.99. 

(c) Findtherangeofinitialvaluesforwhichthesolutionliesintheinterval3.99 < y < 4.01 
by the time t = 2. 

29. Solve the equation 

dy ay + b 

dx cy + d ’ 

where a, b , c, and d are constants. 


Homogeneous Equations. If the right side of the equation dy/dx =f(x,y) can be ex- 
pressed as a function of the ratio y/x only, then the equation is said to be homogeneous. 1 
Such equations can always be transformed into separable equations by a change of the 
dependent variable. Problem 30 illustrates how to solve first order homogeneous equa- 
tions. 


30. Consider the equation 


dy y — 4x 
dx x — y 


(a) Show that Eq. (i) can be rewritten as 


dy_ _ (y/x) - 4 
dx 1 — (y/x ) ’ 


(i) 


(ii) 


thus Eq. (i) is homogeneous. 

(b) Introduce a new dependent variable v so that v — y/x , or y = xv(x). Express dy/dx 
in terms of x, v, and dv/dx. 

(c) Replace y and dy/dx in Eq. (ii) by the expressions from part (b) that involve v and 
dv/dx. Show that the resulting differential equation is 


v + x 


d v 
dx 


v-4 

1-v’ 


or 


d v v 2 — 4 
dx 1 — v 


(hi) 


Observe that Eq. (iii) is separable. 


lr The word "homogeneous" has different meanings in different mathematical contexts. The homogeneous 
equations considered here have nothing to do with the homogeneous equations that will occur in Chapter 3 
and elsewhere. 
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(d) Solve Eq. (iii), obtaining v implicitly in terms of x. 

(e) Find the solution of Eq. (i) by replacing v by y/x in the solution in part (d). 

(f) Draw a direction field and some integral curves for Eq. (i). Recall that the right side 
of Eq. (i) actually depends only on the ratio y/x. This means that integral curves have the 
same slope at all points on any given straight line through the origin, although the slope 
changes from one line to another. Therefore the direction field and the integral curves are 
symmetric with respect to the origin. Is this symmetry property evident from your plot? 


The method outlined in Problem 30 can be used for any homogeneous equation. That is, 
the substitution y = xv(x) transforms a homogeneous equation into a separable equation. 
The latter equation can be solved by direct integration, and then replacing v by y/x gives 
the solution to the original equation. In each of Problems 31 through 38: 

(a) Show that the given equation is homogeneous. 

(b) Solve the differential equation. 

(c) Draw a direction field and some integral curves. Are they symmetric with respect to 




the 

origin? 






31. 

dy 

x 2 + xy + y 2 

*() 

32. 

dy 

x 2 + 3 y 2 

#1 > 

dx 

X 2 


dx 

2 xy 

41 

33. 

dy 

dx 

II 

1 I 

V £ 

41 

34. 

dy 

dx 

4x + 3y 

2 x + y 

41 

35. 

dy 

dx 

_ x + 3 y 
x-y 

41 

36. 

(x 2 

+ 3 xy + y 2 ) dx 

41 

37. 

dy 

dx 

x 2 - 3 y 2 

2xy 

41 

38. 

dy 

dx 

3 y 2 - x 2 

2 xy 


1 dy = 0 


2.3 Modeling with First Order Equations 


Differential equations are of interest to nonmathematicians primarily because of the 
possibility of using them to investigate a wide variety of problems in the physical, 
biological, and social sciences. One reason for this is that mathematical models and 
their solutions lead to equations relating the variables and parameters in the prob- 
lem. These equations often enable you to make predictions about how the natural 
process will behave in various circumstances. It is often easy to vary parameters in 
the mathematical model over wide ranges, whereas this may be very time-consuming 
or expensive, if not impossible, in an experimental setting. Nevertheless, mathemat- 
ical modeling and experiment or observation are both critically important and have 
somewhat complementary roles in scientific investigations. Mathematical models are 
validated by comparison of their predictions with experimental results. On the other 
hand, mathematical analyses may suggest the most promising directions to explore 
experimentally, and they may indicate fairly precisely what experimental data will 
be most helpful. 

In Sections 1.1 and 1.2 we formulated and investigated a few simple mathemati- 
cal models. We begin by recapitulating and expanding on some of the conclusions 
reached in those sections. Regardless of the specific field of application, there are 
three identifiable steps that are always present in the process of mathematical mod- 
eling. 
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Construction of the Model. In this step you translate the physical situation into mathe- 
matical terms, often using the steps listed at the end of Section 1.1. Perhaps most 
critical at this stage is to state clearly the physical principle(s) that are believed to 
govern the process. For example, it has been observed that in some circumstances 
heat passes from a warmer to a cooler body at a rate proportional to the temperature 
difference, that objects move about in accordance with Newton’s laws of motion, and 
that isolated insect populations grow at a rate proportional to the current population. 
Each of these statements involves a rate of change (derivative) and consequently, 
when expressed mathematically, leads to a differential equation. The differential 
equation is a mathematical model of the process. 

It is important to realize that the mathematical equations are almost always only 
an approximate description of the actual process. For example, bodies moving at 
speeds comparable to the speed of light are not governed by Newton’s laws, insect 
populations do not grow indefinitely as stated because of eventual lack of food or 
space, and heat transfer is affected by factors other than the temperature difference. 
Thus you should always be aware of the limitations of the model so that you will use it 
only when it is reasonable to believe that it is accurate. Alternatively, you can adopt 
the point of view that the mathematical equations exactly describe the operation of 
a simplified physical model, which has been constructed (or conceived of) so as to 
embody the most important features of the actual process. Sometimes, the process 
of mathematical modeling involves the conceptual replacement of a discrete process 
by a continuous one. For instance, the number of members in an insect population 
changes by discrete amounts; however, if the population is large, it seems reasonable 
to consider it as a continuous variable and even to speak of its derivative. 

Analysis of the Model. Once the problem has been formulated mathematically, you are 
often faced with the problem of solving one or more differential equations or, fail- 
ing that, of finding out as much as possible about the properties of the solution. It 
may happen that this mathematical problem is quite difficult, and if so, further ap- 
proximations may be indicated at this stage to make the problem mathematically 
tractable. For example, a nonlinear equation may be approximated by a linear one, 
or a slowly varying coefficient may be replaced by a constant. Naturally, any such 
approximations must also be examined from the physical point of view to make sure 
that the simplified mathematical problem still reflects the essential features of the 
physical process under investigation. At the same time, an intimate knowledge of the 
physics of the problem may suggest reasonable mathematical approximations that 
will make the mathematical problem more amenable to analysis. This interplay of 
understanding of physical phenomena and knowledge of mathematical techniques 
and their limitations is characteristic of applied mathematics at its best, and it is 
indispensable in successfully constructing useful mathematical models of intricate 
physical processes. 

Comparison with Experiment or Observation. Finally, having obtained the solution (or at 
least some information about it), you must interpret this information in the context 
in which the problem arose. In particular, you should always check that the math- 
ematical solution appears physically reasonable. If possible, calculate the values of 
the solution at selected points and compare them with experimentally observed val- 
ues. Or ask whether the behavior of the solution after a long time is consistent with 
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EXAMPLE 

1 


Mixing 


observations. Or examine the solutions corresponding to certain special values of pa- 
rameters in the problem. Of course, the fact that the mathematical solution appears 
to be reasonable does not guarantee that it is correct. However, if the predictions of 
the mathematical model are seriously inconsistent with observations of the physical 
system it purports to describe, this suggests that errors have been made in solving 
the mathematical problem, that the mathematical model itself needs refinement, or 
that observations must be made with greater care. 

The examples in this section are typical of applications in which first order differ- 
ential equations arise. 


At time ( = 0a tank contains Go lb of salt dissolved in 100 gal of water; see Figure 2.3.1. 
Assume that water containing f lb of salt/gal is entering the tank at a rate of r gal/min and 
that the well-stirred mixture is draining from the tank at the same rate. Set up the initial 
value problem that describes this flow process. Find the amount of salt Q(t) in the tank at any 
time, and also find the limiting amount Q L that is present after a very long time. If r = 3 and 
Go = 2 Q l , find the time T after which the salt level is within 2% of Q L . Also find the flow 
rate that is required if the value of T is not to exceed 45 min. 



FIGURE 2.3.1 The water tank in Example 1. 


We assume that salt is neither created nor destroyed in the tank. Therefore variations in 
the amount of salt are due solely to the flows in and out of the tank. More precisely, the rate 
of change of salt in the tank, dQ/dt , is equal to the rate at which salt is flowing in minus the 
rate at which it is flowing out. In symbols, 

= rate in — rate out. (1) 

dt 

The rate at which salt enters the tank is the concentration f lb/gal times the flow rate r gal/min, 
or (r/4) lb/min. To find the rate at which salt leaves the tank, we need to multiply the concen- 
tration of salt in the tank by the rate of outflow, r gal/min. Since the rates of flow in and out 
are equal, the volume of water in the tank remains constant at 100 gal, and since the mixture 
is “well-stirred," the concentration throughout the tank is the same, namely, [Q(t)/ 100] lb/gal. 
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Therefore the rate at which salt leaves the tank is [r<2(t)/T00] lb/min. Thus the differential 
equation governing this process is 


The initial condition is 


dQ_ _ r_ _ rQ^ 
~dt ~ 4 ~ 100' 


G(0) = Go. 


( 2 ) 

(3) 


Upon thinking about the problem physically, we might anticipate that eventually the mixture 
originally in the tank will be essentially replaced by the mixture flowing in, whose concentration 
is | lb/gal. Consequently, we might expect that ultimately the amount of salt in the tank would 
be very close to 25 lb. We can also find the limiting amount Gl = 25 by setting dQ/dt equal 
to zero in Eq. (2) and solving the resulting algebraic equation for Q. 

To solve the initial value problem (2), (3) analytically, note that Eq. (2) is both linear and 
separable. Rewriting it in the standard form for a linear equation, we have 


dQ rQ _ r 
dt + 100 “ 4' 


(4) 


Thus the integrating factor is e rt/10 ° and the general solution is 


Q(f) = 25 + ce~ r,/m , 


(5) 


where c is an arbitrary constant. To satisfy the initial condition (3), we must choose c = Go — 25. 
Therefore the solution of the initial value problem (2), (3) is 

G(0 = 25 + (Go - 25)e-" /10 °, (6) 

or 

Q(t) = 25(1 - e - nnoa ) + Goe" rt/10 °. (V) 


From Eq. (6) or (7), you can see that Q(t) -> 25 (lb) as t — > oo, so the limiting value Q L is 
25, confirming our physical intuition. Further, Q(t) approaches the limit more rapidly as r 
increases. In interpreting the solution (7), note that the second term on the right side is the 
portion of the original salt that remains at time f , while the first term gives the amount of salt in 
the tank due to the action of the flow processes. Plots of the solution for r = 3 and for several 
values of Go are shown in Figure 2.3.2. 

Now suppose that r = 3 and Go = 2 Q L = 50; then Eq. (6) becomes 

Q(t) = 25 + 25e~ 0 03 ' . (8) 


Since 2% of 25 is 0.5, we wish to find the time T at which Q(t) has the value 25.5. Substituting 
t = T and Q = 25.5 in Eq. (8) and solving for T, we obtain 

T = (ln50)/0.03 = 130.4 (min). (9) 

To determine r so that T = 45, return to Eq. (6), set t = 45, Go = 50, Q(t) = 25.5, and solve 
for r. The result is 

r = (100/45) In 50 = 8.69 gal/min. (10) 


Since this example is hypothetical, the validity of the model is not in question. If the flow 
rates are as stated, and if the concentration of salt in the tank is uniform, then the differential 
equation (1) is an accurate description of the flow process. Although this particular example 
has no special significance, models of this kind are often used in problems involving a pollutant 
in a lake, or a drug in an organ of the body, for example, rather than a tank of salt water. In 
such cases the flow rates may not be easy to determine or may vary with time. Similarly, the 
concentration may be far from uniform in some cases. Finally, the rates of inflow and outflow 
may be different, which means that the variation of the amount of liquid in the problem must 
also be taken into account. 
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FIGURE 2.3.2 Solutions of the initial value problem (2), (3) for r = 3 and several values 
of Go- 


example 

2 


Compound 
Int erest 


Suppose that a sum of money is deposited in a bank or money fund that pays interest at an 
annual rate r. The value 5(0 of the investment at any time t depends on the frequency with 
which interest is compounded as well as on the interest rate. Financial institutions have various 
policies concerning compounding: some compound monthly, some weekly, some even daily. 
If we assume that compounding takes place continuously , then we can set up a simple initial 
value problem that describes the growth of the investment. 

The rate of change of the value of the investment is dS/dt , and this quantity is equal to 
the rate at which interest accrues, which is the interest rate r times the current value of the 
investment 5(0- Thus 

dS/dt = rS (11) 


is the differential equation that governs the process. Suppose that we also know the value of 
the investment at some particular time, say, 


5(0) = S 0 . (12) 

Then the solution of the initial value problem (11), (12) gives the balance S(t) in the account 
at any time t. This initial value problem is readily solved, since the differential equation (11) 
is both linear and separable. Consequently, by solving Eqs. (11) and (12), we find that 


5(1 ) = S 0 e rl . 


(13) 


Thus a bank account with continuously compounding interest grows exponentially. 

Let us now compare the results from this continuous model with the situation in which 
compounding occurs at finite time intervals. If interest is compounded once a year, then after 
t years 


5(0 = 5 0 (1 + r)‘. 
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If interest is compounded twice a year, then at the end of 6 months the value of the investment 
is So[l + (r/ 2)], and at the end of 1 year it is So[l + 0/2)] 2 . Thus, after t years we have 

S(t ) = So (l + ■ 

In general, if interest is compounded m times per year, then 

/ r \ mt 

S(f) = S 0 { 1+-) . (14) 

V m> 

The relation between formulas (13) and (14) is clarified if we recall from calculus that 

/ r \ mt 

lim So ( 1 H ) =S 0 e r ‘. 

The same model applies equally well to more general investments in which dividends and 
perhaps capital gains can also accumulate, as well as interest. In recognition of this fact, we 
will from now on refer to r as the rate of return. 

Table 2.3.1 shows the effect of changing the frequency of compounding for a return rate 
r of 8%. The second and third columns are calculated from Eq. (14) for quarterly and daily 
compounding, respectively, and the fourth column is calculated from Eq. (13) for continuous 
compounding. The results show that the frequency of compounding is not particularly im- 
portant in most cases. For example, during a 10-year period the difference between quarterly 
and continuous compounding is $17.50 per $1000 invested, or less than $2/year. The differ- 
ence would be somewhat greater for higher rates of return and less for lower rates. From the 
first row in the table, we see that for the return rate r — 8%, the annual yield for quarterly 
compounding is 8.24% and for daily or continuous compounding it is 8.33%. 


TABLE 2.3.1 Growth of Capital at a Return Rate r = 8% 
for Several Modes of Compounding 


Years 

S(f)/SRo)fromEq. ^4) 

S{t)/S(t 0 ) 
from Eq. (13) 

m = 4 

in 

'O 

cn 

II 

s 

1 

1.0824 

1.0833 

1.0833 

2 

1.1717 

1.1735 

1.1735 

5 

1.4859 

1.4918 

1.4918 

10 

2.2080 

2.2253 

2.2255 

20 

4.8754 

4.9522 

4.9530 

30 

10.7652 

11.0203 

11.0232 

40 

23.7699 

24.5239 

24.5325 


Returning now to the case of continuous compounding, let us suppose that there may be 
deposits or withdrawals in addition to the accrual of interest, dividends, or capital gains. If 
we assume that the deposits or withdrawals take place at a constant rate k, then Eq. (11) is 
replaced by 

dS /dt = rS + k, 

or, in standard form, 

dS/dt — rS — k, (15) 

where k is positive for deposits and negative for withdrawals. 

Equation (15) is linear with the integrating factor e~", so its general solution is 

S(t) = ce rt — ( k/r ), 
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where c is an arbitrary constant. To satisfy the initial condition (12), we must choose 
c = So + (k/r). Thus the solution of the initial value problem (15), (12) is 

5(f) = S n e n + (k / r) (e n - 1). (16) 

The first term in expression (16) is the part of 5(f) that is due to the return accumulated on 
the initial amount So , and the second term is the part that is due to the deposit or withdrawal 
rate k. 

The advantage of stating the problem in this general way without specific values for So, r , or 
k lies in the generality of the resulting formula (16) for S(t). With this formula we can readily 
compare the results of different investment programs or different rates of return. 

For instance, suppose that one opens an individual retirement account (IRA) at age 25 and 
makes annual investments of $2000 thereafter in a continuous manner. Assuming a rate of 
return of 8%, what will be the balance in the IRA at age 65? We have So = 0, r = 0.08, and 
k = $2000, and we wish to determine 5(40). From Eq. (16) we have 

5(40) = (25,000) (e 3 2 - 1) = $588,313. (17) 

It is interesting to note that the total amount invested is $80,000, so the remaining amount of 
$508,313 results from the accumulated return on the investment. The balance after 40 years 
is also fairly sensitive to the assumed rate. For instance, 5(40) = $508,948 if r = 0.075 and 
5(40) = $681,508 if /• = 0.085. 

Let us now examine the assumptions that have gone into the model. First, we have assumed 
that the return is compounded continuously and that additional capital is invested continu- 
ously. Neither of these is true in an actual financial situation. We have also assumed that the 
return rate r is constant for the entire period involved, whereas in fact it is likely to fluctuate 
considerably. Although we cannot reliably predict future rates, we can use expression (16) to 
determine the approximate effect of different rate projections. It is also possible to consider r 
and k in Eq. (15) to be functions of f rather than constants; in that case, of course, the solution 
may be much more complicated than Eq. (16). 

The initial value problem (15), (12) and the solution (16) can also be used to analyze a 
number of other financial situations, including annuities, mortgages, and automobile loans. 


EXAMPLE 

3 


Chemicals in 
a Pond 


Consider a pond that initially contains 10 million gal of fresh water. Water containing an 
undesirable chemical flows into the pond at the rate of 5 million gal/yr, and the mixture in the 
pond flows out at the same rate. The concentration y (f) of chemical in the incoming water 
varies periodically with time according to the expression y(t) = 2 + sin2f g/gal. Construct a 
mathematical model of this flow process and determine the amount of chemical in the pond 
at any time. Plot the solution and describe in words the effect of the variation in the incoming 
concentration. 

Since the incoming and outgoing flows of water are the same, the amount of water in the 
pond remains constant at 10 7 gal. Let us denote time by t, measured in years, and the chemical 
by Q(t ), measured in grams. This example is similar to Example 1, and the same inflow/outflow 
principle applies. Thus 

dQ 

— — = rate in — rate out, 
dt 


where “rate in” and “rate out” refer to the rates at which the chemical flows into and out of 
the pond, respectively. The rate at which the chemical flows in is given by 


rate in = (5 x 10 6 ) gal/yr (2 + sin 2 1) g/gal. 


(18) 
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The concentration of chemical in the pond is <2(/)/10 7 g/gal, so the rate of flow out is 

rate out = (5 x 10 6 ) gal/yr [Q(t)/ 10 7 ] g/gal = Q(t)/2 g/yr. (19) 

Thus we obtain the differential equation 

y = (5x 10 6 )(2 + sin 2 1) - (20) 

at 2 

where each term has the units of g/yr. 

To make the coefficients more manageable, it is convenient to introduce a new dependent 
variable defined by q(t) = <2(0/ 10 6 or Q(t ) = 10 6 q(t). This means that q(t) is measured in 
millions of grams, or megagrams (metric tons). If we make this substitution in Eq. (20), then 
each term contains the factor 10 6 , which can be canceled. If we also transpose the term 
involving q(t) to the left side of the equation, we finally have 

+ \q = 10 + 5 sin2f. (21) 

Originally, there is no chemical in the pond, so the initial condition is 

q ( 0) = 0. (22) 

Equation (21) is linear, and although the right side is a function of time, the coefficient of 
q is a constant. Thus the integrating factor is e'/ 2 . Multiplying Eq. (21) by this factor and 
integrating the resulting equation, we obtain the general solution 

q(t) = 20 — cos It + 12 s i n 2 1 + ce“' /2 . (23) 

The initial condition (22) requires that c = —300/17, so the solution of the initial value problem 
(21), (22) is 

q(f) = 20 — Yj cos 2 1 + sin2f — ^ye -,/2 . (24) 

A plot of the solution (24) is shown in Figure 2.3.3, along with the line q = 20. The exponential 
term in the solution is important for small t, but it diminishes rapidly as t increases. Later, the 
solution consists of an oscillation, due to the sin 2 1 and cos 2 1 terms, about the constant level 
q = 20. Note that if the sin2f term were not present in Eq. (21), then q = 20 would be the 
equilibrium solution of that equation. 



FIGURE 2.3.3 Solution of the initial value problem (21), (22). 
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Let us now consider the adequacy of the mathematical model itself for this problem. The 
model rests on several assumptions that have not yet been stated explicitly. In the first place, 
the amount of water in the pond is controlled entirely by the rates of flow in and out — none 
is lost by evaporation or by seepage into the ground, and none is gained by rainfall. The same 
is true of the chemical; it flows into and out of the pond, but none is absorbed by fish or other 
organisms living in the pond. In addition, we assume that the concentration of chemical in 
the pond is uniform throughout the pond. Whether the results obtained from the model are 
accurate depends strongly on the validity of these simplifying assumptions. 


EXAMPLE 

4 

Escape 

Velocity 


A body of constant mass m is projected away from the earth in a direction perpendicular to the 
earth’s surface with an initial velocity vq. Assuming that there is no air resistance, but taking 
into account the variation of the earth’s gravitational field with distance, find an expression for 
the velocity during the ensuing motion. Also find the initial velocity that is required to lift the 
body to a given maximum altitude f above the surface of the earth, and find the least initial 
velocity for which the body will not return to the earth; the latter is the escape velocity. 

Let the positive x-axis point away from the center of the earth along the line of motion with 
x = 0 lying on the earth’s surface; see Figure 2.3.4. The figure is drawn horizontally to remind 
you that gravity is directed toward the center of the earth, which is not necessarily downward 
from a perspective away from the earth’s surface. The gravitational force acting on the body 
(that is, its weight) is inversely proportional to the square of the distance from the center of 
the earth and is given by w(x) — —k/(x + R) 2 , where A: is a constant, R is the radius of the 
earth, and the minus sign signifies that w(x) is directed in the negative x direction. We know 
that on the earth’s surface w(0) is given by —mg, where g is the acceleration due to gravity at 
sea level. Therefore k = mgR 2 and 


w(x) = 


mgR 2 
(R + x) 2 ' 


(25) 


Since there are no other forces acting on the body, the equation of motion is 

d v mgR 2 

yyi — 

dt (R + x) 2 ’ 


(26) 


and the initial condition is 

d(0) = v 0 . (27) 


Unfortunately, Eq. (26) involves too many variables since it depends on t, x , and v. To 
remedy this situation, we can eliminate t from Eq. (26) by thinking of x, rather than t, as the 
independent variable. Then we can express dv/dt in terms of dv/dx by using the chain rule; 
hence 

d v d v dx dv 

dt dx dt dx 



FIGURE 2.3.4 A body in the earth’s gravitational field. 
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and Eq. (26) is replaced by 

dv gR 1 2 

dx (R + x) 2 


(28) 


Equation (28) is separable but not linear, so by separating the variables and integrating, we 


obtain 


2 


gR 2 

R + x 


+ c. 


(29) 


Since x = 0 when t = 0, the initial condition (27) at t = 0 can be replaced by the condition that 
v = vo when x = 0. Hence c = (Vq/2) — gR and 



2 gR + 


2 gR 2 
R + x 


(30) 


Note that Eq. (30) gives the velocity as a function of altitude rather than as a function of time. 
The plus sign must be chosen if the body is rising, and the minus sign if it is falling back to 
earth. 

To determine the maximum altitude £ that the body reaches, we set v = 0 and x — £ in 
Eq. (30) and then solve for §, obtaining 


$ = 


vl R 

2 gR - v 2 a ' 


(31) 


Solving Eq. (31) for vq, we find the initial velocity required to lift the body to the altitude §, 
namely, 


vo 



(32) 


The escape velocity v e is then found by letting | — >■ oo. Consequently, 

V e = yjl gR. 


(33) 


The numerical value of v e is approximately 6.9 mi/s, or 11.1 km/s. 

The preceding calculation of the escape velocity neglects the effect of air resistance, so the 
actual escape velocity (including the effect of air resistance) is somewhat higher. On the other 
hand, the effective escape velocity can be significantly reduced if the body is transported a 
considerable distance above sea level before being launched. Both gravitational and frictional 
forces are thereby reduced; air resistance, in particular, diminishes quite rapidly with increasing 
altitude. You should keep in mind also that it may well be impractical to impart too large an 
initial velocity instantaneously; space vehicles, for instance, receive their initial acceleration 
during a period of a few minutes. 


1. Consider a tank used in certain hydrodynamic experiments. After one experiment the 
tank contains 200 L of a dye solution with a concentration of 1 g/L. To prepare for the 
next experiment, the tank is to be rinsed with fresh water flowing in at a rate of 2 L/min, 
the well-stirred solution flowing out at the same rate. Find the time that will elapse before 
the concentration of dye in the tank reaches 1 % of its original value. 

2. A tank initially contains 120 L of pure water. A mixture containing a concentration of 
y g/L of salt enters the tank at a rate of 2 L/min, and the well-stirred mixture leaves the 
tank at the same rate. Find an expression in terms of y for the amount of salt in the tank 
at any time t. Also find the limiting amount of salt in the tank as t — > oo. 
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3. A tank originally contains 100 gal of fresh water. Then water containing | lb of salt per 
gallon is poured into the tank at a rate of 2 gal/min, and the mixture is allowed to leave at 
the same rate. After 10 min the process is stopped, and fresh water is poured into the tank 
at a rate of 2 gal/min, with the mixture again leaving at the same rate. Find the amount of 
salt in the tank at the end of an additional 10 min. 

4. A tank with a capacity of 500 gal originally contains 200 gal of water with 100 lb of salt 
in solution. Water containing 1 lb of salt per gallon is entering at a rate of 3 gal/min, and 
the mixture is allowed to flow out of the tank at a rate of 2 gal/min. Find the amount 
of salt in the tank at any time prior to the instant when the solution begins to overflow. 
Find the concentration (in pounds per gallon) of salt in the tank when it is on the point 
of overflowing. Compare this concentration with the theoretical limiting concentration if 
the tank had infinite capacity. 

5. A tank contains 100 gal of water and 50 oz of salt. Water containing a salt concentration 
of i (1 + i sin t) oz/gal flows into the tank at a rate of 2 gal/min, and the mixture in the 
tank flows out at the same rate. 

(a) Find the amount of salt in the tank at any time. 

(b) Plot the solution for a time period long enough so that you see the ultimate behavior 
of the graph. 

(c) The long-time behavior of the solution is an oscillation about a certain constant level. 
What is this level? What is the amplitude of the oscillation? 

6. Suppose that a tank containing a certain liquid has an outlet near the bottom. Let h(t) be 
the height of the liquid surface above the outlet at time t. Torricelli’s 2 principle states that 
the outflow velocity v at the outlet is equal to the velocity of a particle falling freely (with 
no drag) from the height h. 

(a) Show that v = v / 2gh , where g is the acceleration due to gravity. 

(b) By equating the rate of outflow to the rate of change of liquid in the tank, show that 
h(t) satisfies the equation 

df, 

A{h)— = —uay/2gh, (i) 

at 

where A (h) is the area of the cross section of the tank at height h and a is the area of 
the outlet. The constant a is a contraction coefficient that accounts for the observed fact 
that the cross section of the (smooth) outflow stream is smaller than a. The value of a for 
water is about 0.6. 

(c) Consider a water tank in the form of a right circular cylinder that is 3 m high above 
the outlet. The radius of the tank is 1 m and the radius of the circular outlet is 0.1 m. If 
the tank is initially full of water, determine how long it takes to drain the tank down to 
the level of the outlet. 

7. Suppose that a sum So is invested at an annual rate of return r compounded continuously. 

(a) Find the time T required for the original sum to double in value as a function of r. 

(b) Determine T if r = 7%. 

(c) Find the return rate that must be achieved if the initial investment is to double in 
8 years. 


2 Evangelista Torricelli (1608-1647), successor to Galileo as court mathematician in Florence, published 
this result in 1644. He is also known for constructing the first mercury barometer and for making important 
contributions to geometry. 
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8. A young person with no initial capital invests k dollars per year at an annual rate of 
return r. Assume that investments are made continuously and that the return is com- 
pounded continuously. 

(a) Determine the sum S{t) accumulated at any time t. 

(b) If r = 7.5%, determine k so that $1 million will be available for retirement in 40 years. 

(c) If k = $2000/year, determine the return rate r that must be obtained to have $1 million 
available in 40 years. 

9. A certain college graduate borrows $8000 to buy a car. The lender charges interest at 
an annual rate of 10%. Assuming that interest is compounded continuously and that 
the borrower makes payments continuously at a constant annual rate k, determine the 
payment rate k that is required to pay off the loan in 3 years. Also determine how much 
interest is paid during the 3-year period. 

10. A home buyer can afford to spend no more than $800/month on mortgage payments. 
Suppose that the interest rate is 9% and that the term of the mortgage is 20 years. Assume 
that interest is compounded continuously and that payments are also made continuously. 

(a) Determine the maximum amount that this buyer can afford to borrow. 

(b) Determine the total interest paid during the term of the mortgage. 

11. A recent college graduate borrows $100,000 at an interest rate of 9% to purchase a con- 
dominium. Anticipating steady salary increases, the buyer expects to make payments at a 
monthly rate of 800(1 + t/120), where t is the number of months since the loan was made. 

(a) Assuming that this payment schedule can be maintained, when will the loan be fully 
paid? 

(b) Assuming the same payment schedule, how large a loan could be paid off in exactly 
20 years? 

12. An important tool in archeological research is radiocarbon dating, developed by the Amer- 
ican chemist Willard F. Libby. 3 This is a means of determining the age of certain wood 
and plant remains, hence of animal or human bones or artifacts found buried at the same 
levels. Radiocarbon dating is based on the fact that some wood or plant remains contain 
residual amounts of carbon-14, a radioactive isotope of carbon. This isotope is accumu- 
lated during the lifetime of the plant and begins to decay at its death. Since the half-life of 
carbon-14 is long (approximately 5730 years 4 ), measurable amounts of carbon-14 remain 
after many thousands of years. If even a tiny fraction of the original amount of carbon-14 
is still present, then by appropriate laboratory measurements the proportion of the orig- 
inal amount of carbon-14 that remains can be accurately determined. In other words, if 
< 2(0 is the amount of carbon-14 at time t and Q 0 is the original amount, then the ratio 
< 2(0 /Qo can be determined, as long as this quantity is not too small. Present measurement 
techniques permit the use of this method for time periods of 50,000 years or more. 

(a) Assuming that Q satisfies the differential equation Q' — — rQ , determine the decay 
constant r for carbon-14. 

(b) Find an expression for <2(0 at any time t, if Q( 0) = <2o- 


3 Willard F. Libby (1908-1980) was born in rural Colorado and received his education at the University of 
California at Berkeley. He developed the method of radiocarbon dating beginning in 1947 while he was 
at the University of Chicago. For this work he was awarded the Nobel Prize in chemistry in 1960. 

4 McGraw-Hill Encyclopedia of Science and Technology (8th ed.) (New York: McGraw-Hill, 1997), Vol. 5, 
p. 48. 
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(c) Suppose that certain remains are discovered in which the current residual amount of 
carbon-14 is 20% of the original amount. Determine the age of these remains. 

The population of mosquitoes in a certain area increases at a rate proportional to the 
current population, and in the absence of other factors, the population doubles each 
week. There are 200,000 mosquitoes in the area initially, and predators (birds, bats, and 
so forth) eat 20,000 mosquitoes/day. Determine the population of mosquitoes in the area 
at any time. 

Suppose that a certain population has a growth rate that varies with time and that this 
population satisfies the differential equation 

dy/dt = (0.5 + sinf)y/5. 

(a) If y(0) = 1, find (or estimate) the time r at which the population has doubled. Choose 
other initial conditions and determine whether the doubling time r depends on the initial 
population. 

(b) Suppose that the growth rate is replaced by its average value 1/10. Determine the 
doubling time r in this case. 

(c) Suppose that the term sinr in the differential equation is replaced by sm2nt\ that is, 
the variation in the growth rate has a substantially higher frequency. What effect does this 
have on the doubling time x ? 

(d) Plot the solutions obtained in parts (a), (b), and (c) on a single set of axes. 

15. Suppose that a certain population satisfies the initial value problem 

dy/dt = r(t)y - k, y(0) = y 0 , 

where the growth rate r(t) is given by r{t) = (1 + sinf)/5, and k represents the rate of 
predation. 

(a) Suppose that k = 1/5. Ploty versus t for several values of yo between 1/2 and 1. 

(b) Estimate the critical initial population y c below which the population will become 
extinct. 

(c) Choose other values of k and find the corresponding y c for each one. 

(d) Use the data you have found in parts (b) and (c) to plot y c versus k. 

Newton’s law of cooling states that the temperature of an object changes at a rate pro- 
portional to the difference between its temperature and that of its surroundings. Suppose 
that the temperature of a cup of coffee obeys Newton’s law of cooling. If the coffee has a 
temperature of 200°F when freshly poured, and 1 min later has cooled to 190°F in a room 
at 70°F, determine when the coffee reaches a temperature of 150° F. 

Heat transfer from a body to its surroundings by radiation, based on the Stefan- 
Boltzmann 5 law, is described by the differential equation 

dll A A 

— = —a (m 4 - T 4 ), (l) 

dt 

where u(t) is the absolute temperature of the body at time t, T is the absolute temperature 
of the surroundings, and a is a constant depending on the physical parameters of the body. 


16. 


4fl 17 . 


13. 


14 . 


5 Jozef Stefan (1835-1893), professor of physics at Vienna, stated the radiation law on empirical grounds 
in 1879. His student Ludwig Boltzmann (1844-1906) derived it theoretically from the principles of ther- 
modynamics in 1884. Boltzmann is best known for his pioneering work in statistical mechanics. 
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However, if it is much larger than T, then solutions of Eq. (i) are well approximated by 
solutions of the simpler equation 


du 

dt 


= — au 


4 


(ii) 


Suppose that a body with initial temperature 2000°K is surrounded by a medium with 
temperature 300°K and that a = 2.0 x 10~ 12 °K _3 /s. 

(a) Determine the temperature of the body at any time by solving Eq. (ii). 

(b) Plot the graph of u versus t. 

(c) Find the time r at which u(z) = 600, that is, twice the ambient temperature. Up to this 
time the error in using Eq. (ii) to approximate the solutions of Eq. (i) is no more than 1 % . 

18. Consider an insulated box (a building, perhaps) with internal temperature u(t). According 
to Newton’s law of cooling, u satisfies the differential equation 


du 

dt 


—k[u — 7X0], 


(i) 


where T{t) is the ambient (external) temperature. Suppose that T(t) varies sinusoidally; 
for example, assume that T(t) = T 0 + 7) cos cut. 

(a) Solve Eq. (i) and express u(t) in terms of t, k , T 0 , 7), and a>. Observe that part of 
your solution approaches zero as t becomes large; this is called the transient part. The 
remainder of the solution is called the steady state; denote it by S(t). 

(b) Suppose that t is measured in hours and that co = jt/ 12, corresponding a period of 24 h 
for T(t). Further, let T 0 = 60° F, 7) = 15°F, and k = 0.2/h. Draw graphs of S(t) and T(t) 
versus t on the same axes. From your graph estimate the amplitude R of the oscillatory 
part of S{t). Also estimate the time lag r between corresponding maxima of T (f) and S(t). 

(c) Let k, T 0 , 7), and a> now be unspecified. Write the oscillatory part of S(t) in the form 
R cos[a>(/ — r )]. Use trigonometric identities to find expressions for R and r. Let 7) and 
u> have the values given in part (b), and plot graphs of R and r versus k. 

19. Consider a lake of constant volume V containing at time t an amount Q(t) of pollutant, 
evenly distributed throughout the lake with a concentration c(f), where c(t) = Q(t)/V. 
Assume that water containing a concentration k of pollutant enters the lake at a rate r, 
and that water leaves the lake at the same rate. Suppose that pollutants are also added 
directly to the lake at a constant rate P. Note that the given assumptions neglect a num- 
ber of factors that may, in some cases, be important — for example, the water added or 
lost by precipitation, absorption, and evaporation; the stratifying effect of temperature 
differences in a deep lake; the tendency of irregularities in the coastline to produce shel- 
tered bays; and the fact that pollutants are not deposited evenly throughout the lake but 
(usually) at isolated points around its periphery. The results below must be interpreted in 
the light of the neglect of such factors as these. 

(a) If at time t = 0 the concentration of pollutant is Co, find an expression for the concen- 
tration c(t) at any time. What is the limiting concentration as t — > oo? 

(b) If the addition of pollutants to the lake is terminated (k = 0 and P = 0 for t > 0), 
determine the time interval T that must elapse before the concentration of pollutants is 
reduced to 50% of its original value; to 10% of its original value. 

(c) Table 2.3.2 contains data 6 for several of the Great Lakes. Using these data, determine 


6 This problem is based on R. H. Rainey, “Natural Displacement of Pollution from the Great Lakes,” 
Science 155 (1967), pp. 1242-1243; the information in the table was taken from that source. 
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from part (b) the time T necessary to reduce the contamination of each of these lakes to 
10% of the original value. 


TABLE 2.3.2 Volume and Flow Data for the Great 
Lakes 


Lake 

V (km 3 x 10 3 ) 

r (km 3 /year) 

Superior 

12.2 

65.2 

Michigan 

4.9 

158 

Erie 

0.46 

175 

Ontario 

1.6 

209 


20. A ball with mass 0.15 kg is thrown upward with initial velocity 20 m/s from the roof of a 
building 30 m high. Neglect air resistance. 

(a) Find the maximum height above the ground that the ball reaches. 

(b) Assuming that the ball misses the building on the way down, find the time that it hits 
the ground. 

(c) Plot the graphs of velocity and position versus time. 

21. Assume that the conditions are as in Problem 20 except that there is a force due to air 
resistance of |u|/30, where the velocity v is measured in m/s. 

(a) Find the maximum height above the ground that the ball reaches. 

(b) Find the time that the ball hits the ground. 

(c) Plot the graphs of velocity and position versus time. Compare these graphs with the 
corresponding ones in Problem 20. 

22. Assume that the conditions are as in Problem 20 except that there is a force due to air 
resistance of ii 2 /1325, where the velocity v is measured in m/s. 

(a) Find the maximum height above the ground that the ball reaches. 

(b) Find the time that the ball hits the ground. 

(c) Plot the graphs of velocity and position versus time. Compare these graphs with the 
corresponding ones in Problems 20 and 21. 

4fl 23. A sky diver weighing 180 lb (including equipment) falls vertically downward from an 
altitude of 5000 ft and opens the parachute after 10 s of free fall. Assume that the force 
of air resistance is 0.75|a| when the parachute is closed and 12|u| when the parachute is 
open, where the velocity v is measured in ft/s. 

(a) Find the speed of the sky diver when the parachute opens. 

(b) Find the distance fallen before the parachute opens. 

(c) What is the limiting velocity vl after the parachute opens? 

(d) Determine how long the sky diver is in the air after the parachute opens. 

(e) Plot the graph of velocity versus time from the beginning of the fall until the skydiver 
reaches the ground. 

24. A rocket sled having an initial speed of 150 mi/h is slowed by a channel of water. Assume 
that, during the braking process, the acceleration a is given by a( v) = — fiv 2 , where v is the 
velocity and /x is a constant. 

(a) As in Example 4 in the text, use the relation dv/dt — v(dv/dx) to write the equation 
of motion in terms of v and x. 

(b) If it requires a distance of 2000 ft to slow the sled to 15 mi/h, determine the value of /x. 

(c) Find the time r required to slow the sled to 15 mi/h. 
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25. A body of constant mass m is projected vertically upward with an initial velocity vq in a 
medium offering a resistance k\v\, where k is a constant. Neglect changes in the gravita- 
tional force. 

(a) Find the maximum height x m attained by the body and the time t m at which this 
maximum height is reached. 

(b) Show that if kvo/mg < 1, then t m and x m can be expressed as 




(c) Show that the quantity kvo/mg is dimensionless. 

26. A body of mass m is projected vertically upward with an initial velocity v 0 in a medium 
offering a resistance k\v\, where k is a constant. Assume that the gravitational attraction 
of the earth is constant. 

(a) Find the velocity v(t) of the body at any time. 

(b) Use the result of part (a) to calculate the limit of v(t) as k — ► 0, that is, as the resistance 
approaches zero. Does this result agree with the velocity of a mass m projected upward 
with an initial velocity v 0 in a vacuum? 

(c) Use the result of part (a) to calculate the limit of v(t) as m — > 0, that is, as the mass 
approaches zero. 

27. A body falling in a relatively dense fluid, oil for example, is acted on by three forces 
(see Figure 2.3.5): a resistive force R, a buoyant force B , and its weight w due to gravity. 
The buoyant force is equal to the weight of the fluid displaced by the object. For a slowly 
moving spherical body of radius a , the resistive force is given by Stokes’s law, R = 67r/xa| u| , 
where v is the velocity of the body, and /r is the coefficient of viscosity of the surrounding 
fluid. 7 


R B 



w 


FIGURE 2.3.5 A body falling in a dense fluid. 


7 George Gabriel Stokes (1819-1903), professor at Cambridge, was one of the foremost applied mathe- 
maticians of the nineteenth century. The basic equations of fluid mechanics (the Navier-Stokes equations) 
are named partly in his honor, and one of the fundamental theorems of vector calculus bears his name. 
He was also one of the pioneers in the use of divergent (asymptotic) series, a subject of great interest and 
importance today. 
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(a) Find the limiting velocity of a solid sphere of radius a and density p falling freely in a 
medium of density p' and coefficient of viscosity fi. 

(b) In 1910 R. A. Millikan 8 studied the motion of tiny droplets of oil falling in an electric 
Held. A field of strength E exerts a force Ee on a droplet with charge e. Assume that E 
has been adjusted so the droplet is held stationary (v = 0) and that w and B are as given 
above. Find an expression for e. Millikan repeated this experiment many times, and from 
the data that he gathered he was able to deduce the charge on an electron. 

41 28. A mass of 0.25 kg is dropped from rest in a medium offering a resistance of 0.2 |v|, where 
v is measured in m/s. 

(a) If the mass is dropped from a height of 30 m, find its velocity when it hits the ground. 

(b) If the mass is to attain a velocity of no more than 10 m/s, find the maximum height 
from which it can be dropped. 

(c) Suppose that the resistive force is k\v\, where v is measured in m/s and k is a constant. 
If the mass is dropped from a height of 30 m and must hit the ground with a velocity of no 
more than 10 m/s, determine the coefficient of resistance k that is required. 

29. Suppose that a rocket is launched straight up from the surface of the earth with initial 
velocity vq = i/2gR, where R is the radius of the earth. Neglect air resistance. 

(a) Find an expression for the velocity v in terms of the distance x from the surface of the 
earth. 

(b) Find the time required for the rocket to go 240,000 mi (the approximate distance from 
the earth to the moon). Assume that R — 4000 mi. 

30. Let v (t) and w(t), respectively, be the horizontal and vertical components of the velocity 
of a batted (or thrown) baseball. In the absence of air resistance, v and w satisfy the 
equations 

dv/dt = 0, dw/dt = — g. 

(a) Show that 

v = u cos A, w = —gt + u sin A, 

where u is the initial speed of the ball and A is its initial angle of elevation. 

(b) Let x(t ) and y(t), respectively, be the horizontal and vertical coordinates of the ball at 
time t. If x(0) = 0 and y (0) = h, find x(t ) and y(t) at any time t. 

(c) Let g = 32 ft/s 2 , u = 125 ft/s, and h = 3 ft. Plot the trajectory of the ball for several 
values of the angle A; that is, plot x(t) and y(t) parametrically. 

(d) Suppose the outfield wall is at a distance L and has height H. Find a relation between 
u and A that must be satisfied if the ball is to clear the wall. 

(e) Suppose that L — 350 ft and H = 10 ft. Using the relation in part (d), find (or estimate 
from a plot) the range of values of A that correspond to an initial velocity of u = 110 ft/s. 

(f) For L = 350 and El = 10, find the minimum initial velocity u and the corresponding 
optimal angle A for which the ball will clear the wall. 

41 31. A more realistic model (than that in Problem 30) of a baseball in flight includes the effect 
of air resistance. In this case the equations of motion are 

dv/dt — —rv, dw/dt = — g — rw , 


8 Robert A. Millikan (1868-1953) was educated at Oberlin College and Columbia University. Later he 
was a professor at the University of Chicago and California Institute of Technology. His determination of 
the charge on an electron was published in 1910. For this work, and for other studies of the photoelectric 
effect, he was awarded the Nobel Prize in 1923. 
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where r is the coefficient of resistance. 

(a) Determine v(t) and w(t) in terms of initial speed u and initial angle of elevation A. 

(b) Find x(t) and y(t) if x(0) = 0 and y(0) = h. 

(c) Plot the trajectory of the ball for r = 1/5, u = 125,/; = 3, and for several values of A. 
How do the trajectories differ from those in Problem 31 with r = 0? 

(d) Assuming that r = 1/5 and h = 3, find the minimum initial velocity u and the optimal 
angle A for which the ball will clear a wall that is 350 ft distant and 10 ft high. Compare 
this result with that in Problem 30(f). 

32. Brachistochrone Problem. One of the famous problems in the history of mathematics is 
the brachistochrone 9 problem: to find the curve along which a particle will slide without 
friction in the minimum time from one given point P to another Q , the second point being 
lower than the first but not directly beneath it (see Figure 2.3.6). This problem was posed 
by Johann Bernoulli in 1696 as a challenge problem to the mathematicians of his day. 
Correct solutions were found by Johann Bernoulli and his brother Jakob Bernoulli and 
by Isaac Newton, Gottfried Leibniz, and the Marquis de L’Hospital. The brachistochrone 
problem is important in the development of mathematics as one of the forerunners of the 
calculus of variations. 

In solving this problem, it is convenient to take the origin as the upper point P and to 
orient the axes as shown in Figure 2.3.6. The lower point Q has coordinates (jko.J'o)- It is 
then possible to show that the curve of minimum time is given by a function y = <p(x) that 
satisfies the differential equation 


(1 +y 2 )y = k 2 . 


where k 2 is a certain positive constant to be determined later. 


(i) 



FIGURE 2.3.6 The brachistochrone. 


(a) Solve Eq. (i) for yf . Why is it necessary to choose the positive square root? 

(b) Introduce the new variable t by the relation 

y = k 2 sin 2 t. (ii) 

Show that the equation found in part (a) then takes the form 

2 k 2 sin 2 t dt = dx. (iii) 


9 The word “brachistochrone” comes from the Greek words brachistos , meaning shortest, and chronos , 
meaning time. 
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(c) Letting 9 = 2 t, show that the solution of Eq. (iii) for which x = 0 when y = 0 is given 
by 

x = k 2 (d — sin0)/2, y = k 2 (\ — cos(?)/2. (iv) 

Equations (iv) are parametric equations of the solution of Eq. (i) that passes through 
(0, 0). The graph of Eqs. (iv) is called a cycloid. 

(d) If we make a proper choice of the constant k , then the cycloid also passes through 
the point ( xo,yo ) and is the solution of the brachistochrone problem. Find k if.ro = 1 and 
yo = 2. 


2.4 Differences Between Linear and Nonlinear Equations 


Up to now, we have been primarily concerned with showing that first order differ- 
ential equations can be used to investigate many different kinds of problems in the 
natural sciences, and with presenting methods of solving such equations if they are 
either linear or separable. Now it is time to turn our attention to some more general 
questions about differential equations and to explore in more detail some important 
ways in which nonlinear equations differ from linear ones. 

Existence and Uniqueness of Solutions. So far, we have discussed a number of initial value 
problems, each of which had a solution and apparently only one solution. This raises 
the question of whether this is true of all initial value problems for first order equa- 
tions. In other words, does every initial value problem have exactly one solution? 
This may be an important question even for nonmathematicians. If you encounter 
an initial value problem in the course of investigating some physical problem, you 
might want to know that it has a solution before spending very much time and effort 
in trying to find it. Further, if you are successful in finding one solution, you might 
be interested in knowing whether you should continue a search for other possible 
solutions or whether you can be sure that there are no other solutions. For linear 
equations the answers to these questions are given by the following fundamental 
theorem. 


Theorem 2.4.1 If the functions p and g are continuous on an open interval I:a < t < /3 con- 
taining the point t = to, then there exists a unique function y — <j>(t) that satisfies 
the differential equation 

/ + P(t)y = g(t) (1) 

for each t in /, and that also satisfies the initial condition 

y(fo) = yo, (2) 

where yo is an arbitrary prescribed initial value. 


Observe that Theorem 2.4. 1 states that the given initial value problem has a solution 
and also that the problem has only one solution. In other words, the theorem asserts 
both the existence and uniqueness of the solution of the initial value problem (1), (2). 
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In addition, it states that the solution exists throughout any interval I containing the 
initial point to in which the coefficients p and g are continuous. That is, the solution 
can be discontinuous or fail to exist only at points where at least one of p and g is 
discontinuous. Such points can often be identified at a glance. 

The proof of this theorem is partly contained in the discussion in Section 2.1 leading 
to the formula [Eq. (32) in Section 2.1] 

K t )y= J iMt)g(t) dt + c, (3) 

where [Eq. (30) in Section 2.1] 

pit ) = exp J p(t)dt. (4) 

The derivation in Section 2.1 shows that if Eq. (1) has a solution, then it must be 
given by Eq. (3). By looking slightly more closely at that derivation, we can also 
conclude that the differential equation (1) must indeed have a solution. Since p is 
continuous for a < t < ft, it follows that p is defined in this interval and is a nonzero 
differentiable function. Upon multiplying Eq. (1) by pit), we obtain 

[m(0xT = H(t)g(t). (5) 


Since both p and g are continuous, the function /ig is integrable, and Eq. (3) follows 
from Eq. (5). Further, the integral of pg is differentiable, so y as given by Eq. (3) 
exists and is differentiable throughout the interval a < t < By substituting the 
expression for y from Eq. (3) into either Eq. (1) or Eq. (5), you can verify that 
this expression satisfies the differential equation throughout the interval a < t < 
Finally, the initial condition (2) determines the constant c uniquely, so there is only 
one solution of the initial value problem; this completes the proof. 

Equation (4) determines the integrating factor pit) only up to a multiplicative 
factor that depends on the lower limit of integration. If we choose this lower limit to 
be t 0 , then 

pit) = exp / pis) ds, (6) 

J to 


and it follows that p(to) — 1. Using the integrating factor given by Eq. (6), and 
choosing the lower limit of integration in Eq. (3) also to be to, we obtain the general 
solution of Eq. (1) in the form 


y = 


1 

iMt) 


f 

J to 


p(s)gis) 


ds + c 


( 7 ) 


To satisfy the initial condition (2), we must choose c = yo- Thus the solution of the 
initial value problem (1), (2) is 


1 

pit) 


f 

J to 


v(s)g(s) 


ds + y 0 , 


( 8 ) 


where p{t) is given by Eq. (6). 

Turning now to nonlinear differential equations, we must replace Theorem 2.4.1 
by a more general theorem, such as the following. 
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Theorem 2.4.2 


EXAMPLE 

1 


Let the functions / and df/dy be continuous in some rectangle a < t < /3, 
y < y < S containing the point (7o,yo). Then, in some interval to — h < t < to + h 
contained in a < t < there is a unique solution y = <j>(t) of the initial value prob- 
lem 

y'=f(t,y), y(t 0 )=y 0 . (9) 


Observe that the hypotheses in Theorem 2.4.2 reduce to those in Theorem 2.4.1 if 
the differential equation is linear. For then f(t,y) = — p(t)y + g(t) and df(t,y)/dy = 
—p(t), so the continuity of / and df/dy is equivalent to the continuity of p and g in 
this case. The proof of Theorem 2.4.1 was comparatively simple because it could be 
based on the expression (3) that gives the solution of an arbitrary linear equation. 
There is no corresponding expression for the solution of the differential equation (9), 
so the proof of Theorem 2.4.2 is much more difficult. It is discussed to some extent in 
Section 2.8 and in greater depth in more advanced books on differential equations. 

Here we note that the conditions stated in Theorem 2.4.2 are sufficient to guarantee 
the existence of a unique solution of the initial value problem (9) in some interval 
t 0 — h < t < t 0 + h, but they are not necessary. That is, the conclusion remains true 
under slightly weaker hypotheses about the function /. In fact, the existence of a 
solution (but not its uniqueness) can be established on the basis of the continuity of 
/ alone. 

An important geome trical consequence of the uniqueness parts of Theorems 2.4.1 
and 2.4.2 is that the graphs of two solutions cannot intersect each other. Otherwise, 
there would be two solutions that satisfy the initial condition corresponding to the 
point of intersection, in contradiction to Theorem 2.4.1 or 2.4.2. 

We now consider some examples. 


Use Theorem 2.4.1 to find an interval in which the initial value problem 

ty'+2y = 4t 2 , (10) 

yd) = 2 (11) 

has a unique solution. 

Rewriting Eq. (10) in the standard form (1), we have 

/ + (2/t)y = 4f, 

so p(t) = 2/fandg(f) = 4 1. Thus, for this equation, g is continuous for all?, while pis continuous 
only for t < 0 or for t > 0. The interval t > 0 contains the initial point; consequently, Theorem 
2.4.1 guarantees that the problem (10), (11) has a unique solution on the interval 0 < t < oo. 
In Example 3 of Section 2.1 we found the solution of this initial value problem to be 

y = f 2 +^, t > 0. (12) 

Now suppose that the initial condition (11) is changed to y(— 1) = 2. Then Theorem 2.4.1 

asserts the existence of a unique solution for t < 0. As you can readily verify, the solution is 

again given by Eq. (12), but now on the interval — oo < t < 0. 
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EXAMPLE 

2 


EXAMPLE 

3 


Apply Theorem 2.4.2 to the initial value problem 

dy 3x 2 + 4x + 2 
dx 2(y — 1) 


y(0) = -l. 


(13) 


Note that Theorem 2.4.1 is not applicable to this problem since the differential equation is 
nonlinear. To apply Theorem 2.4.2, observe that 


fix.y) 


3x 2 + 4x + 2 

2(y - 1) 


3 /, . 

■5- ( x ,y) 
3 y 


3xr + 4x + 2 

2 (y - l) 2 


Thus each of these functions is continuous everywhere except on the line y — 1. Consequently, 
a rectangle can be drawn about the initial point (0, —1) in which both/ and 9 // dy are continu- 
ous. Therefore Theorem 2.4.2 guarantees that the initial value problem has a unique solution in 
some interval about x — 0. However, even though the rectangle can be stretched infinitely far 
in both the positive and negative x directions, this does not necessarily mean that the solution 
exists for all x. Indeed, the initial value problem (13) was solved in Example 2 of Section 2.2 
and the solution exists only for x > —2. 

Now suppose we change the initial condition to y(0) = 1. The initial point now lies on 
the line y = 1 so no rectangle can be drawn about it, within which / and 9 / /dy are contin- 
uous. Consequently, Theorem 2.4.2 says nothing about possible solutions of this modified 
problem. However, if we separate the variables and integrate, as in Section 2.2, we find 
that 

y 2 — 2y = x 3 + 2x 2 + 2 x + c. 


Further, if x = 0 and y = 1, then c = — 1. Finally, by solving for y , we obtain 

y = 1 ± 7*3+ 2x 2 + 2x. 


(14) 


Equation (14) provides two functions that satisfy the given differential equation for* > 0 and 
also satisfy the initial condition y(0) = 1. 


Consider the initial value problem 

/ = y 1/3 , y(0) = 0 (15) 

for t > 0. Apply Theorem 2.4.2 to this initial value problem and then solve the problem. 

The function f(t.y) = y 1/3 is continuous everywhere, but 9 / /dy does not exist when y = 0, 
and hence is not continuous there. Thus Theorem 2.4.2 does not apply to this problem and 
no conclusion can be drawn from it. However, by the remark following Theorem 2.4.2 the 
continuity off does ensure the existence of solutions, but not their uniqueness. 

To understand the situation more clearly, we must actually solve the problem, which is easy 
to do since the differential equation is separable. Thus we have 

y~ 1/3 dy = dt, 


so 


and 


ly 2 ' 3 = t + c 

y — [§( r + c )] 3/2 • 


The initial condition is satisfied if c = 0, so 

y = Mt) = (|f) 3/2 , t> 0 


( 16 ) 
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satisfies both of Eqs. (15). On the other hand, the function 

y = <fc(0 = - (| t) 3/2 , t > o 

is also a solution of the initial value problem. Moreover, the function 

y = i/r(f) = 0, t > 0 

is yet another solution. Indeed, for an arbitrary positive to, the functions 

[0, if 0 < t < t 0 , 

l±[=(f-t 0 )] 3/2 , if t > t 0 


y = x(0 = \ , r->,. , ,-,3/2 


(17) 

(18) 

(19) 


are continuous, differentiable (in particular at t — to), and are solutions of the initial value 
problem (15). Hence this problem has an infinite family of solutions; see Figure 2.4.1, where 
a few of these solutions are shown. 



FIGURE 2.4.1 Several solutions of the initial value problem y' = y 1/3 , y(0) = 0. 

As already noted, the nonuniqueness of the solutions of the problem (15) does not contradict 
the existence and uniqueness theorem, since the theorem is not applicable if the initial point 
lies on the f-axis. If ( t 0 , yo) is any point not on the f-axis, however, then the theorem guarantees 
that there is a unique solution of the differential equation y' = y 1/3 passing through (fo,yo). 


Interval of Definition. According to Theorem 2.4.1, the solution of a linear equation (1) 

y' +p(t)y = g(t), 

subject to the initial condition y(t 0 ) — >'o, exists throughout any interval about t = t 0 
in which the functions p and g are continuous. Thus, vertical asymptotes or other 
discontinuities in the solution can occur only at points of discontinuity of p or g. For 
instance, the solutions in Example 1 (with one exception) are asymptotic to the y- 
axis, corresponding to the discontinuity at t = 0 in the coefficient pit) = 2/t, but none 
of the solutions has any other point where it fails to exist and to be differentiable. 
The one exceptional solution shows that solutions may sometimes remain continuous 
even at points of discontinuity of the coefficients. 
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On the other hand, for a nonlinear initial value problem satisfying the hypotheses 
of Theorem 2.4.2, the interval in which a solution exists may be difficult to determine. 
The solution y = 0(0 is certain to exist as long as the point [f, (pit)] remains within a 
region in which the hypotheses of Theorem 2.4.2 are satisfied. This is what determines 
the value of h in that theorem. However, since </> (/) is usually not known, it may be 
impossible to locate the point [f, </>(/)] with respect to this region. In any case, the 
interval in which a solution exists may have no simple relationship to the function / 
in the differential equation / = f(t,y). This is illustrated by the following example. 


EXAMPLE 

4 


Solve the initial value problem 

y = y 2 , y( 0) = 1, (20) 

and determine the interval in which the solution exists. 

Theorem 2.4.2 guarantees that this problem has a unique solution since f(t,y) = y 2 and 
df/dy = 2 y are continuous everywhere. To find the solution, we separate the variables and 
integrate with the result that 

y~ 2 dy — dt (21) 


Then, solving for y, we have 


1 


y = - 


f Tc 


To satisfy the initial condition, we must choose c = —1, so 

1 


( 22 ) 


(23) 


is the solution of the given initial value problem. Clearly, the solution becomes unbounded as 
t -> 1; therefore, the solution exists only in the interval — oo < t < 1. There is no indication 
from the differential equation itself, however, that the point t = 1 is in any way remarkable. 
Moreover, if the initial condition is replaced by 


y(0) = y 0 . 


(24) 


then the constant c in Eq. (22) must be chosen to be c = — l/y 0 , and it follows that 

yo 

y= T^Wt 


(25) 


is the solution of the initial value problem with the initial condition (24). Observe that the 
solution (25) becomes unbounded as t — > 1/yo, so the interval of existence of the solution is 
— oo < t < l/y 0 if yo > 0, and is 1 /y 0 < t < oo if y 0 < 0. This example illustrates another fea- 
ture of initial value problems for nonlinear equations; namely, the singularities of the solution 
may depend in an essential way on the initial conditions as well as on the differential equation. 


General Solution. Another way in which linear and nonlinear equations differ concerns 
the concept of a general solution. For a first order linear equation it is possible to 
obtain a solution containing one arbitrary constant, from which all possible solutions 
follow by specifying values for this constant. For nonlinear equations this may not be 
the case; even though a solution containing an arbitrary constant may be found, there 
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may be other solutions that cannot be obtained by giving values to this constant. For 
instance, for the differential equation y' = y 2 in Example 4, the expression in Eq. (22) 
contains an arbitrary constant, but does not include all solutions of the differential 
equation. To show this, observe that the function y = 0 for all t is certainly a solution of 
the differential equation, but it cannot be obtained from Eq. (22) by assigning a value 
to c. In this example we might anticipate that something of this sort might happen 
because to rewrite the original differential equation in the form (21), we must require 
that y is not zero. Flowever, the existence of “additional” solutions is not uncommon 
for nonlinear equations; a less obvious example is given in Problem 22. Thus we will 
use the term “general solution” only when discussing linear equations. 

Implicit Solutions. Recall again that, for an initial value problem for a first order linear 
equation, Eq. (8) provides an explicit formula for the solution y = <p(t). As long as 
the necessary antiderivatives can be found, the value of the solution at any point can 
be determined merely by substituting the appropriate value of t into the equation. 
The situation for nonlinear equations is much less satisfactory. Usually, the best that 
we can hope for is to find an equation 

F(t,y) = 0 (26) 

involving t and y that is satisfied by the solution y = <p{t). Even this can be done only 
for differential equations of certain particular types, of which separable equations 
are the most important. The equation (26) is called an integral, or first integral, of the 
differential equation, and (as we have already noted) its graph is an integral curve, or 
perhaps a family of integral curves. Equation (26), assuming it can be found, defines 
the solution implicitly; that is, for each value of t we must solve Eq. (26) to find the 
corresponding value of y. If Eq. (26) is simple enough, it may be possible to solve 
it for y by analytical means and thereby obtain an explicit formula for the solution. 
However, more frequently this will not be possible, and you will have to resort to a 
numerical calculation to determine (approximately) the value of y for a given value 
of t. Once several pairs of values of f and v have been calculated, it is often helpful 
to plot them and then to sketch the integral curve that passes through them. You 
should arrange for a computer to do this for you, if possible. 

Examples 2, 3, and 4 are nonlinear problems in which it is easy to solve for an 
explicit formula for the solution y = On the other hand, Examples 1 and 3 in 
Section 2.2 are cases in which it is better to leave the solution in implicit form, and to 
use numerical means to evaluate it for particular values of the independent variable. 
The latter situation is more typical; unless the implicit relation is quadratic in y, or 
has some other particularly simple form, it is unlikely that it can be solved exactly 
by analytical methods. Indeed, more often than not, it is impossible even to find an 
implicit expression for the solution of a first order nonlinear equation. 

Graphical or Numerical Construction of Integral Curves. Because of the difficulty in obtain- 
ing exact analytical solutions of nonlinear differential equations, methods that yield 
approximate solutions or other qualitative information about solutions are of cor- 
respondingly greater importance. We have already described, in Section 1.1, how 
the direction field of a differential equation can be constructed. The direction field 
can often show the qualitative form of solutions and can also be helpful in identi- 
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PROBLEMS 


fying regions of the fy-plane where solutions exhibit interesting features that merit 
more detailed analytical or numerical investigation. Graphical methods for first or- 
der equations are discussed further in Section 2.5. An introduction to numerical 
methods for first order equations is given in Section 2.7, and a systematic discussion 
of numerical methods appears in Chapter 8. However, it is not necessary to study 
the numerical algorithms themselves in order to use effectively one of the many 
software packages that generate and plot numerical approximations to solutions of 
initial value problems. 

Summary. The linear equation y' + p(t)y = g(t ) has several nice properties that can 
be summarized in the following statements: 

1. Assuming that the coefficients are continuous, there is a general solution, containing an 
arbitrary constant, that includes all solutions of the differential equation. A particular 
solution that satisfies a given initial condition can be picked out by choosing the proper 
value for the arbitrary constant. 

2. There is an expression for the solution, namely, Eq. (7) or Eq. (8). Moreover, although it 
involves two integrations, the expression is an explicit one for the solution y = 4>(t) rather 
than an equation that defines (j> implicitly. 

3. The possible points of discontinuity, or singularities, of the solution can be identified (with- 
out solving the problem) merely by finding the points of discontinuity of the coefficients. 
Thus, if the coefficients are continuous for all t, then the solution also exists and is differ- 
entiable for all t. 

None of these statements is true, in general, of nonlinear equations. Although a 
nonlinear equation may well have a solution involving an arbitrary constant, there 
may also be other solutions. There is no general formula for solutions of nonlinear 
equations. If you are able to integrate a nonlinear equation, you are likely to obtain an 
equation defining solutions implicitly rather than explicitly. Finally, the singularities 
of solutions of nonlinear equations can usually be found only by solving the equation 
and examining the solution. It is likely that the singularities will depend on the initial 
condition as well as the differential equation. 


In each of Problems 1 through 6 determine (without solving the problem) an interval in which 
the solution of the given initial value problem is certain to exist. 

1. (f - 3)/ + (In t)y = 2t, y( 1) = 2 

2. t(t - 4)/ + y = 0, y( 2) = 1 

3. y + (tan f)y = sin t, y(: r) = 0 4. (4 — t 2 )y' + 2ty = 3f 2 , y(— 3) = 1 

5. (4 — t 2 )y' + 2ty = 3t 2 , jv(l) = — 3 6. (In t)y' + y = cot t, y(2) = 3 

In each of Problems 7 through 12 state where in the fy-plane the hypotheses of Theorem 2.4.2 
are satisfied. 

7. / = f ~ y 8. y' = (1 — t 2 - y 2 ) 1/2 

In Kyi 

1 - t 2 + y 2 


9. / = 


10. y’ = (t 2 + y 2 ) 3 / 2 
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r| dy _ 1 + t 2 dy _ (cot t)y 

dt 3y — y 2 dt 1 + y 

In each of Problems 13 through 16 solve the given initial value problem and determine how 
the interval in which the solution exists depends on the initial value yo. 

13. y = -4 t/y, y(0) = y 0 14. / = 2 tv 2 , y( 0) = y 0 

15. y' + y 3 = 0, y(0) = y 0 16. / = t 2 /y( 1 + t 3 ), y(0) = y 0 


In each of Problems 17 through 20 draw a direction field and plot (or sketch) several solutions 
of the given differential equation. Describe how solutions appear to behave as t increases and 
how their behavior depends on the initial value yo when t = 0. 

41 17. / = ty(3 -y) 41 18. y' = >-(3 - ty) 

41 19. y = -3,(3 _ ty) 41 20. y = f-l-y 2 

21. Consider the initial value problem y' = y 1/3 ,y(0) = 0 from Example 3 in the text. 

(a) Is there a solution that passes through the point (1, 1)? If so, find it. 

(b) Is there a solution that passes through the point (2, 1)? If so, find it. 

(c) Consider all possible solutions of the given initial value problem. Determine the set 
of values that these solutions have at t = 2. 

22. (a) Verify that both yi(t) = 1 — t and y 2 (f) = — t 2 / 4 are solutions of the initial value 
problem 

— t + (t 2 + 4y) 1/2 

y= ,, . y(2) = -l. 


Where are these solutions valid? 

(b) Explain why the existence of two solutions of the given problem does not contradict 
the uniqueness part of Theorem 2.4.2. 

(c) Show that y = ct + c 2 , where c is an arbitrary constant, satisfies the differential equa- 
tion in part (a) for t > —2c. If c = —1, the initial condition is also satisfied, and the 
solution y = y 1 ( t ) is obtained. Show that there is no choice of c that gives the second 
solution y = y 2 ( 0 - 

23. (a) Show that cf>(t ) = e 2 ' is a solution of y' — 2y = 0 and that y = ccj)(t) is also a solution 
of this equation for any value of the constant c. 

(b) Show that 4>(t) = 1/f is a solution of y' + y 2 = 0 for t > 0 but that y = c(f>(t) is not 
a solution of this equation unless c = 0 or c = 1. Note that the equation of part (b) is 
nonlinear, while that of part (a) is linear. 

24. Show that if y = 4>(t ) is a solution of y' + p(t)y = 0, then y = c<p(t) is also a solution for 
any value of the constant c. 

25. Let y = yi(f) be a solution of 

y' + p(t)y = 0, (i) 


and let y = y 2 (t) be a solution of 

y' + P(t)y = g(t). (ii) 

Show that y = yi(t) + y 2 (t) is also a solution of Eq. (ii). 

26. (a) Show that the solution (7) of the general linear equation (1) can be written in the 
form 

y = cyi(t) + y 2 (t), (i) 


where c is an arbitrary constant. Identify the functions yi and y 2 - 
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(b) Show that y i is a solution of the differential equation 

y'+p(t)y = 0 , (ii) 


corresponding to g(t) = 0. 

(c) Show that y 2 is a solution of the full linear equation (1). We see later (for example, 
in Section 3.5) that solutions of higher order linear equations have a pattern similar to 
Eq. (i). 

Bernoulli Equations. Sometimes it is possible to solve a nonlinear equation by making a 
change of the dependent variable that converts it into a linear equation. The most important 
such equation has the form 

y' + p(t)y = q(t)y n , 

and is called a Bernoulli equation after Jakob Bernoulli. Problems 27 through 31 deal with 
equations of this type. 

27. (a) Solve Bernoulli’s equation when n = 0; when n = 1. 

(b) Show that if n ^ 0, 1, then the substitution v = y 1_ ” reduces Bernoulli’s equation to a 
linear equation. This method of solution was found by Leibniz in 1696. 

In each of Problems 28 through 31 the given equation is a Bernoulli equation. In each case 
solve it by using the substitution mentioned in Problem 27(b). 

28. f 2 y' + 2fy - y 3 = 0, t > 0 

29. y' = ry — ky 2 , r > 0 and k > 0. This equation is important in population dynamics and is 
discussed in detail in Section 2.5. 

30. y' = ey — cry 3 , e > 0 and a > 0. This equation occurs in the study of the stability of fluid 
flow. 

31. dy/dt = (T cos t + T)y — y 3 , where T and T are constants. This equation also occurs in 
the study of the stability of fluid flow. 

Discontinuous Coefficients. Linear differential equations sometimes occur in which one or 
both of the functions p and g have jump discontinuities. If to is such a point of discontinuity, 
then it is necessary to solve the equation separately for t < t 0 and t > t 0 . Afterward, the two 
solutions are matched so that y is continuous at t 0 ; this is accomplished by a proper choice of 
the arbitrary constants. The following two problems illustrate this situation. Note in each case 
that it is impossible also to make y' continuous at t 0 . 

32. Solve the initial value problem 

y' + 2 y = gif), y(0) = 0, 


where 



0 < t < 1 , 
t > 1 . 


33. Solve the initial value problem 


y' + p{t)y = 0, y(0) = 1, 


Pit) = 


0 < t < 1 , 
t > 1 . 


where 
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2.5 Autonomous Equations and Population Dynamics 


An important class of first order equations consists of those in which the independent 
variable does not appear explicitly. Such equations are called autonomous and have 
the form 

dy/dt = f(y). (1) 

We will discuss these equations in the context of the growth or decline of the popula- 
tion of a given species, an important issue in fields ranging from medicine to ecology 
to global economics. A number of other applications are mentioned in some of the 
problems. Recall that in Sections 1.1 and 1.2 we considered the special case of Eq. (1) 
in which /(y) = ay + b. 

Equation (1) is separable, so the discussion in Section 2.2 is applicable to it, but 
the main purpose of this section is to show how geometrical methods can be used 
to obtain important qualitative information directly from the differential equation, 
without solving the equation. Of fundamental importance in this effort are the 
concepts of stability and instability of solutions of differential equations. These ideas 
were introduced informally in Chapter 1, but without using this terminology. They 
are discussed further here and will be examined in greater depth and in a more 
general setting in Chapter 9. 

Exponential Growth. Let y = (pit) be the population of the given species at time t. The 
simplest hypothesis concerning the variation of population is that the rate of change 
of y is proportional 10 to the current value of y; that is, 

dy/dt = ry, ( 2 ) 

where the constant of proportionality r is called the rate of growth or decline, de- 
pending on whether it is positive or negative. Here, we assume that r > 0, so the 
population is growing. 

Solving Eq. (2) subject to the initial condition 

y(°) = yo , (3) 

we obtain 

y = y^ rt . (4) 

Thus the mathematical model consisting of the initial value problem (2), (3) with 
r > 0 predicts that the population will grow exponentially for all time, as shown 
in Figure 2.5.1 for several values of yo- Under ideal conditions, Eq. (4) has been 
observed to be reasonably accurate for many populations, at least for limited periods 
of time. However, it is clear that such ideal conditions cannot continue indefinitely; 
eventually, limitations on space, food supply, or other resources will reduce the growth 
rate and bring an end to uninhibited exponential growth. 


10 It was apparently the British economist Thomas Malthus (1766-1834) who first observed that many 
biological populations increase at a rate proportional to the population. His first paper on populations 
appeared in 1798. 
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FIGURE 2.5.1 Exponential growth: y versus t for dy/dt — ry. 


Logistic Growth. To take account of the fact that the growth rate actually depends on 
the population, we replace the constant r in Eq. (2) by a function h{y) and thereby 
obtain the modified equation 

dy/dt = h(y)y. (5) 

We now want to choose h(y) so that h(y) = r > 0 when y is small, h{y) decreases as 
y grows larger, and My) < 0 when y is sufficiently large. The simplest function that 
has these properties is h(y) = r — ay, where a is also a positive constant. Using this 
function in Eq. (5), we obtain 


dy/dt = (r — ay)y. (6) 

Equation (6) is known as the Verhulst 11 equation or the logistic equation. It is often 
convenient to write the logistic equation in the equivalent form 



where K = r/a. The constant r is called the intrinsic growth rate, that is, the growth 
rate in the absence of any limiting factors. The interpretation of K will become clear 
shortly. 

We will investigate the solutions of Eq. (7) in some detail later in this section. 
Before doing that, however, we will show how you can easily draw a qualitatively 
correct sketch of the solutions. The same methods also apply to the more general 
Eq. (1). 


U P. F. Verhulst (1804-1849) was a Belgian mathematician who introduced Eq. (6) as a model for human 
population growth in 1838. Fie referred to it as logistic growth; hence Eq. (6) is often called the logistic 
equation. Fie was unable to test the accuracy of his model because of inadequate census data, and it did 
not receive much attention until many years later. Reasonable agreement with experimental data was 
demonstrated by R. Pearl (1930) for Drosophila melanogaster (fruit fly) populations and by G. F. Cause 
(1935) for Paramecium and Tribolium (flour beetle) populations. 


80 


Chapter 2. First Order Differential Equations 


We first seek solutions of Eq. (7) of the simplest possible type, that is, constant 
functions. For such a solution dy/dt = 0 for all t , so any constant solution of Eq. (7) 
must satisfy the algebraic equation 

r(l-y/K)y = 0. 

Thus the constant solutions are y = 0i(f) — 0 and y = <p 2 (t) — K. These solutions 
are called equilibrium solutions of Eq. (7) because they correspond to no change or 
variation in the value of y as t increases. In the same way, any equilibrium solutions 
of the more general Eq. (1) can be found by locating the roots of f(y) = 0. The zeros 
of fly) are also called critical points. 

To visualize other solutions of Eq. (7) and to sketch their graphs quickly, we start 
by drawing the graph of f(y) versus y. In the case of Eq. (7),/(y) — r(l — y/K)y, 
so the graph is the parabola shown in Figure 2.5.2. The intercepts are (0,0) and 
( K , 0), corresponding to the critical points of Eq. (7), and the vertex of the parabola 
is (K/2,rK/4). Observe that dy/dt > 0 for 0 < y < K; therefore, y is an increasing 
function of t when y is in this interval; this is indicated by the rightward-pointing 
arrows near the y-axis in Figure 2.5.2. Similarly, ify > K. then dy/dt < 0; hence y is 
decreasing, as indicated by the leftward-pointing arrow in Figure 2.5.2. 



In this context the y-axis is often called the phase line, and it is reproduced in its 
more customary vertical orientation in Figure 2.53a. The dots at y = 0 and y = K 
are the critical points, or equilibrium solutions. The arrows again indicate that y is 
increasing whenever 0 < y < K and that y is decreasing whenever y > K. 

Further, from Figure 2.5.2, note that if y is near zero or K , then the slope f(y) is 
near zero, so the solution curves are relatively flat. They become steeper as the value 
of y leaves the neighborhood of zero or K. 

To sketch the graphs of solutions of Eq. (7) in the ty-plane, we start with the equi- 
librium solutions y = 0 and y = K\ then we draw other curves that are increasing 
when 0 < y < K. decreasing when y > K. and flatten out as y approaches either of 
the values 0 or K. Thus the graphs of solutions of Eq. (7) must have the general 
shape shown in Figure 2.53b. regardless of the values of r and K. 
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(a) 

FIGURE 2.5.3 

versus t. 



Logistic growth: dy/dt = r(l — y/K)y. (a) The phase line, (b) Plots of y 


Figure 2.5.3 b may seem to show that other solutions intersect the equilibrium 
solution y = K , but is this really possible? No, the uniqueness part of Theorem 2.4.2, 
the fundamental existence and uniqueness theorem, states that only one solution can 
pass through a given point in the fy -plane. Thus, although other solutions may be 
asymptotic to the equilibrium solution as t -» oo, they cannot intersect it at any finite 
time. 

To carry the investigation one step further, we can determine the concavity of the 
solution curves and the location of inflection points by finding dry /dt 2 . From the 
differential equation (1) we obtain (using the chain rule) 


d 2 y d dy 
dt 2 dt dt 


j r f(y) =f ( y)-j t = /'Cy)/(y)- 


( 8 ) 


The graph of y versus t is concave up when y" > 0, that is, when / and f have the 
same sign. Similarly, it is concave down when y" < 0, which occurs when / and f 
have opposite signs. The signs of / and /' can be easily identified from the graph of 
f(y) versus y. Inflection points may occur when f'(y) = 0. 

In the case of Eq. (7), solutions are concave up for 0 < y < K /2 where / is positive 
and increasing (see Figure 2.5.2), so that both / and/' are positive. Solutions are 
also concave up for y > K where / is negative and decreasing (both / and /' are 
negative). For K/2 < y < K, solutions are concave down since here / is positive and 
decreasing, so / is positive but f is negative. There is an inflection point whenever 
the graph of y versus t crosses the line y — K/2. The graphs in Figure 2.5.3 b exhibit 
these properties. 

Finally, observe that K is the upper bound that is approached, but not exceeded, 
by growing populations starting below this value. Thus it is natural to refer to K as 
the saturation level, or the environmental carrying capacity, for the given species. 

A comparison of Figures 2.5.1 and 2.5.3 b reveals that solutions of the nonlinear 
equation (7) are strikingly different from those of the linear equation (1), at least 
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for large values of t. Regardless of the value of K , that is, no matter how small the 
nonlinear term in Eq. (7), solutions of that equation approach a finite value as t — > oo, 
whereas solutions of Eq. (1) grow (exponentially) without bound as t — »• oo. Thus 
even a tiny nonlinear term in the differential equation (7) has a decisive effect on 
the solution for large t. 

In many situations it is sufficient to have the qualitative information about a solu- 
tion y — (p (t) of Eq. (7) that is shown in Figure 2.5.3 b. This information was obtained 
entirely from the graph of f(y) versus y, and without solving the differential equation 
(7). Flowever, if we wish to have a more detailed description of logistic growth — for 
example, if we wish to know the value of the population at some particular time — 
then we must solve Eq. (7) subject to the initial condition (3). Provided that y / 0 
and y ^ K, we can write Eq. (7) in the form 


dy 

(1 ~y/K)y 


= rdt. 


Using a partial fraction expansion on the left side, we have 


1 

y 


VK \ 
i- y/K ) 


dy = r dt. 


Then, by integrating both sides, we obtain 


In |y| — In 1 


y_ 

K 


rt + c. 


( 9 ) 


where c is an arbitrary constant of integration to be determined from the initial 
condition y(0) = y 0 . We have already noted that if 0 < yo < K, then y remains in 
this interval for all time. Thus in this case we can remove the absolute value bars in 
Eq. (9), and by taking the exponential of both sides, we find that 


y 

1 - (y/K) 


= Ce rt , 


( 10 ) 


where C = e c . In order to satisfy the initial condition y(0) = y 0 , we must choose 
C = yo/[l — (yo/K)]- Using this value for C in Eq. (10) and solving for y, we obtain 


y 0 K 

y 0 + (K- y 0 )e~ rl ' 


( 11 ) 


We have derived the solution (11) under the assumption that 0 < y 0 < K. If 
yo > K, then the details of dealing with Eq. (9) are only slightly different, and we 
leave it to you to show that Eq. (11) is also valid in this case. Finally, note that 
Eq. (11) also contains the equilibrium solutions y = (pi (f) = 0 and y = 02(f) = K 
corresponding to the initial conditions yo = 0 and yo = K , respectively. 

All the qualitative conclusions that we reached earlier by geometrical reason- 
ing can be confirmed by examining the solution (11). In particular, if yo = 0, then 
Eq. (11) requires that y(t) = 0 for all t. If yo > 0, and if we let f -> oo in Eq. (11), 
then we obtain 

lim y(t) = y 0 K/y 0 = K. 


Thus, for each yo > 0, the solution approaches the equilibrium solution y = 02(f) = K 
asymptotically as f — > oo. Therefore we say that the constant solution 02(f) = K is 
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an asymptotically stable solution of Eq. (7) or that the point y = K is an asymp- 
totically stable equilibrium or critical point. After a long time, the population is 
close to the saturation level K regardless of the initial population size, as long as 
it is positive. Other solutions approach the equilibrium solution more rapidly as r 
increases. 

On the other hand, the situation for the equilibrium solution y = <j>\ (t) = 0 is quite 
different. Even solutions that start very near zero grow as t increases and, as we 
have seen, approach K as t — > oo. We say that (j>\ (t) = 0 is an unstable equilibrium 
solution or that y = 0 is an unstable equilibrium or critical point. This means that the 
only way to guarantee that the solution remains near zero is to make sure its initial 
value is exactly equal to zero. 


EXAMPLE 

1 


The logistic model has been applied to the natural growth of the halibut population in certain 
areas of the Pacific Ocean. 12 Let y , measured in kilograms, be the total mass, or biomass, of 
the halibut population at time t. The parameters in the logistic equation are estimated to have 
the values r = 0.71/year and K = 80.5 x 10 6 kg. If the initial biomass is yo = 0.25 K, find the 
biomass 2 years later. Also find the time r for which y(r) = 0.75 K. 

It is convenient to scale the solution (11) to the carrying capacity K\ thus we write Eq. (11) 
in the form 


y_ = yo/ K 

K (y 0 /K) + [1 - (y 0 /K)]e- n ' 


( 12 ) 


Using the data given in the problem, we find that 


y( 2) 
K 


0.25 

0.25 + 0.75e -L42 


0.5797. 


Consequently, y(2) = 46.7 x 10 6 kg. 

To find r, we can first solve Eq. (12) for t. We obtain 

= (yo/K)[l - (y/K)] 
(y/K)[ 1 - (yo/K)] ’ 

hence 

f ^ (yo/K)\\ - (y/K)] 
r n (y/K)[\ - (y 0 /K) ] ' 


(13) 


Using the given values of r and yo/K and setting y/K = 0.75, we find that 


r 


1 (0.25)(0.25) 

0/71 ln (0.75)(0.75) 


1 

0/71 


ln9 = 3.095 years. 


The graphs of y/K versus t for the given parameter values and for several initial conditions 
are shown in Figure 2.5.4. 


12 A good source of information on the population dynamics and economics involved in making efficient 
use of a renewable resource, with particular emphasis on fisheries, is the book by Clark listed in the 
references at the end of this chapter. The parameter values used here are given on page 53 of this book 
and were obtained from a study by H. S. Mohring. 
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FIGURE 2.5.4 y/K versus t for population model of halibut in the Pacific Ocean. 



A Critical Threshold. We now turn to a consideration of the equation 



where r and T are given positive constants. Observe that (except for replacing the 
parameter K by T) this equation differs from the logistic equation (7) only in the 
presence of the minus sign on the right side. However, as we will see, the solutions 
of Eq. (14) behave very differently from those of Eq. (7). 

For Eq. (14) the graph of f(y) versus y is the parabola shown in Figure 2.5.5. 
The intercepts on the y-axis are the critical points y = 0 and y — T, corresponding 
to the equilibrium solutions 0i(t) = 0 and 02 (0 = T. If 0 < y < T, then dy/dt < 0, 
and y decreases as t increases. On the other hand, if y > T, then dy/dt > 0, and y 
grows as t increases. Thus (j>\ ( t ) = 0 is an asymptotically stable equilibrium solution 
and 02(0 = T is an unstable one. Further, f'(y) is negative for 0 < y < T / 2 and 
positive for T /2 < y < T , so the graph of y versus t is concave up and concave down, 
respectively, in these intervals. Also, /'(y) is positive for y > T, so the graph of y 
versus t is also concave up there. 
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FIGURE 2.5.5 f(y) versus y for dy/dt = — r(l — y/T)y. 
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Figure 2.5.6a shows the phase line (the y-axis) for Eq. (14). The dots at y = 0 and 
y — T are the critical points, or equilibrium solutions, and the arrows indicate where 
solutions are either increasing or decreasing. 

Solution curves of Eq. (14) can now be sketched quickly, as follows. First draw 
the equilibrium solutions y = 0 and y = T. Then sketch curves in the strip 0 < y < T 
that are decreasing as t increases and change concavity as they cross the line y — T / 2. 
Next draw some curves above y = T that increase more and more steeply as t and 
y increase. Make sure that all curves become flatter as y approaches either zero or 
T. The result is Figure 2.5.6 b, which is a qualitatively accurate sketch of solutions of 
Eq. (14) for any r and T. From this figure it appears that as time increases, y either 
approaches zero or grows without bound, depending on whether the initial value yo 
is less than or greater than T . Thus T is a threshold level, below which growth does 
not occur. 



FIGURE 2.5.6 Growth with a threshold: dy/dt — — r( 1 — y/T)y. (a) The phase line. 
(b) Plots of y versus t. 


We can confirm the conclusions that we have reached through geometrical rea- 
soning by solving the differential equation (14). This can be done by separating the 
variables and integrating, just as we did for Eq. (7). However, if we note that Eq. (14) 
can be obtained from Eq. (7) by replacing K by T and r by —r, then we can make 
the same substitutions in the solution (11) and thereby obtain 

y 0 T , 1C , 

- v = T7T ’ ( 15 ) 

y o + (T -y o)e rt 

which is the solution of Eq. (14) subject to the initial condition y(0) = yo. 

If 0 < y 0 < T, then it follows from Eq. (15) that y — »■ 0 as / — > oo. This agrees with 
our qualitative geometric analysis. If yo > T, then the denominator on the right side 
of Eq. (15) is zero for a certain finite value of t. We denote this value by t* and 
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calculate it from 


yo - (yo - T)e rt ’ = 0, 


which gives 



(16) 


Thus, if the initial population y 0 is above the threshold T, the threshold model predicts 
that the graph of y versus t has a vertical asymptote at t = t*; in other words, the 
population becomes unbounded in a finite time, whose value depends on yo, T, and r. 
The existence and location of this asymptote were not apparent from the geometric 
analysis, so in this case the explicit solution yields additional important qualitative, 
as well as quantitative, information. 

The populations of some species exhibit the threshold phenomenon. If too few 
are present, then the species cannot propagate itself successfully and the population 
becomes extinct. However, if the population is larger than the threshold level, then 
further growth occurs. Of course, the population cannot become unbounded, so 
eventually Eq. (14) must be modified to take this into account. 

Critical thresholds also occur in other circumstances. For example, in fluid mechan- 
ics, equations of the form (7) or (14) often govern the evolution of a small disturbance 
y in a laminar (or smooth) fluid flow. For instance, if Eq. (14) holds and y < T, then 
the disturbance is damped out and the laminar flow persists. However, if y > T , then 
the disturbance grows larger and the laminar flow breaks up into a turbulent one. In 
this case T is referred to as the critical amplitude. Experimenters speak of keeping 
the disturbance level in a wind tunnel sufficiently low so that they can study laminar 
flow over an airfoil, for example. 

Logistic Growth with a Threshold. As we mentioned in the last subsection, the threshold 
model (14) may need to be modified so that unbounded growth does not occur when 
y is above the threshold T. The simplest way to do this is to introduce another factor 
that will have the effect of making dy/dt negative when y is large. Thus we consider 



(17) 


where r > 0 and 0 < T < K. 

The graph of f(y) versus y is shown in Figure 2.5.7. In this problem there are three 
critical points, y = 0, y = T, and y = K, corresponding to the equilibrium solutions 
tp] (t) = 0, <p 2 (t) = T, and 03(f) — K, respectively. From Figure 2.5.7 we observe that 
dy/dt > 0 for T < y < K, and consequently y is increasing there. Further, dy/dt < 0 
for y < T and for y > K, so y is decreasing in these intervals. Consequently, the 
equilibrium solutions </>j (f) and /pit) are asymptotically stable, and the solution 0 2 (f) 
is unstable. 

The phase line for Eq. (17) is shown in Figure 2.5.8 a, and the graphs of some 
solutions are sketched in Figure 2.5.8 b. You should make sure that you understand 
the relation between these two figures, as well as the relation between Figures 2.5.7 
and 2.5.8 a. From Figure 2.5.8 b we see that if y starts below the threshold T, then 
y declines to ultimate extinction. On the other hand, if y starts above T, then y 
eventually approaches the carrying capacity K. The inflection points on the graphs 
of y versus t in Figure 2.5.8 b correspond to the maximum and minimum points, y\ 
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FIGURE 2.5.7 f(y) versus y for dy/dt = — r(l — y/T)(l — y/K)y. 


K' 



(a) 

FIGURE 2.5.8 Logistic growth with a threshold: dy/dt = —r( 1 — y/T)( 1 — y/K)y. (a) The 
phase line. ( b ) Plots of y versus t. 


and y 2 , respectively, on the graph of f(y) versus y in Figure 2.5.7. These values can 
be obtained by differentiating the right side of Eq. (17) with respect to y, setting the 
result equal to zero, and solving for y. We obtain 

y lj2 = (K+T± s/k 2 -KT+ T 2 )/ 3, (18) 

where the plus sign yields y\ and the minus sign y 2 . 

A model of this general sort apparently describes the population of the passenger 
pigeon, 13 which was present in the United States in vast numbers until late in the 
nineteenth century. It was heavily hunted for food and for sport, and consequently 
its numbers were drastically reduced by the 1880s. Unfortunately, the passenger pi- 
geon could apparently breed successfully only when present in a large concentration, 
corresponding to a relatively high threshold T. Although a reasonably large number 
of individual birds remained alive in the late 1880s, there were not enough in any one 


13 See, for example, Oliver L. Austin, Jr., Birds of the World (New York: Golden Press, 1983), pp. 143-145. 
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place to permit successful breeding, and the population rapidly declined to extinc- 
tion. The last survivor died in 1914. The precipitous decline in the passenger pigeon 
population from huge numbers to extinction in a few decades was one of the early 
factors contributing to a concern for conservation in this country. 


PROBLEMS Problems 1 through 6 involve equations of the form dy/dt =f(y). In each problem sketch 

the graph of f(y) versus y, determine the critical (equilibrium) points, and classify each one as 
asymptotically stable or unstable. Draw the phase line, and sketch several graphs of solutions 
in the iy-plane. 

1. dy/dt = ay + by * 1 2 3 4 5 6 7 , a > 0, b > 0, yo > 0 

2. dy/dt = ay + by 2 , a > 0, b > 0, — oo < yo < oo 

3. dy/dt = y(y - l)(y - 2), yo > 0 

4. dy/dt = e y — 1, — oo < yo < oo 

5. dy/dt = e~ y — 1, — oo < yo < oo 

6. dy/dt = — 2(arctany)/(l +y 2 ), — oo < y 0 < oo 

7. Semistable Equilibrium Solutions. Sometimes a constant equilibrium solution has the 
property that solutions lying on one side of the equilibrium solution tend to approach it, 
whereas solutions lying on the other side depart from it (see Figure 2.5.9). In this case the 
equilibrium solution is said to be semistable. 




FIGURE 2.5.9 In both cases the equilibrium solution <p(t) = k is semistable. 
(a) dy/dt < 0; ( b ) dy/dt > 0. 


(a) Consider the equation 

dy/dt = k{ 1 — y) 2 , (i) 

where A: is a positive constant. Show that y = 1 is the only critical point, with the corre- 
sponding equilibrium solution tp(t) — 1. 

(b) Sketch /(y) versus y. Show that y is increasing as a function of t for y < 1 and also 
for y > 1. The phase line has upward-pointing arrows both below and above y = 1. Thus 
solutions below the equilibrium solution approach it, and those above it grow farther away. 
Therefore <p(t) = 1 is semistable. 

(c) Solve Eq. (i) subject to the initial condition y(0) = yo and confirm the conclusions 
reached in part (b). 

Problems 8 through 13 involve equations of the form dy/dt = /(y). In each problem sketch 
the graph of f(y) versus y, determine the critical (equilibrium) points, and classify each one 
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as asymptotically stable, unstable, or semistable (see Problem 7). Draw the phase line, and 
sketch several graphs of solutions in the ty- plane. 


8. dy/dt = — k(y — l) 2 , 

9. dy/dt = y 2 (y 2 — 1), 

10. dy/dt = y(l — y 2 ), 

11. dy/dt = ay — bjy , 

12. dy/dt = y 2 (4 — y 2 ), 

13. dy/dt = y 2 (l — y) 2 , 


k > 0, — oo < yo < oo 

— oo < yo < oo 
— oo < yo < cxi 

a > 0, b > 0, yo > 0 

— cx < yo < cx 

— cx < yo < cx 


14. Consider the equation dy/dt = /(y) and suppose that yi is a critical point, that is,/(yi) = 0. 
Show that the constant equilibrium solution (/> (f) = yj is asymptotically stable if/'(yi) < 0 
and unstable if f'{y\) > 0. 

15. Suppose that a certain population obeys the logistic equation dy/dt = ry[l — (y/K)]. 

(a) If yo = K/3, find the time r at which the initial population has doubled. Find the value 
of r corresponding to r = 0.025 per year. 

(b) liy 0 /K = a. find the time T at which y(T)/K = p , where 0 < a, p < 1. Observe that 
T -*■ cx as a — > 0 or as p — >■ 1. Find the value of T for r = 0.025 per year, a = 0.1, and 
P = 0.9. 

16. Another equation that has been used to model population growth is the Gompertz 14 
equation 

dy/dt — ry\n(K/y), 


where r and K are positive constants. 

(a) Sketch the graph of f(y) versus y, find the critical points, and determine whether each 
is asymptotically stable or unstable. 

(b) For 0 < y < K , determine where the graph of y versus t is concave up and where it is 
concave down. 

(c) For each y in 0 < y < K, show that dy/dt as given by the Gompertz equation is never 
less than dy/dt as given by the logistic equation. 

17. (a) Solve the Gompertz equation 


dy/dt = ry\n(K/y), 


subject to the initial condition y(0) = y 0 . 

Hint: You may wish to let u = ln(y/70. 

(b) For the data given in Example 1 in the text (r = 0.71 per year, K = 80.5 x 10 6 kg, 
y 0 /K = 0.25), use the Gompertz model to find the predicted value of y (2). 

(c) For the same data as in part (b), use the Gompertz model to find the time t at which 
y(r) = 0.75 K. 

18. A pond forms as water collects in a conical depression of radius a and depth h. Suppose 
that water flows in at a constant rate k and is lost through evaporation at a rate proportional 
to the surface area. 

(a) Show that the volume V (t) of water in the pond at time t satisfies the differential 
equation 

dV/dt = k — onr(3a/7th) 2/3 V 2 t 3 , 


where a is the coefficient of evaporation. 


14 Benjamin Gompertz (1779-1865) was an English actuary. He developed his model for population 
growth, published in 1825, in the course of constructing mortality tables for his insurance company. 
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(b) Find the equilibrium depth of water in the pond. Is the equilibrium asymptotically 
stable? 

(c) Find a condition that must be satisfied if the pond is not to overflow. 

19. Consider a cylindrical water tank of constant cross section A. Water is pumped into the 
tank at a constant rate k and leaks out through a small hole of area a in the bottom of the 
tank. From Torricelli’s principle in hydrodynamics (see Problem 6 in Section 2.3) it follows 
that the rate at which water flows through the hole is aa^/lgh, where h is the current depth 
of water in the tank, g is the acceleration due to gravity, and a is a contraction coefficient 
that satisfies 0.5 < a < 1.0. 

(a) Show that the depth of water in the tank at any time satisfies the equation 

dh/dt = (k — aa^/2gh )/A. 

(b) Determine the equilibrium depth h e of water, and show that it is asymptotically stable. 
Observe that h e does not depend on A. 

Harvesting a Renewable Resource. Suppose that the population y of a certain species of fish 
(for example, tuna or halibut) in a given area of the ocean is described by the logistic equation 

dy/dt = r( 1 — y/K)y. 

Although it is desirable to utilize this source of food, it is intuitively clear that if too many 
fish are caught, then the fish population may be reduced below a useful level and possibly 
even driven to extinction. Problems 20 and 21 explore some of the questions involved in 
formulating a rational strategy for managing the fishery. 15 

20. At a given level of effort, it is reasonable to assume that the rate at which fish are caught 
depends on the population y : the more fish there are, the easier it is to catch them. Thus we 
assume that the rate at which fish are caught is given by Ey, where £ is a positive constant, 
with units of 1/time, that measures the total effort made to harvest the given species of 
fish. To include this effect, the logistic equation is replaced by 

dy/dt = r( 1 - y/K)y - Ey. (i) 

This equation is known as the Schaefer model after the biologist M. B. Schaefer, who 
applied it to fish populations. 

(a) Show that if E < r, then there are two equilibrium points, yi = 0 and 
yi = ATI - E/r) >0. 

(b) Show that y = y\ is unstable and y = yi is asymptotically stable. 

(c) A sustainable yield Y of the fishery is a rate at which fish can be caught indefinitely. 
It is the product of the effort E and the asymptotically stable population y 2 . Find Y as a 
function of the effort E; the graph of this function is known as the yield-effort curve. 

(d) Determine E so as to maximize Y and thereby find the maximum sustainable yield Y m . 

21. In this problem we assume that fish are caught at a constant rate h independent of the size 
of the fish population. Then y satisfies 

dy/dt = /'(l — y/K)y — h. (i) 


1:> An excellent treatment of this kind of problem, which goes far beyond what is outlined here, may be 
found in the book by Clark mentioned previously, especially in the first two chapters. Numerous additional 
references are given there. 
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The assumption of a constant catch rate h may be reasonable when y is large but becomes 
less so when y is small. 

(a) If h < rK/4, show that Eq. (i) has two equilibrium points y\ and _y 2 with yi < y 2 ; 
determine these points. 

(b) Show that y\ is unstable and y 2 is asymptotically stable. 

(c) From a plot of f(y) versus y, show that if the initial population y 0 > yi, then y — > y 2 
as t — > oo, but that if yo < y 1 , then y decreases as t increases. Note that y = 0 is not an 
equilibrium point, so if y 0 < yi, then extinction will be reached in a finite time. 

(d) If h > rK/4 , show that y decreases to zero as t increases regardless of the value of y 0 . 

(e) If h = rK/ 4, show that there is a single equilibrium point y — K/2 and that this point 
is semistable (see Problem 7). Thus the maximum sustainable yield is h„, = rK/4, corre- 
sponding to the equilibrium value y = K/2. Observe that h m has the same value as Y m 
in Problem 20(d). The fishery is considered to be overexploited if y is reduced to a level 
below K/2. 

Epidemics. The use of mathematical methods to study the spread of contagious diseases goes 
back at least to some work by Daniel Bernoulli in 1760 on smallpox. In more recent years 
many mathematical models have been proposed and studied for many different diseases. 16 
Problems 22 through 24 deal with a few of the simpler models and the conclusions that can be 
drawn from them. Similar models have also been used to describe the spread of rumors and 
of consumer products. 

22. Suppose that a given population can be divided into two parts: those who have a given 
disease and can infect others, and those who do not have it but are susceptible. Let x be the 
proportion of susceptible individuals and y the proportion of infectious individuals; then 
x + y = 1. Assume that the disease spreads by contact between sick and well members 
of the population and that the rate of spread dy/dt is proportional to the number of such 
contacts. Further, assume that members of both groups move about freely among each 
other, so the number of contacts is proportional to the product of x and y . Since x — 1 — y, 
we obtain the initial value problem 


(i) 


dy/dt = ay{ 1 - y), y(0) = y 0 . 


where a is a positive proportionality factor, and y 0 is the initial proportion of infectious 
individuals. 

(a) Find the equilibrium points for the differential equation (i) and determine whether 
each is asymptotically stable, semistable, or unstable. 

(b) Solve the initial value problem (i) and verify that the conclusions you reached in 
part (a) are correct. Show that y{t) — > 1 as t — > oo, which means that ultimately the dis- 
ease spreads through the entire population. 

23. Some diseases (such as typhoid fever) are spread largely by carriers, individuals who can 
transmit the disease but who exhibit no overt symptoms. Let x and y, respectively, denote 
the proportion of susceptibles and carriers in the population. Suppose that carriers are 
identified and removed from the population at a rate f), so 


dy/dt = — fiy. 


(0 


16 A standard source is the book by Bailey listed in the references. The models in Problems 22 through 24 
are discussed by Bailey in Chapters 5, 10, and 20, respectively. 
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Suppose also that the disease spreads at a rate proportional to the product of x and y; thus 

dx/dt = — axy. (ii) 

(a) Determine y at any time t by solving Eq. (i) subject to the initial condition y(0) = y Q . 

(b) Use the result of part (a) to find x at any time t by solving Eq. (ii) subject to the initial 
condition x(0) = x 0 . 

(c) Find the proportion of the population that escapes the epidemic by finding the limiting 
value of x as t — > oo. 

24. Daniel Bernoulli’s work in 1760 had the goal of appraising the effectiveness of a contro- 
versial inoculation program against smallpox, which at that time was a major threat to 
public health. Elis model applies equally well to any other disease that, once contracted 
and survived, confers a lifetime immunity. 

Consider the cohort of individuals born in a given year (/ = 0), and let n(t) be the 
number of these individuals surviving t years later. Let x(t) be the number of members of 
this cohort who have not had smallpox by year t and who are therefore still susceptible. 
Let fi be the rate at which susceptibles contract smallpox, and let v be the rate at which 
people who contract smallpox die from the disease. Finally, let fi(t) be the death rate from 
all causes other than smallpox. Then dx/dt , the rate at which the number of susceptibles 
declines, is given by 

dx/dt — —[/3 + )i{ty\x. (i) 

The first term on the right side of Eq. (i) is the rate at which susceptibles contract smallpox, 
and the second term is the rate at which they die from all other causes. Also 

dn/dt = —vfix — fj.(t)n, (ii) 

where dn/dt is the death rate of the entire cohort, and the two terms on the right side are 
the death rates due to smallpox and to all other causes, respectively. 

(a) Let z = x/n and show that z satisfies the initial value problem 

dz/dt = -fiz(l - vz), z( 0) = 1. (iii) 

Observe that the initial value problem (iii) does not depend on fi(t). 

(b) Find z(t) by solving Eq. (iii). 

(c) Bernoulli estimated that v = /3 = 1. Using these values, determine the proportion of 
20-year-olds who have not had smallpox. 

Note: On the basis of the model just described and the best mortality data available at 
the time, Bernoulli calculated that if deaths due to smallpox could be eliminated (v = 0), 
then approximately 3 years could be added to the average life expectancy (in 1760) of 26 
years, 7 months. He therefore supported the inoculation program. 

Bifurcation Points. For an equation of the form 

dy/dt = f (a, y), (i) 

where a is a real parameter, the critical points (equilibrium solutions) usually depend on the 
value of a. As a steadily increases or decreases, it often happens that at a certain value of a, 
called a bifurcation point, critical points come together, or separate, and equilibrium solutions 
may either be lost or gained. Bifurcation points are of great interest in many applications, 
because near them the nature of the solution of the underlying differential equation is under- 
going an abrupt change. For example, in fluid mechanics a smooth (laminar) flow may break 
up and become turbulent. Or an axially loaded column may suddenly buckle and exhibit a 
large lateral displacement. Or, as the amount of one of the chemicals in a certain mixture is in- 
creased, spiral wave patterns of varying color may suddenly emerge in an originally quiescent 
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fluid. Problems 25 through 27 describe three types of bifurcations that can occur in simple 
equations of the form (i). 

25. Consider the equation 

dy/dt = a — y 2 . (ii) 

(a) Find all of the critical points for Eq. (ii). Observe that there are no critical points if 
a < 0, one critical point if a = 0, and two critical points if a > 0. 

(b) Draw the phase line in each case and determine whether each critical point is asymp- 
totically stable, semistable, or unstable. 

(c) In each case sketch several solutions of Eq. (ii) in the ty- plane. 

(d) If we plot the location of the critical points as a function of a in the ay-plane, we obtain 
Figure 2.5.10. This is called the bifurcation diagram for Eq. (ii). The bifurcation at a = 0 
is called a saddle-node bifurcation. This name is more natural in the context of second 
order systems, which are discussed in Chapter 9. 
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FIGURE 2.5.10 

Bifurcation diagram for y' = a — y 2 . 


26. Consider the equation 

dy/dt = ay — y 3 = y(a — y 2 ). (iii) 

(a) Again consider the cases a < 0, a = 0, and a > 0. In each case find the critical points, 
draw the phase line, and determine whether each critical point is asymptotically stable, 
semistable, or unstable. 

(b) In each case sketch several solutions of Eq. (iii) in the fy-plane. 

(c) Draw the bifurcation diagram for Eq. ( iii) , that is, plot the location of the critical points 
versus a. For Eq. (iii) the bifurcation point at a = 0 is called a pitchfork bifurcation; your 
diagram may suggest why this name is appropriate. 

27. Consider the equation 

dy/dt = ay - y 2 = y(a - y). (iv) 

(a) Again consider the cases a < 0, a = 0, and a > 0. In each case find the critical points, 
draw the phase line, and determine whether each critical point is asymptotically stable, 
semistable, or unstable. 

(b) In each case sketch several solutions of Eq. (iv) in the ty- plane. 
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(c) Draw the bifurcation diagram for Eq. (iv). Observe that for Eq. (iv) there are the 
same number of critical points for a < 0 and a > 0 but that their stability has changed. 
For a < 0 the equilibrium solution y = 0 is asymptotically stable and y = a is unstable, 
while for a > 0 the situation is reversed. Thus there has been an exchange of stability as a 
passes through the bifurcation point a = 0. This type of bifurcation is called a transcritical 
bifurcation. 

28. Chemical Reactions. A second order chemical reaction involves the interaction (colli- 
sion) of one molecule of a substance P with one molecule of a substance Q to produce 
one molecule of a new substance X; this is denoted by P + Q — > X. Suppose that p and 
q , where p / q, are the initial concentrations of P and Q, respectively, and let x(t) be the 
concentration of X at time t. Then p — x(t) and q — x(t) are the concentrations of P and 
Q at time t, and the rate at which the reaction occurs is given by the equation 

dx/dt = a(p — x)(q — x), (i) 


where a is a positive constant. 

(a) If x(0) = 0, determine the limiting value of x(t) as t — »■ oo without solving the differ- 
ential equation. Then solve the initial value problem and find x(t) for any t. 

(b) If the substances P and Q are the same, then p = q and Eq. (i) is replaced by 

dx/dt = a(p — x) 2 . (ii) 

If x(0) = 0, determine the limiting value of x(t) as r — >• oo without solving the differential 
equation. Then solve the initial value problem and determine x(t) for any t. 


2.6 Exact Equations and Integrating Factors 


For first order equations there are a number of integration methods that are applica- 
ble to various classes of problems. The most important of these are linear equations 
and separable equations, which we have discussed previously. Flere, we consider a 
class of equations known as exact equations for which there is also a well-defined 
method of solution. Keep in mind, however, that those first order equations that 
can be solved by elementary integration methods are rather special; most first order 
equations cannot be solved in this way. 


EXAMPLE 

1 


Solve the differential equation 

2 x + y 2 + 2 xyy' = 0. 


( 1 ) 


The equation is neither linear nor separable, so the methods suitable for 
equations are not applicable here. However, observe that the function ir{x,y) 
the property that 


2x + y 2 


3i jf 
~dx ’ 


2 xy = 


dir 

3 y ' 


those types of 
= x 2 + xy 2 has 


( 2 ) 


Therefore the differential equation can be written as 

d\/f dir dy 
dx dy dx 


( 3 ) 
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Assuming that y is a function of x and calling upon the chain rule, we can write Eq. (3) in the 
equivalent form 


dl/r 

dx 


^-(x 2 +xy 2 ) = 0. 
ax 


(4) 


Therefore 


\jf(x,y) — x 2 + xy 2 = c, 


(5) 


where c is an arbitrary constant, is an equation that defines solutions of Eq. (1) implicitly. 


In solving Eq. (1) the key step was the recognition that there is a function t/t that 
satisfies Eqs. (2). More generally, let the differential equation 

M(x,y) + N(x,y)y' = 0 (6) 

be given. Suppose that we can identify a function ijr such that 

= M(x,y), ~J~(x,y) = N(x,y), (7) 

dx 3 y 

and such that ^r{x,y) — c defines y — </; (x ) implicitly as a differentiable function 
of x. Then 

3i lr dxlr dy d 

M(x,y) + N(x,y)y = + — — = -j-f[x,<j)(.x)] 

dx dy dx dx 

and the differential equation (6) becomes 

^ir[x,<p(x)] = 0 . ( 8 ) 

In this case Eq. (6) is said to be an exact differential equation. Solutions of Eq. (6), 
or the equivalent Eq. (8), are given implicitly by 

f(x,y)=c, (9) 

where c is an arbitrary constant. 

In Example 1 it was relatively easy to see that the differential equation was exact 
and, in fact, easy to find its solution, by recognizing the required function \jj. For 
more complicated equations it may not be possible to do this so easily. A systematic 
way of determining whether a given differential equation is exact is provided by the 
following theorem. 


Let the functions M , N, M y , and N x , where subscripts denote partial derivatives, be 
continuous in the rectangular 17 region R:a<x<p,y<y<8. Then Eq. (6) 

M(x,y)+N(x,y)y' = 0 


17 It is not essential that the region be rectangular, only that it be simply connected. In two dimensions 
this means that the region has no holes in its interior. Thus, for example, rectangular or circular regions 
are simply connected, but an annular region is not. More details can be found in most books on advanced 
calculus. 
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is an exact differential equation in R if and only if 

M y (x,y) = N x (x,y) (10) 

at each point of R. That is, there exists a function i// satisfying Eqs. (7), 
fx(x,y) = M(x,y), ^ y (x,y ) = N(x,y ), 

if and only if M and N satisfy Eq. (10). 


The proof of this theorem has two parts. First, we show that if there is a function 
i/f such that Eqs. (7) are true, then it follows that Eq. (10) is satisfied. Computing M y 
and N x from Eqs. (7), we obtain 

M y {x,y) = \[r xy (x, y) , N x (x,y) = ij/ yx (x,y). (11) 

Since M v and N x are continuous, it follows that i fr xy and \j/ yx are also continuous. This 
guarantees their equality, and Eq. (10) follows. 

We now show that if M and N satisfy Eq. (10), then Eq. (6) is exact. The proof 
involves the construction of a function ijr satisfying Eqs. (7) 


fx{x,y) = M{x,y), Vq( x,y) = N(x,y). 

We begin by integrating the first of Eqs. (7) with respect to x, holding y constant. We 
obtain 

f{x,y) = Q(x,y) + h(y), (12) 

where Q(x,y ) is any differentiable function such that dQ(x,y)/dx = M(x,y). For 
example, we might choose 

Q(x,y)=f M(s,y)ds, (13) 

where xo is some specified constant in a < xq < The function h in Eq. (12) is an 
arbitrary differentiable function of y, playing the role of the arbitrary constant. Now 
we must show that it is always possible to choose h(y) so that the second of Eqs. (7) 
is satisfied, that is, i/q = N. By differentiating Eq. (12) with respect to y and setting 
the result equal to N(x,y), we obtain 

ty y (x,y) = yf^(x,y) + h'(y ) = N(x,y). 

3 y 

Then, solving for h'(y), we have 

h'(y) = N(x,y) - ^p-(x,y). (14) 


In order for us to determine h(y) from Eq. (14), the right side of Eq. (14), despite 
its appearance, must be a function of y only. To establish that this is true, we can 
differentiate the quantity in question with respect to x, obtaining 


3 N 

dx 


(x,y) 


d_dQ 
dx 3 y 


(X,y). 


(15) 


By interchanging the order of differentiation in the second term of Eq. (15), we have 


3 N 
dx 


(x,y) 


_d_d Q 
3 y dx 


( x,y ), 


2.6 Exact Equations and Integrating Factors 


97 



or, since dO/dx = M, 

dN dM 

-r-Ory) - - t — (x,y), 

dx dy 

which is zero on account of Eq. (10). Hence, despite its apparent form, the right 
side of Eq. (14) does not, in fact, depend on x. Then we find h(y) by integrating 
Eq. (14), and upon substituting this function in Eq. (12), we obtain the required 
function i jr(x,y). This completes the proof of Theorem 2.6.1. 

It is possible to obtain an explicit expression for t/r (x, y) in terms of integrals (see 
Problem 17) , but in solving specific exact equations, it is usually simpler and easier j ust 
to repeat the procedure used in the preceding proof. That is, integrate = M with 
respect to x , including an arbitrary function of h(y) instead of an arbitrary constant, 
and then differentiate the result with respect to y and set it equal to N. Finally, use 
this last equation to solve for h(y). The next example illustrates this procedure. 

EXAMPLE 

2 

Solve the differential equation 

(y cosx + 2xe y ) + (sinx + x 2 e y — 1 )y' = 0. (16) 

By calculating M v and N x , we find that 

M y (x,y) — cosx + 2xe y = N x (x.y), 

so the given equation is exact. Thus there is a i/s(x,y) such that 

i jr x (x,y) = ycosx + 2xe y , 
i/r v (x,y) = sinx + x 2 e y — 1. 

Integrating the first of these equations, we obtain 

i /r(x,y) = y sinx + x 2 e y + h(y). (17) 

Setting i Js y = N gives 

i jr y (x,y) = sinx + x 2 e v + h'(y) = sinx + x 2 e v — 1. 

Thus h'(y) = — 1 and h(y) = — y. The constant of integration can be omitted since any solution 
of the preceding differential equation is satisfactory; we do not require the most general one. 
Substituting for h(y) in Eq. (17) gives 

i lr(x,y) = y sinx + x 2 e y — y. 

Hence solutions of Eq. (16) are given implicitly by 

ysinx + x 2 e y —y = c. (18) 

EXAMPLE 

3 

Solve the differential equation 

(3xy + y 2 ) + (x 2 + xy)y’ = 0. (19) 

We have 

M y (x,y) = 3x + 2y, N x (x,y) = 2x + y; 

since M v ^ N x , the given equation is not exact. To see that it cannot be solved by the procedure 
described above, let us seek a function i jr such that 

f x {x,y) = 3xy + y 2 , f y (x,y) = x 2 + xy. (20) 


(20) 
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Integrating the first of Eqs. (20) gives 

f(x,y) = | x 2 y + xy 2 + h(y), (21) 

where h is an arbitrary function of y only. To try to satisfy the second of Eqs. (20), we compute 
i lf y from Eq. (21) and set it equal to N, obtaining 

jX 2 + 2x_y + h'(y) — x 2 + xy 


or 

h’(y) = —\x 2 — xy. (22) 

Since the right side of Eq. (22) depends on x as well as y, it is impossible to solve Eq. (22) for 
h(y). Thus there is no 4r(x,y) satisfying both of Eqs. (20). 


Integrating Factors. It is sometimes possible to convert a differential equation that is 
not exact into an exact equation by multiplying the equation by a suitable integrating 
factor. Recall that this is the procedure that we used in solving linear equations in 
Section 2.1. To investigate the possibility of implementing this idea more generally, 
let us multiply the equation 

M(x,y) dx + N(x,y) dy = 0 (23) 

by a function /x and then try to choose /x so that the resulting equation 

p.(x,y)M(x,y) dx + /x(x,y)N(x,y) dy — 0 (24) 

is exact. By Theorem 2.6.1, Eq. (24) is exact if and only if 

(fiM)y = (p.N) x . (25) 

Since M and N are given functions, Eq. (25) states that the integrating factor /x must 
satisfy the first order partial differential equation 

Mp,y — N p x + (My — N x )p — 0. (26) 

If a function /x satisfying Eq. (26) can be found, then Eq. (24) will be exact. The 
solution of Eq. (24) can then be obtained by the method described in the first part of 
this section. The solution found in this way also satisfies Eq. (23), since the integrating 
factor /x can be canceled out of Eq. (24). 

A partial differential equation of the form (26) may have more than one solution; 
if this is the case, any such solution may be used as an integrating factor of Eq. (23). 
This possible nonuniqueness of the integrating factor is illustrated in Example 4. 

Unfortunately, Eq. (26), which determines the integrating factor /x, is ordinarily at 
least as hard to solve as the original equation (23). Therefore, although in principle 
integrating factors are powerful tools for solving differential equations, in practice 
they can be found only in special cases. The most important situations in which 
simple integrating factors can be found occur when /x is a function of only one of the 
variables x or y, instead of both. Let us determine necessary conditions on M and N 
so that Eq. (23) has an integrating factor /x that depends on x only. Assuming that /x 
is a function of x only, we have 


(lJ.M)y = fJ.My, 


(fiN) x = pN x + N — . 

dx 
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EXAMPLE 

4 


PROBLEMS 


Thus, if (/ ] xM) y is to equal ( fiN ) x , it is necessary that 


dfi 

dx 


M v - Nr 


N 




(27) 


If {M y — N x ) /N is a function of x only, then there is an integrating factor // that also 
depends only on x; further, /i(x) can be found by solving Eq. (27), which is both linear 
and separable. 

A similar procedure can be used to determine a condition under which Eq. (23) 
has an integrating factor depending only on y; see Problem 23. 


Find an integrating factor for the equation 

(3xy + y 2 ) + ( \x 2 + xy)y' = 0 (19) 

and then solve the equation. 

In Example 3 we showed that this equation is not exact. Let us determine whether it has an 
integrating factor that depends on x only. On computing the quantity (M y — N x )/N, we find 
that 

M y {x,y ) - N x (x,y) _ 3x + 2 y - (2x + y) _ 

N(x,y ) x 2 + xy 

Thus there is an integrating factor /u that is a function of x only, and 


equation 

d/ji fi 

dx x 

(29) 

Hence 

H(x) — x. 

(30) 


Multiplying Eq. (19) by this integrating factor, we obtain 

(3x 2 y + xy 2 ) + (x 3 + x 2 y)y' = 0. (31) 


-. (28) 
it satisfies the differential 


The latter equation is exact, and its solutions are given implicitly by 

x 3 y + \x 2 y 2 = c. 

Solutions may also be found in explicit form since Eq. (32) is quadratic in y. 
You may also verify that a second integrating factor for Eq. (19) is 


li(x,y) 


1 

xy(2x + y) ’ 


(32) 


and that the same solution is obtained, though with much greater difficulty, if this integrating 
factor is used (see Problem 32). 


Determine whether each of the equations in Problems 1 through 12 is exact. If it is exact, find 
the solution. 

1. (2x + 3) + (2 y — 2)y' = 0 2. (2x + 4 y) + (2x — 2 y)y' = 0 

3. (3x 2 — 2 xy + 2) dx + (6 y 2 — x 2 + 3) dy = 0 

4. (2 xy 2 + 2 y) + (2 x 2 y + 2 x)y' = 0 
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dy ax + by 


dy ax — by 


dx bx — cy 


dx bx + cy 


7. (e* sin y — 2 y sin x) dx + (e x cos y + 2 cos x) dy = 0 

8. ( e x sin y + 3 y) dx — (3x — e x siny) dy = 0 

9. (ye ty cos 2x — 2e xy sin 2x + 2x) dx + (xe xy cos 2x — 3) dy = 0 

10. (y/x + 6x) dx + (lnx — 2) dy = 0, x > 0 

11. (xlny + xy)dx + (ylnx +xy)rfy = 0; x > 0, y>0 


1 0 Xdx ydy 

' (x 2 +y2) 3/2 + (x 2 +> ,2 ) 3/2 


In each of Problems 13 and 14 solve the given initial value problem and determine at least 
approximately where the solution is valid. 

13. (2x — y) dx + (2 y — x) dy = 0, y( 1) = 3 

14. ( 9x 2 + y — 1) dx — (4 y — x) dy = 0, y(l) = 0 

In each of Problems 15 and 16 find the value of b for which the given equation is exact, and 
then solve it using that value of b. 

15. (xy 1 + bx 2 y) dx + (x + y)x 2 dy - 0 

16. (ye 2xy + x) dx + bxe 2xy dy = 0 

17. Assume that Eq. (6) meets the requirements of Theorem 2.6.1 in a rectangle R and is 
therefore exact. Show that a possible function 4r(x,y) is 



where (x 0 ,yo) is a point in R. 

18. Show that any separable equation 

M(x) + N(y)y' = 0 

is also exact. 

In each of Problems 19 through 22 show that the given equation is not exact but becomes exact 
when multiplied by the given integrating factor. Then solve the equation. 

19. x 2 y 3 + x(l + y 2 )y' = 0, n(x,y) = 1 /xy 3 



21. y dx + (2x — ye y ) dy = 0, /x(x,y) = y 

22. (x + 2) siny dx + x cosy dy = 0, /r(x,y) = xe* 

23. Show that if (N x — M y )/M = Q , where Q is a function of y only, then the differential 


equation 


M + Ny' = 0 


has an integrating factor of the form 


M(y) = exp 




has an integrating factor of the form /r(xy). Find a general formula for this integrating 
factor. 
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In each of Problems 25 through 31 find an integrating factor and solve the given equation. 

25. (3x 2 y + 2xy + y 3 ) dx + (x 2 + y 2 ) dy = 0 26. / = e 2x + y - 1 

27. dx + {x/y — siny) dy = 0 28. y dx + (2xy — e~ 2y ) dy = 0 

29. e x dx + (e* coty + 2 y esc y) dy = 0 

30. [4 (x 3 /y 2 ) + (3/y)] dx + [3 (x/y 2 ) + 4y] dy = 0 
6 N 

3x 4 — 

U 

Hint: See Problem 24. 

32. Solve the differential equation 


31. 


x v 
— + 3- 
y x 


dx 


(3xy + y 2 ) + (x 2 + xy)y' = 0 


using the integrating factor p(x,y) = [xy(2x + y)] h Verify that the solution is the same 
as that obtained in Example 4 with a different integrating factor. 


2.7 Numerical Approximations: Euler’s Method 


Recall two important facts about the first order initial value problem 

^=/(F)0, y(t 0 )=yo. (l) 

First, if / and df/dy are continuous, then the initial value problem (1) has a unique 
solution y = / (f) in some interval surrounding the initial point t = t 0 . Second, it is 
usually not possible to find the solution / by symbolic manipulations of the differ- 
ential equation. Up to now we have considered the main exceptions to the latter 
statement: differential equations that are linear, separable, or exact, or that can be 
transformed into one of these types. Nevertheless, it remains true that solutions of 
the vast majority of first order initial value problems cannot be found by analytical 
means, such as those considered in the first part of this chapter. 

Therefore it is important to be able to approach the problem in other ways. As we 
have already seen, one of these ways is to draw a direction field for the differential 
equation (which does not involve solving the equation) and then to visualize the 
behavior of solutions from the direction field. This has the advantage of being a 
relatively simple process, even for complicated differential equations. Flowever, it 
does not lend itself to quantitative computations or comparisons, and this is often a 
critical shortcoming. 

For example. Figure 2.7.1 shows a direction field for the differential equation 


dy , _ 1 

-=3-2,- 5j . 


( 2 ) 


From the direction field you can visualize the behavior of solutions on the rectangle 
shown in the figure. A solution starting at a point on the y-axis initially increases with 
t, but it soon reaches a maximum value and then begins to decrease as t increases 
further. 
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FIGURE 2.7.1 A direction field for Eq. (2). 


You may also observe that in Figure 2.7.1 many tangent line segments at successive 
values of t almost touch each other. It takes only a bit of imagination to consider 
starting at a point on the y-axis and linking line segments for successive values of t in 
the grid, thereby producing a piecewise linear graph. Such a graph would apparently 
be an approximation to a solution of the differential equation. To convert this idea 
into a useful method for generating approximate solutions, we must answer several 
questions, including the following: 

1. Can we carry out the linking of tangent lines in a systematic and straightforward manner? 

2. If so, does the resulting piecewise linear function provide an approximation to an actual 
solution of the differential equation? 

3. If so, can we assess the accuracy of the approximation? That is, can we estimate how far 
the approximation deviates from the solution itself? 

It turns out that the answer to each of these questions is affirmative. The resulting 
method was originated by Euler about 1768 and is referred to as the tangent line 
method or the Euler method. We will deal with the first two questions in this section, 
but defer a systematic discussion of the third question until Chapter 8. 

To see how the Euler method works, let us consider how we might use tangent 
lines to approximate the solution y = <p(t) of Eqs. (1) near t = to. We know that the 
solution passes through the initial point (to, yo), and from the differential equation, we 
also know that its slope at this point is/(t 0 ,yo)- Thus we can write down an equation 
for the line tangent to the solution curve at (to, yob namely, 

y = yo+f(t 0 ,yo)(t-t Q ). (3) 

The tangent line is a good approximation to the actual solution curve on an interval 
short enough so that the slope of the solution does not change appreciably from its 
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value at the initial point; see Figure 2.7.2. Thus, if t\ is close enough to to, we can 
approximate 0(q) by the value y\ determined by substituting t = t\ into the tangent 
line approximation at t = to', thus 

yi = yo +f(t 0 ,yo)(h - to). (4) 



FIGURE 2.7.2 A tangent line approximation. 


To proceed further, we can try to repeat the process. Unfortunately, we do not know 
the value 0(q) of the solution at t\ . The best we can do is to use the approximate 
value y\ instead. Thus we construct the line through (t\,y\) with the slope/(q,yi), 

y = yi+f(h,yi)(t-ti). (5) 

To approximate the value of 0 ( t ) at a nearby point t 2 , we use Eq. (5) instead, obtaining 

yi = yi +/(ft,yt)fe - h). (6) 

Continuing in this manner, we use the value of y calculated at each step to deter- 
mine the slope of the approximation for the next step. The general expression for 
the tangent line starting at ( t n ,y n ) is 

y = yn+f(t n ,yn)(t-t n y, ( 7 ) 

hence the approximate value y n+ \ at t n+ \ in terms of t n , t n+ \, and y n is 

y«+i — yn T - / (t n ,y n )(t n + 1 t n ), n = 0, 1, 2, ... . (8) 

If we introduce the notation /„ = f(t n ,y n ), then we can rewrite Eq. (8) as 

y n +l — yn T fn ' (^w +1 tn), n — 0,1,2,.... (9) 

Finally, if we assume that there is a uniform step size /7 between the points to,ti,t 2 , 
then t n+ i — t n + h for each n, and we obtain Euler’s formula in the form 

y «+ 1 = y„ +.fnh , n = 0, 1,2 (10) 

To use Euler’s method, you simply evaluate Eq. (9) or Eq. (10) repeatedly, de- 
pending on whether or not the step size is constant, using the result of each step to 
execute the next step. In this way you generate a sequence of values y\,y 2 ,y-i , . . . that 
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approximate the values of the solution <p(t) at the points q, * 2 , * 3 , If, instead of a 

sequence of points, you need a function to approximate the solution </>(*), then you 
can use the piecewise linear function constructed from the collection of tangent line 
segments. That is, let y be given in [t 0 , q] by Eq. (7) with n — 0, in [q, q] by Eq. (7) 
with n = 1 , and so on. 


Consider the initial value problem 

EXAMPLE 

1 _Z =3 — 2 t-iy, y( 0) = 1. (11) 

Use Euler’s method with step size h = 0.2 to find approximate values of the solution of Eqs. (11) 
at r = 0.2, 0.4, 0.6, 0.8, and 1. Compare them with the corresponding values of the actual 
solution of the initial value problem. 

Note that the differential equation in the given initial value problem is the same as in Eq. (2). 
This equation is linear, so it can be solved as in Section 2.1, using the integrating factor e' /2 . 
The resulting solution of the initial value problem (11) is 

y = (j>(t) = 14 — 4r — 13e~ t/2 . (12) 

To approximate this solution by means of Euler’s method, note that in this case 
f(t,y) — 3 — 2 1 — y /2. Using the initial values to = 0 and y 0 = 1, we find that 

fo = f(t 0 ,y 0 ) = /( 0, 1) = 3 - 0 - 0.5 = 2.5 

and then, from Eq. (3), the tangent line approximation near t = 0 is 

y = 1 + 2.5 (t - 0) = 1 + 2.5 1. (13) 

Setting t = 0.2 in Eq. (13), we find the approximate value y \ of the solution at t = 0.2, namely, 

Tt = 1 + (2.5) (0.2) = 1.5. 


At the next step we have 


/i = /(0.2, 1.5) = 3 - 2(0.2) - (0.5) (1.5) = 3 - 0.4 - 0.75 = 1.85. 

Then the tangent line approximation near t = 0.2 is 

y = 1.5 + 1.85(r - 0.2) = 1.13 + 1.85*. (14) 

Evaluating the expression in Eq. (14) for t = 0.4, we obtain 

y 2 = 1.13 + 1.85(0.4) = 1.87. 

Repeating this computational procedure three more times, we obtain the results shown in 
Table 2.7.1. 

The first column contains the f-values separated by the step size h = 0.2. The third column 
shows the corresponding y-values computed from Euler’s formula (10). In the fourth column 
are the tangent line approximations found from Eq. (7). The second column contains values of 
the solution (12) of the initial value problem (11), correct to five decimal places. The solution 
(12) and the tangent line approximation are also plotted in Figure 2.7.3. 

From Table 2.7.1 and Figure 2.7.3 we see that the approximations given by Euler’s method for 
this problem are greater than the corresponding values of the actual solution. This is because 
the graph of the solution is concave down and therefore the tangent line approximations he 
above the graph. 
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TABLE 2.7.1 Results of Euler’s Method with h = 0.2 for 
y' = 3 — 2t — \y, y(0) = 1 


t 

Exact 

Euler 

with h = 0.2 


Tangent line 

0.0 

1.00000 

1.00000 

y = 

1 + 2.5f 

0.2 

1.43711 

1.50000 

y = 

1.13 + 1.85; 

0.4 

1.75650 

1.87000 

y = 

1.364 + 1.265t 

0.6 

1.96936 

2.12300 

y = 

1.6799 + 0.7385; 

0.8 

2.08584 

2.27070 

y = 

2.05898 + 0.26465; 

1.0 

2.11510 

2.32363 




y 

2.4 


1.6 


1.2 


Tangent line approximation 


Solution 


_L 


_L 


_L 




0.2 0.4 0.6 0.8 1 t 

FIGURE 2.7.3 Plots of the solution and a tangent line approximation for the initial value 

problem (11). 


The accuracy of the approximations in this example is not good enough to be satisfactory in 
a typical scientific or engineering application. For example, at t = 1 the error in the approxi- 
mation is 2.32363 — 2.11510 = 0.20853, which is a percentage error of about 9.86% relative to 
the exact solution. One way to achieve more accurate results is to use a smaller step size, with 
a corresponding increase in the number of computational steps. We explore this possibility in 
the next example. 


Of course, computations such as those in Example 1 and in the other examples 
in this section are usually done on a computer. Some software packages include 
code for the Euler method, while others do not. In any case, it is straightforward to 
write a computer program that will carry out the calculations required to produce 
results such as those in Table 2.7.1. Basically, what is required is a loop that will 
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evaluate Eq. (10) repetitively, along with suitable instructions for input and output. 
The output can be a list of numbers, as in Table 2.7.1, or a plot, as in Figure 2.7.3. 
The specific instructions can be written in any high-level programming language with 
which you are familiar. 


Consider again the initial value problem (11) 

EXAMPLE 

2 J = 3 - 2 t-iy, y{ 0) = 1. 

Use Euler’s method with various step sizes to calculate approximate values of the solution 
for 0 < t < 5. Compare the calculated results with the corresponding values of the exact 
solution (12) 

y = (p(t) = 14 — 4f — 13e -f/2 . 

We used step sizes h — 0.1, 0.05, 0.025, and 0.01, corresponding respectively to 50, 100, 200, 
and 500 steps, to go from t — 0 to t = 5. The results of these calculations, along with the values 
of the exact solution, are presented in Table 2.7.2. All computed entries are rounded to four 
decimal places, although more digits were retained in the intermediate calculations. 


TABLE 2.7.2 A Comparison of Exact Solution with Euler’s Method for Several 
Step Sizes h for y' = 3 — 2t — fy, y(0) = 1 


t 

Exact 

h = 0.1 

h = 0.05 

h = 0.025 

h = 0.01 

0.0 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0 

2.1151 

2.2164 

2.1651 

2.1399 

2.1250 

2.0 

1.2176 

1.3397 

1.2780 

1.2476 

1.2295 

3.0 

-0.9007 

-0.7903 

-0.8459 

-0.8734 

-0.8898 

4.0 

-3.7594 

-3.6707 

-3.7152 

-3.7373 

-3.7506 

5.0 

-7.0671 

-7.0003 

-7.0337 

-7.0504 

-7.0604 


What conclusions can we draw from the data in Table 2.7.2? The most important observation 
is that, for a fixed value of t , the computed approximate values become more accurate as the 
step size h decreases. You can see this by reading across a particular row in the table from left 
to right. This is what we would expect, of course, but it is encouraging that the data confirm our 
expectations. For example, for t = 2 the approximate value with h = 0.1 is too large by 0.1221 
(about 10%), whereas the value with h — 0.01 is too large by only 0.0119 (about 1%). In this 
case, reducing the step size by a factor of 10 (and performing 10 times as many computations) 
also reduces the error by a factor of about 10. By comparing the errors for other pairs of 
values in the table, you can verify that this relation between step size and error holds for them 
also: reducing the step size by a given factor also reduces the error by approximately the same 
factor. Does this mean that for the Euler method the error is approximately proportional to 
the step size? Of course, one example does not establish such a general result but it is at least 
an interesting conjecture. 18 


1S A more detailed discussion of the errors in using the Euler method appears in Chapter 8. 
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A second observation from Table 2.7.2 is that, for a fixed step size h, the approximations 
become more accurate as t increases, at least for t > 2. For instance, for h = 0.1 the error for 
t — 5 is only 0.0668, which is a little more than one half of the error at t = 2. We will return to 
this matter later in this section. 

All in all, Euler’s method seems to work rather well for this problem. Reasonably good 
results are obtained even for a moderately large step size h = 0.1, and the approximation can 
be improved by decreasing h. 


Let us now look at another example. 


Consider the initial value problem 

EXAMPLE 

3 ^=4-t + 2y, y(0) = 1. (15) 

The general solution of this differential equation was found in Example 2 of Section 2.1, and 
the solution of the initial value problem (15) is 

y = -l + \t+^. (16) 

Use Euler’s method with several step sizes to find approximate values of the solution on the 
interval 0 < t < 5. Compare the results with the corresponding values of the solution (16). 

Using the same range of step sizes as in Example 2, we obtain the results presented in Table 
2.7.3. 


TABLE 2.7.3 A Comparison of Exact Solution with Euler’s Method for Several Step Sizes h 


for y 

= 4 — t + 2y, y(0) 

= 1 




t 

Exact 

h = 0.1 

h = 0.05 

h = 0.025 

h = 0.01 

0.0 

1.000000 

1.000000 

1.000000 

1.000000 

1.000000 

1.0 

19.06990 

15.77728 

17.25062 

18.10997 

18.67278 

2.0 

149.3949 

104.6784 

123.7130 

135.5440 

143.5835 

3.0 

1109.179 

652.5349 

837.0745 

959.2580 

1045.395 

4.0 

8197.884 

4042.122 

5633.351 

6755.175 

7575.577 

5.0 

60573.53 

25026.95 

37897.43 

47555.35 

54881.32 


The data in Table 2.7.3 again confirm our expectation that, for a given value of t, accuracy 
improves as the step size h is reduced. For example, for t — 1 the percentage error diminishes 
from 17.3% when h = 0.1 to 2.1% when h = 0.01. However, the error increases fairly rapidly 
as t increases for a fixed h. Even for h = 0.01, the error at t = 5 is 9.4%, and it is much 
greater for larger step sizes. Of course, the accuracy that is needed depends on the purpose 
for which the results are intended, but the errors in Table 2.7.3 are too large for most scientific 
or engineering applications. To improve the situation, one might either try even smaller step 
sizes or else restrict the computations to a rather short interval away from the initial point. 
Nevertheless, it is clear that Euler’s method is much less effective in this example than in 
Example 2. 
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To understand better what is happening in these examples, let us look again at 
Euler’s method for the general initial value problem (1) 

^r=/(M0> y(t 0 )=yo, 

whose solution we denote by 0(f ). Recall that a first order differential equation has 
an infinite family of solutions, indexed by an arbitrary constant c, and that the initial 
condition picks out one member of this infinite family by determining the value of c. 
Thus in the infinite family of solutions, 0 ( f ) is the one that satisfies the initial con- 
dition 0(f o ) = y 0 . 

At the first step Euler’s method uses the tangent line approximation to the graph of 
y — 0(f) passing through the initial point (f 0 ,yo), and this produces the approximate 
value yi at t\. Usually y\ ^ 0(h), so at the second step Euler’s method uses the 
tangent line approximation not to y — 0(f), but to a nearby solution y = 0i(f) that 
passes through the point (fi,yi). So it is at each following step. Euler’s method 
uses a succession of tangent line approximations to a sequence of different solutions 
0 (f ), 0i (f), 02(f), . . . of the differential equation. At each step the tangent line is 
constructed to the solution passing through the point determined by the result of 
the preceding step, as shown in Figure 2.7.4. The quality of the approximation after 
many steps depends strongly on the behavior of the set of solutions that pass through 
the points (f„,y„) for n = 1,2,3, 



In Example 2 the general solution of the differential equation is 

y = 14 — 4f + ce -,/2 (17) 

and the solution of the initial value problem (11) corresponds to c = —13. The family 
of solutions (17) is a converging family since the term involving the arbitrary constant 
c approaches zero as f — >• oo. It does not matter very much which solutions we are 
approximating by tangent lines in the implementation of Euler’s method, since all 
the solutions are getting closer and closer to each other as f increases. 
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On the other hand, in Example 3 the general solution of the differential equation is 

y = + \t + ce 2 \ (18) 

and this is a diverging family. Note that solutions corresponding to two nearby values 
of c become arbitrarily far apart as t increases. In Example 3 we are trying to follow 
the solution for c = 11/4, but in the use of Euler’s method we are actually at each 
step following another solution that separates from the desired one faster and faster 
as t increases. This explains why the errors in Example 3 are so much larger than 
those in Example 2. 

In using a numerical procedure such as the Euler method, you must always keep 
in mind the question of whether the results are accurate enough to be useful. In 
the preceding examples, the accuracy of the numerical results could be determined 
directly by a comparison with the solution obtained analytically. Of course, usually 
the analytical solution is not available if a numerical procedure is to be employed, 
so what is usually needed are bounds for, or at least estimates of, the error that do 
not require a knowledge of the exact solution. You should also keep in mind that 
the best that we can expect, or hope for, from a numerical approximation is that it 
reflects the behavior of the actual solution. Thus a member of a diverging family 
of solutions will always be harder to approximate than a member of a converging 
family. 

If you wish to read more about numerical approximations to solutions of initial 
value problems, you may go directly to Chapter 8 at this point. There we present 
some information on the analysis of errors and also discuss several algorithms that 
are computationally much more efficient than the Euler method. 


PROBLEMS Many of the problems in this section call for fairly extensive numerical computations. The 
amount of computing that it is reasonable for you to do depends strongly on the type of 
computing equipment that you have. A few steps of the requested calculations can be carried 
out on almost any pocket calculator — or even by hand if necessary. To do more, you will find at 
least a programmable calculator desirable, and for some problems a computer may be needed. 

Remember also that numerical results may vary somewhat depending on how your program 
is constructed and on how your computer executes arithmetic steps, rounds off, and so forth. 
Minor variations in the last decimal place may be due to such causes and do not necessarily 
indicate that something is amiss. Answers in the back of the book are recorded to six digits in 
most cases, although more digits were retained in the intermediate calculations. 


In each of Problems 1 through 4: 

(a) Find approximate values of the solution of the given initial value problem at t = 0.1, 0.2, 
0.3, and 0.4 using the Euler method with h = 0.1. 

(b) Repeat part (a) with h = 0.05. Compare the results with those found in (a). 

(c) Repeat part (a) with h = 0.025. Compare the results with those found in (a) and (b). 

(d) Find the solution y = ij>(t) of the given problem and evaluate <j>(t) at t = 0.1, 0.2, 0.3, and 
0.4. Compare these values with the results of (a), (b), and (c). 

l.? = 3 + t-y, y(0) = 1 4fl 2. y' = 2y — 1, y(0) = 1 

3. y = 0.5 - t + 2 y, y(0) = 1 4. / = 3 cos t - 2 y, y( 0) = 0 
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In each of Problems 5 through 10 draw a direction field for the given differential equation and 
state whether you think that the solutions are converging or diverging. 

4fl 5. y = 5 - 3^/y gl 6. y = y(3 - ty) 

4fl 7. / = (4 - ty)/ (1 + y 2 ) jfl 8. / = -ty + O.ly 3 

(fl 9. y' = f 2 ±y 2 10. y’ = (y 2 + 2ty)/(3 + t 2 ) 

In each of Problems 11 through 14 use Euler’s method to find approximate values of the 
solution of the given initial value problem at t = 0.5, 1, 1.5, 2, 2.5, and 3: 



(a) 

With h = 0.1. 


(b) With h = 0.05. 


(c) 

With h = 0.025. 


(d) With h = 0.01. 

41 

11. 

y = 5 - 3Vy, 

y(0) = 2 


41 

12. 

II 

1 

3 

y (0) = 0.5 


41 

13. 

y = (4 - oo/a 4 

-y 2 ), y (0) = —2 


41 

14. 

y' = — ty + O.ly 3 , 

y( 0) = 1 


41 

15. 

Consider the initial value problem 



y = 3t 2 /(3 y 2 - 4), y(l) = 0. 


(a) Use Euler’s method with h = 0.1 to obtain approximate values of the solution at 
t = 1.2, 1.4, 1.6, and 1.8. 

(b) Repeat part (a) with h = 0.05. 

(c) Compare the results of parts (a) and (b). Note that they are reasonably close for 
t = 1.2, 1.4, and 1.6 but are quite different for t = 1.8. Also note (from the differential 
equation) that the line tangent to the solution is parallel to the y-axis when y = ±2/\/3 
= ±1.155. Explain how this might cause such a difference in the calculated values. 

01, 16. Consider the initial value problem 

y' = t 2 +y 2 , y(0) = 1. 

Use Euler’s method with h = 0.1, 0.05, 0.025, and 0.01 to explore the solution of this 
problem for 0 < t < 1. What is your best estimate of the value of the solution at t = 0.8? 
At t = 1? Are your results consistent with the direction field in Problem 9? 

17. Consider the initial value problem 

/= (y 2 ±2fy)/(3±t 2 ), y(l) = 2. 

Use Euler’s method with h = 0.1, 0.05, 0.025, and 0.01 to explore the solution of this 
problem for 1 < t < 3. What is your best estimate of the value of the solution at t = 2.5? 
At t = 3? Are your results consistent with the direction field in Problem 10? 

18. Consider the initial value problem 

y = -ty + O.ly 3 , y(0) = a, 

where a is a given number. 

(a) Draw a direction field for the differential equation (or reexamine the one from Prob- 
lem 8). Observe that there is a critical value of a in the interval 2 < a < 3 that separates 
converging solutions from diverging ones. Call this critical value ag. 

(b) Use Euler’s method with h = 0.01 to estimate ag. Do this by restricting a 0 to an 
interval [ a , b], where b — a = 0.01. 

V 19. Consider the initial value problem 

y' = y 2 - f 2 , y(0) = a, 

where a is a given number. 
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(a) Draw a direction field for the differential equation. Observe that there is a critical 
value of a in the interval 0 < a < 1 that separates converging solutions from diverging 
ones. Call this critical value a 0 . 

(b) Use Euler’s method with h = 0.01 to estimate a 0 . Do this by restricting a 0 to an 
interval [a, b], where b — a = 0.01. 

20. Convergence of Euler’s Method. It can be shown that, under suitable conditions on /, 
the numerical approximation generated by the Euler method for the initial value problem 
y' = f(t,y), y(t 0 ) = y 0 converges to the exact solution as the step size h decreases. This is 
illustrated by the following example. Consider the initial value problem 


/ = l-t + y, y(k)=yo- 

(a) Show that the exact solution is y = cp(t) = (y 0 — to)e ,_ '° + t. 

(b) Using the Euler formula, show that 

y k = (1 + h)y k _ i + h - ht k _i, k = 1, 2 

(c) Noting that yq = (1 + h)(y 0 — to) + h, show by induction that 

y„ = (1 + h) n (y 0 - to) + t n (i) 


for each positive integer n. 

(d) Consider a fixed point t > t 0 and for a given n choose h = (t — to)/n. Then t„ = t for 
every n. Note also that h — >• 0 as n — > oo. By substituting for h in Eq. (i) and letting 
n — ► oo, show that y n <j>(t) as n — > oo. 

Hint: lim (1 + a/n) n = e a . 

n—> oo 


In each of Problems 21 through 23 use the technique discussed in Problem 20 to show that 
the approximation obtained by the Euler method converges to the exact solution at any fixed 
point as h -*■ 0. 

21 . / = y, y(<0) = 1 

22. y' = 2y — 1, v(0) = 1 Hint: = (1 + 2h)/2 + 1/2 

23. y' = \ — t + 2 y, y( 0) = 1 Hint: yj = (1 + 2 h) + t\[ 2 
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In this section we discuss the proof of Theorem 2.4.2, the fundamental existence and 
uniqueness theorem for first order initial value problems. This theorem states that 
under certain conditions on f(t,y), the initial value problem 

y'=f(t,y), y(to)=y 0 (l) 

has a unique solution in some interval containing the point to. 

In some cases (for example, if the differential equation is linear) the existence 
of a solution of the initial value problem (1) can be established directly by actually 
solving the problem and exhibiting a formula for the solution. However, in general, 
this approach is not feasible because there is no method of solving the differential 
equation that applies in all cases. Therefore, for the general case, it is necessary to 
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Theorem 2.8.1 


adopt an indirect approach that demonstrates the existence of a solution of Eqs. (1) 
but usually does not provide a practical means of finding it. The heart of this method is 
the construction of a sequence of functions that converges to a limit function satisfying 
the initial value problem, although the members of the sequence individually do 
not. As a rule, it is impossible to compute explicitly more than a few members 
of the sequence; therefore the limit function can be determined only in rare cases. 
Nevertheless, under the restrictions on f(t,y) stated in Theorem 2.4.2, it is possible 
to show that the sequence in question converges and that the limit function has 
the desired properties. The argument is fairly intricate and depends, in part, on 
techniques and results that are usually encountered for the first time in a course on 
advanced calculus. Consequently, we do not go into all the details of the proof here; 
we do, however, indicate its main features and point out some of the difficulties that 
must be overcome. 

First of all, we note that it is sufficient to consider the problem in which the initial 
point (f 0 ,yo) is the origin; that is, we consider the problem 

y' =f(t,y), y( 0 ) = 0 . ( 2 ) 

If some other initial point is given, then we can always make a preliminary change 
of variables, corresponding to a translation of the coordinate axes, that will take the 
given point (fo,yo) into the origin. The existence and uniqueness theorem can now 
be stated in the following way. 


If / and df/dy are continuous in a rectangle R: \t\ < a , |y| < b , then there is 
some interval |f| < h < a in which there exists a unique solution y = 0(f) of the 
initial value problem (2). 


For the method of proof discussed here it is necessary to transform the initial value 
problem (2) into a more convenient form. If we suppose temporarily that there is a 
differentiable function y = 0(f) that satisfies the initial value problem, then/[f,0(f)] 
is a continuous function of t only. Hence we can integrate y' = /(f,y) from the initial 
point t = 0 to an arbitrary value of f, obtaining 

0(0 = f f[s,(p(s)]ds, (3) 

Jo 

where we have made use of the initial condition 0 (0) = 0. We also denote the dummy 
variable of integration by s. 

Since Eq. (3) contains an integral of the unknown function 0, it is called an integral 
equation. This integral equation is not a formula for the solution of the initial value 
problem, but it does provide another relation satisfied by any solution of Eqs. (2). 
Conversely, suppose that there is a continuous function y = 0(f) that satisfies the 
integral equation (3); then this function also satisfies the initial value problem (2). To 
show this, we first substitute zero for f in Eq. (3), which shows that the initial condition 
is satisfied. Further, since the integrand in Eq. (3) is continuous, it follows from the 
fundamental theorem of calculus that 0 is differentiable, and that 0'(f) = f[t, 0(f)]. 
Therefore the initial value problem and the integral equation are equivalent in the 
sense that any solution of one is also a solution of the other. It is more convenient 
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to show that there is a unique solution of the integral equation in a certain interval 
|f| < h. The same conclusion will then hold also for the initial value problem. 

One method of showing that the integral equation (3) has a unique solution is 
known as the method of successive approximations or Picard’s 19 iteration method. 
In using this method, we start by choosing an initial function 0o, either arbitrarily or 
to approximate in some way the solution of the initial value problem. The simplest 
choice is 

0o (0 = 0; (4) 

then 0o at least satisfies the initial condition in Eqs. (2), although presumably not the 
differential equation. The next approximation 0i is obtained by substituting 0o f.v) 
for 0 (s) in the right side of Eq. (3) and calling the result of this operation 0 1 (t). Thus 

0t (0 = f f[s,<Po(.s)]ds. (5) 

Jo 

Similarly, 02 is obtained from 0 1 : 

02(0= f /[.V, 01 (.S’)] ds, (6) 

Jo 

and, in general, 

0/1+1 (0 = f f[s,(pn(s)]ds. (7) 

Jo 

In this manner we generate the sequence of functions {0„} = 0o,0i, . . . , 0„, Each 

member of the sequence satisfies the initial condition, but in general none satisfies 
the differential equation. However, if at some stage, say, for n = k, we find that 
cp k+1 (t) — (p k (0, then it follows that (p k is a solution of the integral equation (3). Hence 
(f>k is also a solution of the initial value problem (2), and the sequence is terminated 
at this point. In general, this does not occur, and it is necessary to consider the entire 
infinite sequence. 

To establish Theorem 2.8.1, we must answer four principal questions: 

1. Do all members of the sequence { <j> n j exist, or may the process break down at some stage? 

2 . Does the sequence converge? 

3 . What are the properties of the limit function? In particular, does it satisfy the integral 
equation (3) and hence the initial value problem (2)? 

4 . Is this the only solution, or may there be others? 

We first show how these questions can be answered in a specific and relatively simple 
example and then comment on some of the difficulties that may be encountered in 
the general case. 


w Charles-Emile Picard (1856-1914), except for Henri Poincare, perhaps the most distinguished French 
mathematician of his generation, was appointed professor at the Sorbonne before the age of 30. He 
is known for important theorems in complex variables and algebraic geometry as well as differential 
equations. A special case of the method of successive approximations was first published by Liouville 
in 1838. However, the method is usually credited to Picard, who established it in a general and widely 
applicable form in a series of papers beginning in 1890. 
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EXAMPLE 

1 


Solve the initial value problem 


y = 2t{\+y), y(0) = 0 

by the method of successive approximations. 

Note first that if y = 0(f), then the corresponding integral equation is 

0(f) = f 2s[l + 0(s)] ds. 

Jo 

If the initial approximation is 0o(f) = 0, it follows that 

0i (f) = [ 2s[l + 0o(s)] ds = f 2s ds = t 2 . 


Similarly, 


and 






02 (f) = / 2 s[l + 0 i(i)] ds = / 2s[l + s 1 ] ds = t 1 + — 


f 

2 


f 

Jo 


[ 2s 

2 ^ 

1 + r + — 

Jo 

2 


ds — f -j- — -j- 


2-3 


Equations (10), (11), and (12) suggest that 


<Pn(t) — + — + — + - 

2! 3! n\ 


( 8 ) 

(9) 

(10) 

( 11 ) 

( 12 ) 

(13) 


for each n > 1, and this result can be established by mathematical induction, as follows. Equa- 
tion (13) is certainly true for n = 1; see Eq. (10). We must show that if it is true for n = k, then 
it also holds for n = k + 1. We have 


-f 


(pk+lit) = I 2s[l + (pk(s)] ds 

= l 2 s { 1 +s 2 + h + '" + id ]ds 


= r + 2! + 3! + 


2k+2 


{k + 1 )! ’ 


(14) 


and the inductive proof is complete. 

A plot of the first four iterates, 0i(f), . . . , 04(f), is shown in Figure 2.8.1. As k increases, 
the iterates seem to remain close over a gradually increasing interval, suggesting eventual 
convergence to a limit function. 

It follows from Eq. (13) that 0„(f) is the uth partial sum of the infinite series 


'O f2k 


(15) 


hence lim 0„(f) exists if and only if the series (15) converges. Applying the ratio test, we see 

n— >co 

that, for each f, 

t 2k+2 f 2 

0 as k — * oo. (16) 


(k + 1 )! t 2k 


k + 1 


Thus the series (15) converges for all f, and its sum 0(f) is the limit of the sequence (0„(f)l- 
Further, since the series (15) is a Taylor series, it can be differentiated or integrated term by 
term as long as f remains within the interval of convergence, which in this case is the entire 

OO 

f-axis. Therefore, we can verify by direct computation that 0(f) = t 2k /k\ is a solution of the 

k = 1 

integral equation (9). Alternatively, by substituting 0(f) for y in Eqs. (8), we can verify that 
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FIGURE 2.8.1 Plots of <j>i (f), </> 4 (t) for Example 1. 


this function satisfies the initial value problem. In this example it is also possible, from the 
series (15), to identify <p in terms of elementary functions, namely, 4>(t) = e r — 1. However, 
this is not necessary for the discussion of existence and uniqueness. 

Explicit knowledge of f (/) does make it possible to visualize the convergence of the se- 
quence of iterates more clearly by plotting <p(t) — <j> k (t ) for various values of k. Figure 2.8.2 
shows this difference for k = 1, ... ,4. This figure clearly shows the gradually increasing inter- 
val over which successive iterates provide a good approximation to the solution of the initial 
value problem. 



FIGURE 2.8.2 Plots of </j (t ) — (pp-(t) for Example 1 for k = 1, .... 4. 


Finally, to deal with the question of uniqueness, let us suppose that the initial value problem 
has two solutions <f> and i jr. Since (f> and ij/ both satisfy the integral equation (9), we have by 
subtraction that 

fit) - fif) = [ 2s[4>(s) - i/r(s)] ds. 

Jo 
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Taking absolute values of both sides, we have, if f > 0, 


If 


10(0 - 0(01 = / 2s[<t>(s) - 0(s)] ds 


f 


< I 2s\<p(s) — 0(s)| ds. 


If we restrict f to lie in the interval 0 < t < A/2, where A is arbitrary, then 2 1 < A , and 


10(0 - 0(01 < A [ 10(0 - 0(01 ds. 
Jo 

It is now convenient to introduce the function U defined by 

U(t)= [ 10(0 - 0(01 ds. 

Jo 

Then it follows at once that 


(17) 


(18) 


(7(0) = 0, (19) 

(7(f) > 0, for f > 0. (20) 

Further, (7 is differentiable, and 17' (0 = 10(0 — 0(01- Hence, by Eq. (17), 

17' (0 -0(7(0 < 0. (21) 

Multiplying Eq. (21) by the positive quantity e~ At gives 

[e~ A, U(t)f < 0. (22) 

Then, upon integrating Eq. (22) from zero to f and using Eq. (19), we obtain 

e~ At (7 (0 £ 0 for t > 0. 

Hence (7(0 < 0 for f > 0, and in conjunction with Eq. (20), this requires that (7(0 = 0 for each 
f > 0. Thus (7'(0 = 0, and therefore 0(f) = 0(f), which contradicts the original hypothesis. 
Consequently, there cannot be two different solutions of the initial value problem for f > 0. 
A slight modification of this argument leads to the same conclusion for f < 0. 


Returning now to the general problem of solving the integral equation (3), let us 
consider briefly each of the questions raised earlier: 

1. Do all members of the sequence (0„) exist? In the example, / and df/dy were continuous 
in the whole ty- plane, and each member of the sequence could be explicitly calculated. 
In contrast, in the general case, / and df/dy are assumed to be continuous only in the 
rectangle R: |f| < a, |y| < b (see Figure 2.8.3). Furthermore, the members of the sequence 
cannot as a rule be explicitly determined. The danger is that at some stage, say, for n = k, 
the graph of y = 0A. (f) may contain points that lie outside the rectangle R. Hence at the 
next stage — in the computation of <pk+i(t) — it would be necessary to evaluate f(t,y) at 
points where it is not known to be continuous or even to exist. Thus the calculation of 
4>k+i (f) might be impossible. 

To avoid this danger, it may be necessary to restrict f to a smaller interval than |f| < a. 
To find such an interval, we make use of the fact that a continuous function on a closed 
bounded region is bounded. Hence / is bounded on R; thus there exists a positive number 
M such that 

\f(t,y)\<M, (t,y) in R. (23) 

We have mentioned before that 

0„(O) =0 
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FIGURE 2.8.3 Region of definition for Theorem 2.8.1. 

for each n. Since is equal to <j>' k+l (t), the maximum absolute slope of the graph 

of the equation y = 4>k+i(t) is M. Since this graph contains the point (0,0), it must lie 
in the wedge-shaped shaded region in Figure 2.8.4. Hence the point [t,4>k+i(t)] remains 
in R at least as long as R contains the wedge-shaped region, which is for \t\ < b/M. We 
hereafter consider only the rectangle D: \t\ < h, |y| < b , where h is equal either to a or to 
b/M, whichever is smaller. With this restriction, all members of the sequence {<j>„(t)} exist. 
Note that whenever b/M < a, then you can try to obtain a larger value of h by finding a 
better (that is, smaller) bound M for \f(t,y)\, if this is possible. 



(a) (b) 

FIGURE 2.8.4 Regions in which successive iterates lie. (a) b/M < a;(b) b/M > a. 


2. Does the sequence {0„(O ) converge? As in the example, we can identify 0„(f ) = + 

[02 (0 — 0i (01 + • • • + [0„(O — 0„-i(t)] as the nth partial sum of the series 

OO 

Mt) + J2 [&+i(0 -0HOJ. (24) 

k=l 

The convergence of the sequence {</)„(/)} is established by showing that the series (24) 
converges. To do this, it is necessary to estimate the magnitude \cf>k+i(t) — 4>k(t)\ of the 
general term. The argument by which this is done is indicated in Problems 15 through 
18 and will be omitted here. Assuming that the sequence converges, we denote the limit 
function by 0, so that 

0(0 = lim 0„(O- (25) 

n—> oo 

3. What are the properties of the limit function 0? In the first place, we would like to know 
that 0 is continuous. This is not, however, a necessary consequence of the convergence 
of the sequence (0„(O), even though each member of the sequence is itself continuous. 
Sometimes a sequence of continuous functions converges to a limit function that is dis- 
continuous. A simple example of this phenomenon is given in Problem 13. One way to 
show that 0 is continuous is to show not only that the sequence (0,,) converges, but also 
that it converges in a certain manner, known as uniform convergence. We do not take up 
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PROBLEMS 


this question here but note only that the argument referred to in paragraph 2 is sufficient 
to establish the uniform convergence of the sequence (0„) and, hence, the continuity of 
the limit function 0 in the interval |f| < h. 

Now let us return to Eq. (7) 


0K+1 (t) = / f[s, 4 >n(s)]ds. 
Jo 

Allowing n to approach oo on both sides, we obtain 




= lim f 

«-*• 00 Jo 


f[S,<t>n(s ) ] ds. 


(26) 


We would like to interchange the operations of integrating and taking the limit on the 
right side of Eq. (26) so as to obtain 


0(f) = f lim f[s,<p n (s)]ds. (27) 

Jo n ^°° 

In general, such an interchange is not permissible (see Problem 14, for example), but once 
again, the fact that the sequence {0„(f)J converges uniformly is sufficient to allow us to 
take the limiting operation inside the integral sign. Next, we wish to take the limit inside 
the function/, which would give 


and hence 


0(f) = / f[s, lim 0„(y)] ds 

Jo n ^°° 

</>(<)= f f[s,4>(.s)]ds. 

Jo 


(28) 


(29) 


The statement that 


lim f[s,4> n (s)] =f[s, lim 0„(s)] 

n — >oo n— >oo 

is equivalent to the statement that / is continuous in its second variable, which is known by 
hypothesis. Hence Eq. (29) is valid, and the function 0 satisfies the integral equation (3). 
Thus 0 is also a solution of the initial value problem (2). 

4 . Are there other solutions of the integral equation (3) besides y = 0(f)? To show the 
uniqueness of the solution y = 0 (f ) , we can proceed much as in the example. First, assume 
the existence of another solution y = 0(f). It is then possible to show (see Problem 19) 
that the difference 0(f) — 0(f) satisfies the inequality 

10(f) - 0(01 < A f \4>(s) - ij/(s)\ds (30) 

Jo 

for 0 < f < h and a suitable positive number A. From this point the argument is identical 
to that given in the example, and we conclude that there is no solution of the initial value 
problem (2) other than the one generated by the method of successive approximations. 


In each of Problems 1 and 2 transform the given initial value problem into an equivalent 
problem with the initial point at the origin. 

1 . dy/dt = f 2 + y * 1 , y(l) = 2 


2. dy/dt = 1 — y 3 , y(— 1) = 3 
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In each of Problems 3 through 6 let <f>o(t) = 0 and use the method of successive approximations 
to solve the given initial value problem. 

(a) Determine <p„(t) for an arbitrary value of n. 

(b) Plot 4>„(t ) for n= 1, ... ,4. Observe whether the iterates appear to be converging. 

(c) Express lim 0„(t) = 0(r) in terms of elementary functions; that is, solve the given initial 

«— ► OO 

value problem. 

(d) Plot |0(f) — 0„(OI for n = 1, 4. For each of 0i(f), ■ ■ ■ ,04(0, estimate the interval in 

which it is a reasonably good approximation to the actual solution. 

jfl 3. y' — 2(y + 1), y(0) = 0 jfl 4. y' = -y-l, y(0) = 0 

5. yf = -y/2 + t, y(0) = 0 6. y' = y + 1 - f, y(0) = 0 

In each of Problems 7 and 8 let 0o(O = 0 and use the method of successive approximations to 
solve the given initial value problem. 

(a) Determine 0„(O for an arbitrary value of n. 

(b) Plot 0„(O for n = 1, ... ,4. Observe whether the iterates appear to be converging. 

7 . y' = ty + 1, y(0) = 0 jfl 8. y' = t 2 y - 1, y(0) = 0 

In each of Problems 9 and 10 let 0 o (O = 0 and use the method of successive approximations 
to approximate the solution of the given initial value problem. 

(a) Calculate 0i (t), ... ,03 (t). 

(b) Plot 0i it ), . . . , 03 (t) and observe whether the iterates appear to be converging. 

4 fl 9. y' = t 2 +y 2 , y 10) = 0 ^10. y' = l-y 3 , y(0) = 0 

In each of Problems 11 and 12 let 0 o (f) = 0 and use the method of successive approximations 
to approximate the solution of the given initial value problem. 

(a) Calculate 0i(f), • • • , 04(0, or (if necessary) Taylor approximations to these iterates. Keep 
terms up to order six. 

(b) Plot the functions you found in part (a) and observe whether they appear to be converging. 

11. y'=- siny + 1, y(0) = 0 jfl 12. / = (3 1 2 + 4r + 2)/2 (y - 1), y(0) = 0 


13. Let <t>„(x) — x n for 0 < x < 1 and show that 

(0, 0 < x < 1, 

lim 0„(x) = { 

II, X = 1. 

This example shows that a sequence of continuous functions may converge to a limit 
function that is discontinuous. 

2 

14. Consider the sequence <j> n {x) = 2nxe~ ra , 0 < x < 1. 

(a) Show that lim 0„(x) = 0 for 0 < x < 1; hence 


/ lim 0„(x) dx = 0. 

Jo n ^°° 

(b) Show that / 2 nxe~ nx " dx = 1 — e~"\ hence 
Jo 

lim / 0/j (x) dx = 1 . 

,^oo J 0 

Thus, in this example, 

pb pb 

lim / 0„ (x) dxj^ lim 0„ (x) dx, 

«^°o J a J a oo 

even though lim 0„ (x) exists and is continuous. 
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In Problems 15 through 18 we indicate how to prove that the sequence (0„(O ), defined by 
Eqs. (4) through (7), converges. 

15. If 3 f/dy is continuous in the rectangle D, show that there is a positive constant K such 
that 

- f(t,y 2 )\ < K\y t -y 2 \, (i) 

where (t,y\) and (t,y 2 ) are any two points in D having the same t coordinate. This inequality 
is known as a Lipschitz 20 condition. 

Hint: Hold t fixed and use the mean value theorem on / as a function of y only. Choose 
K to be the maximum value of \df /dy \ in D. 

16. If (fin—i (0 and 0„(O are members of the sequence {0„(Oh use the result of Problem 15 to 
show that 

-fit, 0,-1 (0]| < K\4>n(t) - 


17. (a) Show that if \t\ < h, then 

1 0i (01 £ M\t\, 

where M is chosen so that \f(t,y)\ < M for {t,y) in D. 

(b) Use the results of Problem 16 and part (a) of Problem 17 to show that 

MK\t\ 2 

102 ( 0 - 01(01 < - - ^ LL - 

(c) Show, by mathematical induction, that 


MK n ~ 1 \t\ n MK n -'h n 

10,(0 - 0n—i (01 < r 1 ^ < ; 


18. Note that 


0 «(O — 01 (0 + [ 02(0 — 0 i (01 + • • • + [ 0,(0 — 0 «— 1 ( 01 - 

(a) Show that 

10,(01 < 101(01 + 102(0 - 01(01 + • • • + 1 07, (0 - 0,-1 (01- 

(b) Use the results of Problem 17 to show that 


M 

10,(01 < y 


( Kh ) 2 (Kh) n 

Kh + — — + • • • + — 

2! n! 


(c) Show that the sum in part (b) converges as n — > oo and, hence, the sum in part (a) 
also converges as n — > oo. Conclude therefore that the sequence (0„(O! converges since 
it is the sequence of partial sums of a convergent infinite series. 

19. In this problem we deal with the question of uniqueness of the solution of the integral 
equation (3) 


0(0= f f[s,<p(s)]ds. 

Jo 

(a) Suppose that tp and \jr are two solutions of Eq. (3). Show that, for t > 0, 

0(0-0 (0 = [ {/E,0Cs)l - f\s, 0 (01 ! ds. 

Jo 


20 Rudolf Lipschitz (1832-1903), professor at the University of Bonn for many years, worked in several ar- 
eas of mathematics. The inequality (i) can replace the hypothesis that 3 f/dy is continuous in Theorem 2.8.1; 
this results in a slightly stronger theorem. 
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(b) Show that 

10(0- 0(01 < f \f[s,(p(s)] - f[s,i//(s)]\ds. 

Jo 

(c) Use the result of Problem 15 to show that 

10(0 - 0(01 <K f |0 (s) - 0COI ds, 

Jo 

where K is an upper bound for \df/dy\ in D. This is the same as Eq. (30), and the rest of 
the proof may be constructed as indicated in the text. 
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Although a continuous model leading to a differential equation is reasonable and 
attractive for many problems, there are some cases in which a discrete model may 
be more natural. For instance, the continuous model of compound interest used 
in Section 2.3 is only an approximation to the actual discrete process. Similarly, 
sometimes population growth may be described more accurately by a discrete than 
by a continuous model. This is true, for example, of species whose generations do 
not overlap and that propagate at regular intervals, such as at particular times of 
the calendar year. Then the population y n+ \ of the species in the year n + 1 is some 
function of n and the population y n in the preceding year; that is, 

y n +i =f(n,y n ), n = 0,1,2,.... (1) 

Equation (1) is called a first order difference equation. It is first order because the 
value of y n+ \ depends on the value of y n but not on earlier values y„_i , y n - 2 , and so 
forth. As for differential equations, the difference equation (1) is linear if / is a linear 
function of y„; otherwise, it is nonlinear. A solution of the difference equation (1) is 
a sequence of numbers ya,y\,yi , . . . that satisfy the equation for each n. In addition 
to the difference equation itself, there may also be an initial condition 

yo = ol (2) 

that prescribes the value of the first term of the solution sequence. 

We now assume temporarily that the function/ in Eq. (1) depends only on y n , but 
not on n. In this case 


y n +i=f(yn), n = 0,1,2,.... (3) 

If yo is given, then successive terms of the solution can be found from Eq. (3). Thus 

yi =f(yo), 


and 

yi =/(yi) =f[f(yo)]- 

The quantity /[/(yo)] is called the second iterate of the difference equation and is 
sometimes denoted by / 2 (yo). Similarly, the third iterate y 3 is given by 

y.3 = fiyi) =/{/[/ (yo)]} =/ 3 (yo), 
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and so on. In general, the nth iterate y n is 

y n =f(y n - 1) =/"(yo)- 

This procedure is referred to as iterating the difference equation. It is often of pri- 
mary interest to determine the behavior of y n as n -» oo; in particular, does y n ap- 
proach a limit, and if so, what is it? 

Solutions for which y n has the same value for all n are called equilibrium solutions. 
They are frequently of special importance, just as in the study of differential equations. 
If equilibrium solutions exist, one can find them by setting y n+ \ equal to y n in Eq. (3) 
and solving the resulting equation 

y n =f(y n ) (4) 

for y n . 


Linear Equations. Suppose that the population of a certain species in a given region in 
year n + 1, denoted by y n +i, is a positive multiple p n of the population y n in year n\ 
that is, 

y n +i = Pnyn, n = 0,1,2,.... (5) 

Note that the reproduction rate p n may differ from year to year. The difference 
equation (5) is linear and can easily be solved by iteration. We obtain 


y t = Poyo, 

y2 = piyi = pi/wo, 

and, in general, 

y„ = Pn-1 ■ ■■ Poyo, n = 1,2,.... (6) 


Thus, if the initial population yo is given, then the population of each succeeding 
generation is determined by Eq. (6). Although for a population problem p n is intrin- 
sically positive, the solution (6) is also valid if p n is negative for some or all values of 
n. Note, however, that if p n is zero for some n, then y n+ \ and all succeeding values of 
y are zero; in other words, the species has become extinct. 

If the reproduction rate p n has the same value p for each n, then the difference 
equation (5) becomes 

y n + 1 = py n (7) 

and its solution is 

y n = p n yo- ( 8 ) 


Equation (7) also has an equilibrium solution, namely, y n — 0 for all n. corresponding 
to the initial value yo = 0. The limiting behavior of y„ is easy to determine from 
Eq. (8). In fact, 

fO, if Ip I < i; 


lim y„ = y 0 , if P = 1; 

n—> oo 

does not exist, otherwise. 


( 9 ) 


In other words, the equilibrium solution y„ = 0 is asymptotically stable for \p\ < 1 
and unstable if |p| > 1. 

Now we will modify the population model represented by Eq. (5) to include the 
effect of immigration or emigration. If b n is the net increase in population in year 
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n due to immigration, then the population in year n + 1 is the sum of those due to 
natural reproduction and those due to immigration. Thus 

y n + 1 = py n + bn, n = 0, 1, 2, , (10) 

where we are now assuming that the reproduction rate p is constant. We can solve 
Eq. (10) by iteration in the same manner as before. We have 

yi = pyo + b 0 , 

yi = p(pyo + bo) + bi = p 2 y 0 + pb 0 + b u 
y3 = p(p 2 yo + pbo + bi) + b 2 = p 3 yo + p 2 b 0 + pbi + b 2 , 

and so forth. In general, we obtain 

n — 1 

yn = p n yo + p n l bo H h pbn - 2 + b n - 1 = p n yo + ^2 P n A i bj. (11) 

/— o 

Note that the first term on the right side of Eq. (11) represents the descendants of 
the original population, while the other terms represent the population in year n 
resulting from immigration in all preceding years. 

In the special case where b n — b ^ 0 for all n, the difference equation is 

y n + 1 = py„ + b, ( 12 ) 

and from Eq. (11) its solution is 

yn = P n yo + (l + p + p~ + ■ ■ ■ + p n l )b. (13) 

If p ^ 1, we can write this solution in the more compact form 

y n = p n yo + \ — —b, (14) 

1 - p 

where again the two terms on the right side are the effects of the original population 
and of immigration, respectively. Rewriting Eq. (14) as 

y n = P n (yo - ~r—] + Y~— ( 15 ) 

V i-p/ i - p 

makes the long-time behavior of y n more evident. It follows from Eq. (15) that 
y n —> b/( 1 — p) if |p| < 1. If \p\ >1 or if p = — 1 then y n has no limit unless 
yo = b/(l — p). The quantity b/(l — p), for p ^ 1, is an equilibrium solution of 
Eq. (12), as can readily be seen directly from that equation. Of course, Eq. (14) 
is not valid for p = 1. To deal with that case, we must return to Eq. (13) and let p — 1 
there. It follows that 

y n = yo + nb, (16) 


so in this case y„ becomes unbounded as n — ► oo. 

The same model also provides a framework for solving many problems of a finan- 
cial character. For such problems y„ is the account balance in the nth time period, 
p n — 1 + r n , where r n is the interest rate for that period, and b n is the amount 
deposited or withdrawn. The following example is typical. 
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EXAMPLE 

1 


A recent college graduate takes out a $10,000 loan to purchase a car. If the interest rate is 
12%, what monthly payment is required to pay off the loan in 4 years? 

The relevant difference equation is Eq. (12), where y n is the loan balance outstanding in the 
nth month, p = 1 + r, where r is the interest rate per month, and b is the effect of the monthly 
payment. Note that p = 1.01, corresponding to a monthly interest rate of 1%. Since payments 
reduce the loan balance, b must be negative; the actual payment is \b\. 

The solution of the difference equation (12) with this value for p and the initial condition 
y 0 = 10,000 is given by Eq. (15); that is, 

y n = (1.01)"(10,000 + 100b) - 100b. (17) 


The value of b needed to pay off the loan in 4 years is found by setting y& = 0 and solving 
for b. This gives 


b = 


-100 


(1.01) 48 
( 1 . 01) 48 - 1 


-263.34. 


(18) 


The total amount paid on the loan is 48 times |b|, or $12,640.32. Of this amount, $10,000 is 
repayment of the principal and the remaining $2640.32 is interest. 


Nonlinear Equations. Nonlinear difference equations are much more complicated and 
have much more varied solutions than linear equations. We will restrict our attention 
to a single equation, the logistic difference equation 

y n + 1 = py„ (l - j-) , (19) 

which is analogous to the logistic differential equation 



that was discussed in Section 2.5. Note that if the derivative dy/dt in Eq. (20) is 
replaced by the difference (y n+ 1 — y„)/h , then Eq. (20) reduces to Eq. (19) with 
p = 1 + hr and k — (1 + hr)K/hr. To simplify Eq. (19) a little more, we can scale the 
variable y n by introducing the new variable u„ — y n /k. Then Eq. (19) becomes 

m„+i = pu n (l - u n ), (21) 

where p is a positive parameter. 

We begin our investigation of Eq. (21) by seeking the equilibrium, or constant, so- 
lutions. These can be found by setting n„ + i equal to u n in Eq. (21), which corresponds 
to setting dy/dt equal to zero in Eq. (20). The resulting equation is 

U n = pu,t - pu 2 n , (22) 

so it follows that the equilibrium solutions of Eq. (21) are 

u n = 0, u n = — . (23) 

P 

The next question is whether the equilibrium solutions are asymptotically stable or 
unstable. That is, for an initial condition near one of the equilibrium solutions, does 
the resulting solution sequence approach or depart from the equilibrium solution? 
One way to examine this question is by approximating Eq. (21) by a linear equation 
in the neighborhood of an equilibrium solution. For example, near the equilibrium 
solution u n — 0, the quantity u 2 n is small compared to u„ itself, so we assume that we 
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can neglect the quadratic term in Eq. (21) in comparison with the linear terms. This 
leaves us with the linear difference equation 

w„+i = pu n , (24) 


which is presumably a good approximation to Eq. (21) for u„ sufficiently near zero. 
However, Eq. (24) is the same as Eq. (7), and we have already concluded, in Eq. (9), 
that u n -» 0 as n — »■ oo if and only if \p\ <1, or (since p must be positive) for 
0 < p < 1. Thus the equilibrium solution u n = 0 is asymptotically stable for the linear 
approximation (24) for this set of p values, so we conclude that it is also asymptoti- 
cally stable for the full nonlinear equation (21). This conclusion is correct, although 
our argument is not complete. What is lacking is a theorem stating that the solutions 
of the nonlinear equation (21) resemble those of the linear equation (24) near the 
equilibrium solution u n = 0. We will not take time to discuss this issue here; the same 
question is treated for differential equations in Section 9.3. 

Now consider the other equilibrium solution u n = (p — 1 ) / p . To study solutions 
in the neighborhood of this point, we write 


p - 1 

Mn — 4 “ Vn> 

P 


(25) 


where we assume that v n is small. By substituting from Eq. (25) in Eq. (21) and 
simplifying the resulting equation, we eventually obtain 

v n +i = (2- p)v n - pv 2 n . (26) 


Since v n is small, we again neglect the quadratic term in comparison with the linear 
terms and thereby obtain the linear equation 


Vn + 1 — (2 P)Vn- 


(27) 


Referring to Eq. (9) once more, we find that v n -> 0 as n — »■ oo for \2 — p\ < 1, that 
is, for 1 < p < 3. Therefore we conclude that, for this range of values of p, the 
equilibrium solution u n = (p — l)/p is asymptotically stable. 

Figure 2.9.1 contains the graphs of solutions of Eq. (21) for p = 0.8, p = 1.5, and 
p = 2.8, respectively. Observe that the solution converges to zero for p = 0.8 and 
to the nonzero equilibrium solution for p = 1.5 and p = 2.8. The convergence is 
monotone for p = 0.8 and p = 1.5 and is oscillatory for p = 2.8. The graphs shown 
are for particular initial conditions, but the graphs for other initial conditions are 
similar. 
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FIGURE 2.9.1 Solutions of u n+ 1 = pw„(l — u n ): (a) p = 0.8; ( b ) p = 1.5; (c) p = 2.8. 
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Another way of displaying the solution of a difference equation is shown in 
Figure 2.9.2. In each part of this figure the graphs of the parabola y — px{ 1 — x) 
and of the straight line y — x are shown. The equilibrium solutions correspond to 
the points of intersection of these two curves. The piecewise linear graph consisting 
of successive vertical and horizontal line segments, sometimes called a stairstep di- 
agram, represents the solution sequence. The sequence starts at the point uq on the 
.r-axis. The vertical line segment drawn upward to the parabola at «o corresponds to 
the calculation of puo(l — »o ) = iq. This value is then transferred from the y-axis to 
the x-axis; this step is represented by the horizontal line segment from the parabola 
to the line y = x. Then the process is repeated over and over again. Clearly, the 
sequence converges to the origin in Figure 2.9.2 a and to the nonzero equilibrium 
solution in the other two cases. 
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To summarize our results so far: the difference equation (21) has two equilib- 
rium solutions, u n = 0 and u„ = (p — 1 )/p; the former is asymptotically stable for 
0 < P < 1, and the latter is asymptotically stable for 1 < p < 3. When p = 1, the two 
equilibrium solutions coincide at u = 0; this solution can be shown to be asymptoti- 
cally stable. In Figure 2.9.3 the parameter p is plotted on the horizontal axis and u on 
the vertical axis. The equilibrium solutions u — 0 and u = (p — 1 )/p are shown. The 
intervals in which each one is asymptotically stable are indicated by the solid portions 
of the curves. There is an exchange of stability from one equilibrium solution to the 
other at p = 1. 



FIGURE 2.9.3 Exchange of stability for u n+1 = pw„(l — u n ). 


For p > 3 neither of the equilibrium solutions is stable, and the solutions of 
Eq. (21) exhibit increasing complexity as p increases. For p somewhat greater than 3, 
the sequence u n rapidly approaches a steady oscillation of period 2; that is, u n oscil- 
lates back and forth between two distinct values. For p = 3.2 a solution is shown in 
Figure 2.9.4. For n greater than about 20, the solution alternates between the values 
0.5130 and 0.7995. The graph is drawn for the particular initial condition uq = 0.3, 
but it is similar for all other initial values between 0 and 1. Figure 2.9.4 b also shows 
the same steady oscillation as a rectangular path that is traversed repeatedly in the 
clockwise direction. 

At about p = 3.449, each state in the oscillation of period 2 separates into two 
distinct states, and the solution becomes periodic with period 4; see Figure 2.9.5, 
which shows a solution of period 4 for p — 3.5. As p increases further, periodic 
solutions of period 8, 16, . . . appear. The appearance of a new solution at a certain 
parameter value is called a bifurcation. 

The p-values at which the successive period doublings occur approach a limit that 
is approximately 3.57. For p > 3.57 the solutions possess some regularity, but no 
discernible detailed pattern for most values of p. For example, a solution for p = 3.65 
is shown in Figure 2.9.6. It oscillates between approximately 0.3 and 0.9, but its fine 
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FIGURE 2.9.4 A solution of u n+ 1 = pu n ( 1 — u n ) for p = 3.2; period 2. (a) u n versus n; 
(b) a two-cycle. 


structure is unpredictable. The term chaotic is used to describe this situation. One of 
the features of chaotic solutions is extreme sensitivity to the initial conditions. This 
is illustrated in Figure 2.9.7, where two solutions of Eq. (21) for p = 3.65 are shown. 
One solution is the same as that in Figure 2.9.6 and has the initial value u 0 = 0.3, 
while the other solution has the initial value uq = 0.305. For about 15 iterations the 
two solutions remain close and are hard to distinguish from each other in the figure. 
After that, although they continue to wander about in approximately the same set 
of values, their graphs are quite dissimilar. It would certainly not be possible to use 
one of these solutions to estimate the value of the other for values of n larger than 
about 15. 

It is only comparatively recently that chaotic solutions of difference and differential 
equations have become widely known. Equation (20) was one of the first instances 
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FIGURE 2.9.5 A solution of u n+ 1 = pu n ( 1 — u n ) for p = 3.5; period 4. (a) u n versus n; 
( b ) a four-cycle. 


of mathematical chaos to be found and studied in detail, by Robert May 21 in 1974. 
On the basis of his analysis of this equation as a model of the population of certain 
insect species, May suggested that if the growth rate p is too large, then it will be 
impossible to make effective long-range predictions about these insect populations. 
The occurrence of chaotic solutions in simple problems has stimulated an enormous 
amount of research in recent years, but many questions remain unanswered. It is 
increasingly clear, however, that chaotic solutions are much more common than was 
suspected at first and that they may be a part of the investigation of a wide range of 
phenomena. 


21 R. M. May, “Biological Populations with Nonoverlapping Generations: Stable Points, Stable Cycles, and 
Chaos,” Science 186 (1974), pp. 645-647; “Biological Populations Obeying Difference Equations: Stable 
Points, Stable Cycles, and Chaos,” Journal of Theoretical Biology 51 (1975), pp. 511-524. 


130 


Chapter 2. First Order Differential Equations 



FIGURE 2.9.6 A solution of u n+1 = pu n (l — u n ) for p — 3.65; a chaotic solution. 



FIGURE 2.9.7 Two solutions of u n+ \ = pu n ( 1 — u„) for p = 3.65; mq = 0.3 and uq = 0.305. 


PROBLEMS 


In each of Problems 1 through 6 solve the given difference equation in terms of the initial 
value yo. Describe the behavior of the solution as n -> oo. 


1. y „+ i = -0.9 y n 


n + 1 

2. y „+ 1 = — -zy n 
n + 2 


3. y„+i = 


« + 3 
n + 1 




4. y«+i = (-l)" 4 - 1 ^ 


5. y n+1 = 0.5y„ + 6 


6. y„ +1 = — 0.5y„ + 6 


7. Find the effective annual yield of a bank account that pays interest at a rate of 7%, 
compounded daily; that is, divide the difference between the final and initial balances by 
the initial balance. 
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8. An investor deposits $1000 in an account paying interest at a rate of 8% compounded 
monthly, and also makes additional deposits of $25 per month. Find the balance in the 
account after 3 years. 

9. A certain college graduate borrows $8000 to buy a car. The lender charges interest at an 
annual rate of 10%. What monthly payment rate is required to pay off the loan in 3 years? 
Compare your result with that of Problem 9 in Section 2.3. 

10. A homebuyer wishes to take out a mortgage of $100,000 for a 30-year period. What 
monthly payment is required if the interest rate is (a) 9%, (b) 10%, (c) 12%? 

11. A homebuyer takes out a mortgage of $100,000 with an interest rate of 9%. What monthly 
payment is required to pay off the loan in 30 years? In 20 years? What is the total amount 
paid during the term of the loan in each of these cases? 

12. If the interest rate on a 20-year mortgage is fixed at 10% and if a monthly payment of 
$1000 is the maximum that the buyer can afford, what is the maximum mortgage loan that 
can be made under these conditions? 

13. A homebuyer wishes to finance the purchase with a $95,000 mortgage with a 20-year term. 
What is the maximum interest rate the buyer can afford if the monthly payment is not to 
exceed $900? 

The Logistic Difference Equation. Problems 14 through 19 deal with the difference equation 
(21), Un +1 = pu n {\ n n ). 

14. Carry out the details in the linear stability analysis of the equilibrium solution 
u n = (p — 1 )/ p. That is, derive the difference equation (26) in the text for the pertur- 
bation v n . 

15. (a) For p = 3.2 plot or calculate the solution of the logistic equation (21) for several 
initial conditions, say, uo = 0.2, 0.4, 0.6, and 0.8. Observe that in each case the solution 
approaches a steady oscillation between the same two values. This illustrates that the 
long-term behavior of the solution is independent of the initial conditions. 

(b) Make similar calculations and verify that the nature of the solution for large n is 
independent of the initial condition for other values of p, such as 2.6, 2.8, and 3.4. 

16. Assume that p > 1 in Eq. (21). 

(a) Draw a qualitatively correct stairstep diagram and thereby show that if u 0 < 0, then 
u n — > —oo as n — > oo. 

(b) In a similar way, determine what happens as n — > oo if u 0 > 1. 

17. The solutions of Eq. (21) change from convergent sequences to periodic oscillations of 
period 2 as the parameter p passes through the value 3. To see more clearly how this 
happens, carry out the following calculations. 

(a) Plot or calculate the solution for p = 2.9, 2.95, and 2.99, respectively, using an initial 
value u 0 of your choice in the interval (0, 1). In each case estimate how many iterations 
are required for the solution to get “very close” to the limiting value. Use any convenient 
interpretation of what “very close” means in the preceding sentence. 

(b) Plot or calculate the solution for p = 3.01, 3.05, and 3.1, respectively, using the same 
initial condition as in part (a). In each case estimate how many iterations are needed to 
reach a steady-state oscillation. Also find or estimate the two values in the steady-state 
oscillation. 

18. By calculating or plotting the solution of Eq. (21) for different values of p, estimate the 
value of p at which the solution changes from an oscillation of period 2 to one of period 
4. In the same way, estimate the value of p at which the solution changes from period 4 
to period 8. 
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02/ 19. Let pk be the value of p at which the solution of Eq. (21) changes from period 2* 1 to 
period 2 k . Thus, as noted in the text, pi = 3, P 2 = 3.449, and p 3 = 3.544. 

(a) Using these values of pi,p 2 , and p 3 , or those you found in Problem 18, calculate 
(p? - Pt)/(P3 - Pi)- 

(b) Let 8 „ = (p„ — p„_i)/(p„ + i — p„). It has been shown that 8 n approaches a limit <5 as 
n -> oo, where 8 = 4.6692 is known as the Feigenbaum 22 number. Determine the per- 
centage difference between the limiting value 8 and 82 , as calculated in part (a). 

(c) Assume that 83 = 8 and use this relation to estimate p 4 , the value of p at which solutions 
of period 16 appear. 

(d) By plotting or calculating solutions near the value of p 4 found in part (c), try to detect 
the appearance of a period 16 solution. 

(e) Observe that 


Pn — Pi + (Pi — Pi) + (P3 — Pi) + • • • + (p n — Pn-l)- 

Assuming that (p 4 — pi) = (p 3 — pi) 8 ~ l , (ps — p 4 ) = (p 3 — pi)S~ 2 , and so forth, express p„ 
as a geometric sum. Then find the limit of p n as n — > 00. This is an estimate of the value 
of p at which the onset of chaos occurs in the solution of the logistic equation (21). 


PROBLEMS 


Miscellaneous Problems. One of the difficulties in solving first order equations is that there 
are several methods of solution, each of which can be used on a certain type of equation. It 
may take some time to become proficient in matching solution methods with equations. The 
first 32 of the following problems are presented to give you some practice in identifying the 
method or methods applicable to a given equation. The remaining problems involve certain 
types of equations that can be solved by specialized methods. 

In each of Problems 1 through 32 solve the given differential equation. If an initial condition 
is given, also find the solution that satisfies it. 


dy x 3 - 2 y 

dx x 






2 x + y 
3 + 3y 2 — x ’ 

2 xy + y 2 + 1 
x 2 + 2 xy 

4x 3 + 1 
,v(2 + 3y) 

2 xy + 1 
x 2 + 2 y 


y(0) = 0 


2 dy 1 + cos x 

dx 2 — siny 

d v 

4. -f = 3 - 6 x + y - 2xy 


6 . 


8 . 



dy 

dx 


+ xy = 1 - y. 

y( 1 ) 

sinx 

y( 2 ) 

+ 2 y= , 

jc 


0 


1 


10. (x 2 y + xy — y) dx + (x 2 y — 2x 2 ) dy = 0 


22 This result for the logistic difference equation was discovered by Mitchell Feigenbaum (1944 - ) 

in August 1975, while he was working at the Los Alamos National Laboratory. Within a few weeks he 
had established that the same limiting value also appears in a large class of period-doubling difference 
equations. Feigenbaum, who has a doctorate in physics from M.I.T., is now at Rockefeller University. 
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11 . 

13. 

15. 

17. 

19. 

21 . 

23. 

25. 

26. 

28. 

30. 

32. 

33. 


(. x 2 + y) dx + (x + e v ) dy = 0 

^ =1+2 x + y 2 + 2xy 2 
dx 

(e x + l)^- = y-ye x 
dx 

J = e 2 * + 3 y 
dx 

dy 3x 2 — 2 y — y 3 
dx 2x + 3 xy 2 

dy 2y 2 + 6xy — 4 

dx 3x 2 + 4xy + 3 y 2 

+ (t + l)y = e 2 ‘ 


i ~ dy 1 

12. — + v = 

dx ' 1 + e x 


14. (x + y) dx + (x + 2y) dy = 0, y(2) = 3 


dy e x cos y — e 2y cosx 

16. = 


dx — e - * sin y + 2e 2 ^ sin x 

\ y( 0) = 3 


18. ‘j- +2 y = e 
dx 


20 . y = (+ + y 

dy x 2 — 1 

22 - 1 - 
dx y 2 + 1 


y ( — 1) = 1 


2 - - 


y 


dx + 


x z + y 2 


24. 2 sin y sin x cos x dx + cos y sin‘ xdy = 0 
dy = 0 


27. 


dx x 2 y + y 3 

29 = + y 

dx x — y 

31 dy 3x 2 y + y 2 

dx 2x 3 + 3xy 


Ilinl: Let u = x z . 


yd) = -2 


y x 2 + y 2 
xy' = y + xe 3 ’/ 1 

(2y + 3x) dx = —x dy 

(3y 2 + 2xy) dx — (2xy + x 2 ) dy = 0 

xy' + y- y 2 e 2x = 0, y(l) = 2 

Riccati Equations. The equation 

= <7i(0 + 92 (Oy + 93 (Oy 2 

dt 

is known as a Riccati 23 equation. Suppose that some particular solution yi of this equation 
is known. A more general solution containing one arbitrary constant can be obtained 
through the substitution 

y = yt(0 + -77-. 

v{t) 

Show that v(t) satisfies the first order linear equation 

dv 

= -(92 + 2q 3 y 1 )v - 93 . 


Note that v(t ) will contain a single arbitrary constant. 


23 Riccati equations are named for Jacopo Francesco Riccati (1676-1754), a Venetian nobleman, who 
declined university appointments in Italy, Austria, and Russia to pursue his mathematical studies privately 
at home. Riccati studied these equations extensively; however, it was Euler (in 1760) who discovered the 
result stated in this problem. 
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34. Using the method of Problem 33 and the given particular solution, solve each of the 
following Riccati equations: 

(a) / = 1 + t 2 - 2ty 4- y 2 ; yi(t) = t 


( b ) y' = -]2-- t + y 2 ’ yi(o = j 


dy 2 cos 2 1 — sin” t + y 1 


(C) -dt = 


2 cos t 


yi(f) — sint 


35. The propagation of a single action in a large population (for example, drivers turning on 
headlights at sunset) often depends partly on external circumstances (gathering darkness) 
and partly on a tendency to imitate others who have already performed the action in 
question. In this case the proportion y{t) of people who have performed the action can 
be described 24 by the equation 


dy/dt = (1 - y)\x(t) + by], 


(i) 


where x(t) measures the external stimulus and b is the imitation coefficient. 

(a) Observe that Eq. (i) is a Riccati equation and that y\(t) = 1 is one solution. Use the 
transformation suggested in Problem 33, and find the linear equation satisfied by v it). 

(b) Find v(t) in the case that x(t) = at, where a is a constant. Leave your answer in the 
form of an integral. 


Some Special Second Order Equations. Second order equations involve the second deriva- 
tive of the unknown function and have the general form/' = fit, y, /). Usually such equations 
cannot be solved by methods designed for first order equations. However, there are two types 
of second order equations that can be transformed into first order equations by a suitable 
change of variable. The resulting equation can sometimes be solved by the methods presented 
in this chapter. Problems 36 through 51 deal with these types of equations. 


Equations with the Dependent Variable Missing. For a second order differential equation 
of the form y" = f(t,y'), the substitution v = /, v' = y" leads to a first order equation of the 
form v' = f{t, v). If this equation can be solved for v, then y can be obtained by integrating 
dy/dt = v. Note that one arbitrary constant is obtained in solving the first order equation for 
v, and a second is introduced in the integration for y. In each of Problems 36 through 41 use 
this substitution to solve the given equation. 

36. t 2 y" + 2ty' - 1 = 0, t > 0 37. ty" + / = 1, t > 0 

38. /' + tiy'f = 0 39. 2 t 2 y" + (y') 3 = 2ty' , t > 0 

40. y" +/ = e~' 41. t 2 y" — iy') 2 , t > 0 


Equations with the Independent Variable Missing. Consider second order differential equa- 
tions of the form y" = fiy , y ') , in which the independent variable t does not appear explicitly. If 
we let v = y', then we obtain dv/dt = fiy, v). Since the right side of this equation depends on y 
and v, rather than on t and v, this equation contains too many variables. However, if we think 
of y as the independent variable, then by the chain rule, dv/dt = idv / dy)idy / dt) = i fdv/dy). 
Hence the original differential equation can be written as vidv/dy) = fiy, v). Provided that 
this first order equation can be solved, we obtain v as a function of y. A relation between y 
and t results from solving dy/dt = viy), which is a separable equation. Again, there are two 
arbitrary constants in the final result. In each of Problems 42 through 47 use this method to 
solve the given differential equation. 


24 See Anatol Rapoport, “Contribution to the Mathematical Theory of Mass Behavior: I. The Propagation 
of Single Acts,” Bulletin of Mathematical Biophysics 14 (1952), pp. 159-169, and John Z. Hearon, “Note 
on the Theory of Mass Behavior,” Bulletin of Mathematical Biophysics 17 (1955), pp. 7-13. 
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CHAPTER 


Second Order Linear 
Equations 


3 


Linear equations of second order are of crucial importance in the study of differential 
equations for two main reasons. The first is that linear equations have a rich theo- 
retical structure that underlies a number of systematic methods of solution. Further, 
a substantial portion of this structure and of these methods is understandable at a 
fairly elementary mathematical level. In order to present the key ideas in the sim- 
plest possible context, we describe them in this chapter for second order equations. 
Another reason to study second order linear equations is that they are vital to any 
serious investigation of the classical areas of mathematical physics. One cannot go 
very far in the development of fluid mechanics, heat conduction, wave motion, or 
electromagnetic phenomena without finding it necessary to solve second order lin- 
ear differential equations. As an example, we discuss the oscillations of some basic 
mechanical and electrical systems at the end of the chapter. 


3.1 Homogeneous Equations with Constant Coefficients 


A second order ordinary differential equation has the form 



( 1 ) 


where / is some given function. Usually, we will denote the independent variable 
by t since time is often the independent variable in physical problems, but some- 
times we will use x instead. We will use y, or occasionally some other letter, to 
designate the dependent variable. Equation (1) is said to be linear if the function/ 
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has the form 

f^y d i) = sm-pm%- q my, c> 

that is, if / is linear in y and dy/dt. In Eq. (2) g,p, and q are specified functions of the in- 
dependent variable t but do not depend on y. In this case we usually rewrite Eq. (1) as 

y" + p 0 t)y ' + q(t)y = git), (3) 

where the primes denote differentiation with respect to t. Instead of Eq. (3), we 
often see the equation 

P{t)y" + Q{t)y' + R{t)y = G(t). (4) 


Of course, if P(t) ^ 0, we can divide Eq. (4) by P(t) and thereby obtain Eq. (3) with 


Pit) = 


6(0 

PitY 


q(t ) = 


R(t) 

Pit y 


git ) = 


Git) 
Pit) ' 


( 5 ) 


In discussing Eq. (3) and in trying to solve it, we will restrict ourselves to intervals in 
which p , q. and g are continuous functions. 1 

If Eq. (1) is not of the form (3) or (4), then it is called nonlinear. Analytical 
investigations of nonlinear equations are relatively difficult, so we will have little to 
say about them in this book. Numerical or geometical approaches are often more 
appropriate, and these are discussed in Chapters 8 and 9. 

An initial value problem consists of a differential equation such as Eq. (1), (3), or 
(4) together with a pair of initial conditions 


yito) = yo, y'it 0 ) = y' 0 . 


( 6 ) 


where yo and y f 0 are given numbers prescribing values for y and y’ at the initial 
point to- Observe that the initial conditions for a second order equation identify not 
only a particular point (to,yo) through which the graph of the solution must pass, 
but also the slope y' Q of the graph at that point. It is reasonable to expect that two 
initial conditions are needed for a second order equation because, roughly speaking, 
two integrations are required to find a solution and each integration introduces an 
arbitrary constant. Presumably, two initial conditions will suffice to determine values 
for these two constants. 

A second order linear equation is said to be homogeneous if the term g(t) in 
Eq. (3), or the term G(t) in Eq. (4), is zero for all t. Otherwise, the equation is 
called nonhomogeneous. As a result, the term g(t), or G(t), is sometimes called 
the nonhomogeneous term. We begin our discussion with homogeneous equations, 
which we will write in the form 


P(t)y" + Qit)y' + Rit)y = 0. (7) 

Later, in Sections 3.5 and 3.6, we will show that once the homogeneous equation 
has been solved, it is always possible to solve the corresponding nonhomogeneous 


1 There is a corresponding treatment of higher order linear equations in Chapter 4. If you wish, you may 
read the appropriate parts of Chapter 4 in parallel with Chapter 3. 
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equation (4), or at least to express the solution in terms of an integral. Thus the 
problem of solving the homogeneous equation is the more fundamental one. 

In this chapter we will concentrate our attention on equations in which the func- 
tions P, Q , and R are constants. In this case, Eq. (7) becomes 

ay" + by' + cy — 0, (8) 

where a , b. and c are given constants. It turns out that Eq. (8) can always be solved 
easily in terms of the elementary functions of calculus. On the other hand, it is 
usually much more difficult to solve Eq. (7) if the coefficients are not constants, and 
a treatment of that case is deferred until Chapter 5. Before taking up Eq. (8), let 
us first gain some experience by looking at a simple example that in many ways is 
typical. 


Solve the equation 

EXAMPLE y"-y = 0, (9) 

^ and also find the solution that satisfies the initial conditions 

y( 0 ) = 2, /(0) = -l. (to) 

Observe that Eq. (9) is just Eq. (8) with a = 1, b = 0, and c = —1. In words, Eq. (9) says that 
we seek a function with the property that the second derivative of the function is the same as the 
function itself. Do any of the functions that you studied in calculus have this property? A little 
thought will probably produce at least one such function, namely, y\(t) = e', the exponential 
function. A little more thought may also produce a second function, y 2 (f) = e~‘ . Some further 
experimentation reveals that constant multiples of these two solutions are also solutions. For 
example, the functions 2e' and 5e~‘ also satisfy Eq. (9), as you can verify by calculating their 
second derivatives. In the same way, the functions ciyi(t) = cie' and c 2 y 2 (r) = c 2 e - ' satisfy the 
differential equation (9) for all values of the constants C\ and c 2 . 

Next, it is vital to notice that the sum of any two solutions of Eq. (9) is also a solution. In 
particular, since ciyi(t) and c 2 y 2 (0 are solutions of Eq. (9), so is the function 

y = ciyi(0 + c 2 y 2 (t) = cic' + c 2 e~‘ (11) 

for any values of c\ and c 2 . Again, this can be verified by calculating the second derivative y" 
from Eq. (11). We have y' = Cie' — c 2 e - ' and y" = cie' + c 2 e - '; thus y" is the same as y, and 
Eq. (9) is satisfied. 

Let us summarize what we have done so far in this example. Once we notice that the 
functions y\(t) — e' and y 2 (t) = e~ l are solutions of Eq. (9), it follows that the general linear 
combination (11 ) of these functions is also a solution. Since the coefficients cq and c 2 in Eq. (11) 
are arbitrary, this expression represents an infinite family of solutions of the differential equa- 
tion (9). 

It is now possible to consider how to pick out a particular member of this infinite family of 
solutions that also satisfies a given set of initial conditions (10). In other words, we seek the 
solution that passes through the point (0, 2) and at that point has the slope —1. First, we set 
t = 0 and y = 2 in Eq. (11); this gives the equation 

Ci + c 2 = 2. 

Next, we differentiate Eq. (11) with the result that 

y' = c\e‘ — c 2 e — h 


(12) 
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Then, setting t = 0 and y' = —1, we obtain 

ci-c 2 = -1. (13) 

By solving Eqs. (12) and (13) simultaneously for c\ and c 2 we find that 

ci = |, c 2 = |. (14) 

Finally, inserting these values in Eq. (11), we obtain 

y=\e>+ \e-\ (15) 

the solution of the initial value problem consisting of the differential equation (9) and the 
initial conditions (10). 

What conclusions can we draw from the preceding example that will help us to 
deal with the more general equation (8) 

ay" + by' + cy = 0, 

whose coefficients a, b. and c are arbitrary (real) constants? In the first place, in the 
example the solutions were exponential functions. Further, once we had identified 
two solutions, we were able to use a linear combination of them to satisfy the given 
initial conditions as well as the differential equation itself. 

It turns out that by exploiting these two ideas we can solve Eq. (8) for any values 
of its coefficients and also satisfy any given set of initial conditions for y and y' . We 
start by seeking exponential solutions of the form y = e rt , where r is a parameter to 
be determined. Then it follows that / = re rt and y" = r 2 e rt . By substituting these 
expressions for y, /, and y" in Eq. (8), we obtain 

(ar 2 + br + c)e rt = 0, 

or, since e rt / 0, 

ar 2 + br + c = 0. (16) 

Equation (16) is called the characteristic equation for the differential equation (8). 
Its significance lies in the fact that if r is a root of the polynomial equation (16), then 
y — e rt is a solution of the differential equation (8). Since Eq. (16) is a quadratic 
equation with real coefficients, it has two roots, which may be real and different, real 
but repeated, or complex conjugates. We consider the first case here and the latter 
two cases in Sections 3.3 and 3.4. 

Assuming that the roots of the characteristic equation (16) are real and different, 
let them be denoted by ri and r 2 , where r\ ^ r 2 . Thenyi(f) = e n ‘ andy2(0 = e nt are 
two solutions of Eq. (8). Just as in Example 1, it now follows that 

y = ciyi(0 + c 2 y 2 (t) = c l e nt + c 2 e rit (17) 

is also a solution of Eq. (8) . To verify that this is so, we can differentiate the expression 
in Eq. (17); hence 

/ = c 1 r 1 e n ' + c 2 r 2 e^ (18) 


and 


y" = C \r 2 e r]t + c 2 r\e nt . 


( 19 ) 
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Substituting these expressions for y, y’ , and y" in Eq. (8) and rearranging terms, we 
obtain 

ay" + by' + cy = c\(ar\ + br\ + c)e nt + c 2 (ar\ + br 2 + c)e r2t . (20) 


The quantity in each of the parentheses on the right side of Eq. (20) is zero because 
r\ and r 2 are roots of Eq. (16); therefore, y as given by Eq. (17) is indeed a solution 
of Eq. (8), as we wished to verify. 

Now suppose that we want to find the particular member of the family of solutions 
(17) that satisfies the initial conditions (6) 

y(to) = yo, y\t 0 ) = y' 0 . 

By substituting t — to and y = yo in Eq. (17), we obtain 

ci/ 1 ' 0 + c 2 e rit0 = y 0 - (21) 

Similarly, setting t = to and y' = y' 0 in Eq. (18) gives 

Cir 1 e nt ° + c 2 r 2 e rit ° = y' 0 . ( 22 ) 


On solving Eqs. (21) and (22) simultaneously for ci and c 2 , we find that 

Cl = y*zm e -n* t C2 = y^A e -ny 

r i -r 2 r\ - r 2 


(23) 


Recall that r\ — r 2 ^ 0 so that the expressions in Eq. (23) always make sense. Thus, 
no matter what initial conditions are assigned — that is, regardless of the values of to, 
yo, and y' 0 in Eqs. (6) — it is always possible to determine ci and c 2 so that the initial 
conditions are satisfied. Moreover, there is only one possible choice of ci and c 2 
for each set of initial conditions. With the values of ci and c 2 given by Eq. (23), the 
expression (17) is the solution of the initial value problem 


ay" + by' + cy = 0, y(t 0 ) = y 0 , y'(t 0 ) = y' 0 . (24) 


It is possible to show, on the basis of the fundamental theorem cited in the next 
section, that all solutions of Eq. (8) are included in the expression (17), at least for the 
case in which the roots of Eq. (16) are real and different. Therefore, we call Eq. (17) 
the general solution of Eq. (8). The fact that any possible initial conditions can be 
satisfied by the proper choice of the constants in Eq. (17) makes more plausible the 
idea that this expression does include all solutions of Eq. (8). 

Let us now look at some further examples. 


Find the general solution of 

y " + 5/ + 6y = 0. (25) 

We assume that y = e n , and it then follows that r must be a root of the characteristic equation 
r 1 + 5r + 6 = (r + 2)(r + 3) = 0. 

Thus the possible values of r are r\ = —2 and r 2 = —3; the general solution of Eq. (25) is 

y = Cie -2t + c 2 e -3 '. 


(26) 
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Find the solution of the initial value problem 

y" + 5y' + 6y = 0, y( 0) = 2, /(0) = 3. (27) 

The general solution of the differential equation was found in Example 2 and is given by 
Eq. (26). To satisfy the first initial condition, we set t — 0 and y = 2 in Eq. (26); thus c\ and C 2 
must satisfy 

Ci + C 2 = 2. (28) 

To use the second initial condition, we must first differentiate Eq. (26). This gives 
y' = — 2cie -2 ' — 3c2e~ 3 '. Then, setting t = 0 and y' = 3, we obtain 

—2c\ - 3 c 2 = 3. (29) 

By solving Eqs. (28) and (29), we find that Ci = 9 and c 2 = —7. Using these values in the 
expression (26), we obtain the solution 

y = 9e -2 ' - 7e~ 3 ' (30) 

of the initial value problem (27). The graph of the solution is shown in Figure 3.1.1. 


y 


2 *- 

y _ 9c z ' 7e :u 

1 - 


0.5 1 1.5 2 t 

FIGURE 3.1.1 Solution of y" + 5y' + 6y = 0, y(0) = 2, /( 0) = 3. 


Find the solution of the initial value problem 

4/ - 8/ + 3y = 0, y( 0) = 2, y'( 0) = | . (31) 

If y = e", then the characteristic equation is 

4r 2 - 8r + 3 = 0 

and its roots are r = 3/2 and r = 1/2. Therefore the general solution of the differential equa- 
tion is 

y = c 1 e it/2 + c 2 e ,/2 . (32) 

Applying the initial conditions, we obtain the following two equations for ci and C2: 

Cl + C2 = 2, |ci + ^C2 = \ . 

The solution of these equations is ci = — c 2 = f , so the solution of the initial value problem 
(31) is 

y = - |e 3,/2 + fe'/ 2 . (33) 

Figure 3.1.2 shows the graph of the solution. 
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FIGURE 3.1.2 Solution of Ay" - 8/ + 3y = 0, y(0) = 2, y'(O) = 0.5. 


The solution (30) of the initial value problem (27) initially increases (because its initial slope 
is positive) but eventually approaches zero (because both terms involve negative exponential 
functions). Therefore the solution must have a maximum point, and the graph in Figure 3.1.1 
confirms this. Determine the location of this maximum point. 

The coordinates of the maximum point can be estimated from the graph, but to find them 
more precisely, we seek the point where the solution has a horizontal tangent line. By differ- 
entiating the solution (30), y = 9e -2 ' — 7e -3 ', with respect to f, we obtain 

/ = -18e- 2 ' + 21e- 3 '. (34) 

Setting y' equal to zero and multiplying by e 3 ' , we find that the critical value t m satisfies e‘ = 7 /6; 
hence 

t m = In (7/6) = 0.15415. (35) 

The corresponding maximum value y m is given by 

108 

y m = 9e~ 2lm - 7e" 3,m = — = 2.20408. (36) 

In this example the initial slope is 3, but the solution of the given differential equation 
behaves in a similar way for any other positive initial slope. In Problem 26 you are asked to 
determine how the coordinates of the maximum point depend on the initial slope. 


Returning to the equation ay" + by' + cy = 0 with arbitrary coefficients, recall that 
when r\ ^ d, its general solution (17) is the sum of two exponential functions. There- 
fore the solution has a relatively simple geometrical behavior: as t increases, the 
magnitude of the solution either tends to zero (when both exponents are negative) 
or else grows rapidly (when at least one exponent is positive). These two cases are 
illustrated by the solutions of Examples 3 and 4. which are shown in Figures 3.1.1 
and 3.1.2, respectively. There is also a third case that occurs less often: the solution 
approaches a constant when one exponent is zero and the other is negative. 

In Sections 3.3 and 3.4, respectively, we return to the problem of solving the equa- 
tion ay" + by' + cy = 0 when the roots of the characteristic equation either are com- 
plex conjugates or are real and equal. In the meantime, in Section 3.2, we provide 
a systematic discussion of the mathematical structure of the solutions of all second 
order linear homogeneous equations. 
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PROBLEMS In each of Problems 1 through 8 find the general solution of the given differential equation. 
1. y" + 2y'-3y = 0 2. y" + 3/ + 2y = 0 

3. 6y " -y'-y = 0 4. 2y" - 3y' + y = 0 

5. y" + 5y' = 0 6. 4/' - 9y = 0 

7. y" - 9/ + 9y = 0 8. /' - 2/ - 2y = 0 


In each of Problems 9 through 16 find the solution of the given initial value problem. Sketch 
the graph of the solution and describe its behavior as t increases. 

9. y" +y'-2y = 0, y(0) = 1, y'(0) = 1 

10. y" + 4 y + 3y = 0, y(0) = 2, y'( 0) = -1 

11. 6y" - 5/ + y = 0, y(0) = 4, y'( 0) = 0 

12. y" + 3/ = 0, y(0) = -2, y'(0) = 3 

13. y" + 5y' + 3y = 0, y(0) = 1, y'( 0) = 0 

14. 2y" + y' - 4y = 0, y( 0) = 0, y'(0) = 1 

15. y" + 8 y' - 9y = 0, y( 1) = 1, /( 1) = 0 

16. 4 y" - y = 0, y(-2) = 1, y'(-2) = -1 

17. Find a differential equation whose general solution is y = cie 2 ' + C 2 e~ 3 '. 

18. Find a differential equation whose general solution is y = cie~' /2 + C 2 e~ 21 . 

19. Find the solution of the initial value problem 


y" - y = 0, y(0)=f, y'( 0) = 


Plot the solution for 0 < t < 2 and determine its minimum value. 

20. Find the solution of the initial value problem 

2y"-3y'+y = 0, y(0)=2, y'(0) = 


Then determine the maximum value of the solution and also find the point where the 
solution is zero. 

21. Solve the initial value problem y" — y' — 2 y = 0, y(0) = a, y'(0) = 2. Then find a so that 
the solution approaches zero as t -> oo. 

22. Solve the initial value problem 4y" — y = 0, y CO) = 2, y'(0) = /S. Then find f) so that the 
solution approaches zero as t -> oo. 

In each of Problems 23 and 24 determine the values of a, if any, for which all solutions tend to 

zero as t — > oo; also determine the values of a, if any, for which all (nonzero) solutions become 

unbounded as t — >■ oo. 

23. y" — (2a — 1 )y' + a(a — 1 )y = 0 

24. y" + (3 - a)y' - 2(a - l)y = 0 

25. Consider the initial value problem 

2y" + 3/ - 2y = 0, y(0) = 1, y'(0) = -fi. 


where f) > 0. 

(a) Solve the initial value problem. 

(b) Plot the solution when fi = 1. Find the coordinates (to,yo) of the minimum point of 
the solution in this case. 

(c) Find the smallest value of /S for which the solution has no minimum point. 
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26. Consider the initial value problem (see Example 5) 

y" + 5y' + 6y = 0, y( 0) = 2, y'( 0) = 0, 

where f) > 0. 

(a) Solve the initial value problem. 

(b) Determine the coordinates t m andy m of the maximum point of the solution as functions 
of fS. 

(c) Determine the smallest value of (5 for which y m > 4. 

(d) Determine the behavior of t m and y m as p — > oo. 

27. Consider the equation ay" + by' + cy = d, where a, b , c, and d are constants. 

(a) Find all equilibrium, or constant, solutions of this differential equation. 

(b) Let y e denote an equilibrium solution, and let Y = y — y e . Thus Y is the deviation of 
a solution y from an equilibrium solution. Find the differential equation satisfied by Y . 

28. Consider the equation ay" + by' + cy = 0, where a, b , and c are constants with a > 0. Find 
conditions on a. b , and c such that the roots of the characteristic equation are: 

(a) real, different, and negative. 

(b) real with opposite signs. 

(c) real, different, and positive. 


3.2 Solutions of Linear Homogeneous Equations; the Wronskian 

In the preceding section we showed how to solve some differential equations of the 
form 

ay" + by' + cy = 0, 

where a , b , and c are constants. Now we build on those results to provide a clearer 
picture of the structure of the solutions of all second order linear homogeneous 
equations. In turn, this understanding will assist us in finding the solutions of other 
problems that we will encounter later. 

To discuss general properties of linear differential equations, it is helpful to intro- 
duce a differential operator notation. Let p and q be continuous functions on an 
open interval /, that is, for a < t < The cases a = — oo, or /J = oo, or both, are 
included. Then, for any function <p that is twice differentiable on /, we define the 
differential operator L by the equation 

L[<j>] = <j>" +p(j>' + q<j>. (1) 

Note that L[</>] is a function on I. The value of L[(p] at a point t is 

L[(p](t) = 4>"(t)+p(t)4>'(t) + q{t)(p{t). 

For example, if p(t) — t 2 , q{t) = 1 + t, and 0(f) = sin3f, then 

L[(f>](t) = (sin3f)" + f 2 (sin3f)' + (1 + f) sin3f 
= — 9sin3f + 3 f 2 cos 3 f + (1 + f) sin 3 1. 
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The operator L is often written as L = D 2 + pD + q , where D is the derivative 
operator. 

In this section we study the second order linear homogeneous equation 
L[<p](t) = 0. Since it is customary to use the symbol y to denote </>(/), we will usually 
write this equation in the form 

L[y] = y" + p(t)y' + q(t)y = 0. (2) 

With Eq. (2) we associate a set of initial conditions 

y(to) = yo, y'(k)) = y' Q , (3) 

where to is any point in the interval I, and yo and y' {] are given real numbers. We 
would like to know whether the initial value problem (2), (3) always has a solution, 
and whether it may have more than one solution. We would also like to know whether 
anything can be said about the form and structure of solutions that might be helpful 
in finding solutions of particular problems. Answers to these questions are contained 
in the theorems in this section. 

The fundamental theoretical result for initial value problems for second order 
linear equations is stated in Theorem 3.2.1, which is analogous to Theorem 2.4.1 
for first order linear equations. The result applies equally well to nonhomogeneous 
equations, so the theorem is stated in that form. 


Theorem 3.2.1 (Existence and Uniqueness Theorem) 

Consider the initial value problem 

y" +p(t)y' + q{t)y = g(t), y(f 0 ) = yo, /(to) = y' 0 , (4) 

where p , q, and g are continuous on an open interval I that contains the point to- 
Then there is exactly one solution y = 0(f) of this problem, and the solution exists 
throughout the interval 7. 


We emphasize that the theorem says three things: 

1. The initial value problem has a solution; in other words, a solution exists. 

2. The initial value problem has only one solution; that is, the solution is unique. 

3. The solution <p is defined throughout the interval I where the coefficients are continuous 
and is at least twice differentiable there. 

For some problems some of these assertions are easy to prove. For instance, we 
found in Example 1 of Section 3.1 that the initial value problem 

y" y — 0, v(0) = 2, y'(0) = — 1 (5) 

has the solution 

y = k + (6) 

The fact that we found a solution certainly establishes that a solution exists for this 
initial value problem. Further, the solution (6) is twice differentiable, indeed differen- 
tiable any number of times, throughout the interval (— oo, oo) where the coefficients 
in the differential equation are continuous. On the other hand, it is not obvious, and 
is more difficult to show, that the initial value problem (5) has no solutions other 
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Theorem 3.2.2 


than the one given by Eq. (6). Nevertheless, Theorem 3.2.1 states that this solution 
is indeed the only solution of the initial value problem (5). 

For most problems of the form (4) it is not possible to write down a useful ex- 
pression for the solution. This is a major difference between first order and second 
order linear equations. Therefore, all parts of the theorem must be proved by general 
methods that do not involve having such an expression. The proof of Theorem 3.2.1 is 
fairly difficult, and we do not discuss it here. * 2 We will, however, accept Theorem 3.2.1 
as true and make use of it whenever necessary. 


Find the longest interval in which the solution of the initial value problem 
(t 2 - 3t)y" + ty' - (t + 3)y = 0, y( 1) = 2, /(!) = 1 


is certain to exist. 

If the given differential equation is written in the form of Eq. (4), then p(t) = l/(f — 3), 
q(t) = — (t + 3 )/t(t — 3), and g(t) = 0. The only points of discontinuity of the coefficients are 
t = 0 and t = 3. Therefore, the longest open interval, containing the initial point t = 1, in 
which all the coefficients are continuous is 0 < t < 3. Thus, this is the longest interval in which 
Theorem 3.2.1 guarantees that the solution exists. 


Find the unique solution of the initial value problem 

y" + p(t)y' + q(t)y = 0, y(t 0 ) = 0, y'(t 0 ) = 0, 

where p and q are continuous in an open interval I containing to- 

The function y = <f> (t) = 0 for all t in I certainly satisfies the differential equation and initial 
conditions. By the uniqueness part of Theorem 3.2.1, it is the only solution of the given 
problem. 


Let us now assume that y\ and yj are two solutions of Eq. (2); in other words, 

L [yi\ = y'[ + py\ + qy\ = 

and similarly for y 2 ■ Then, just as in the examples in Section 3.1, we can generate 
more solutions by forming linear combinations of y\ and y 2 - We state this result as a 
theorem. 


(Principle of Superposition) 

If yi and yj are two solutions of the differential equation (2), 

L[y] = y" + P(t)y' + q(t)y = 0, 

then the linear combination ciyi + cjy 2 is also a solution for any values of the 
constants ci and C 2 - 


2 A proof of Theorem 3.2.1 may be found, for example, in Chapter 6, Section 8 of the book by Coddington 
listed in the references at the end of this chapter. 


148 


Chapter 3. Second Order Linear Equations 


A special case of Theorem 3.2.2 occurs if either c\ or c 2 is zero. Then we conclude 
that any constant multiple of a solution of Eq. (2) is also a solution. 

To prove Theorem 3.2.2, we need only substitute 

y = ciyi(t) + c 2 y 2 (0 (7) 

for y in Eq. (2). By calculating the indicated derivatives and rearranging terms, we 
obtain 


L[ciyi + c 2 y 2 ] = [ciyi + c 2 y 2 ]" + p[ciyi + c 2 y 2 ]' + q[ciyi + c 2 y 2 \ 

— Ciy'\ + c 2 y 2 + cipy[ + c 2 py' 2 + c\qy\ + c 2 qy 2 
= Ci[y? + py\ + qyi\ + c 2 [y 2 +py' 2 + qy 2 ] 

= ciL[yi\ + c 2 L[y 2 ], 

Since L[y\\ = 0 and L[y 2 ] = 0, it follows that L[c\y\ + c 2 y 2 ] = 0 also. Therefore, 
regardless of the values of c\ and c 2 ,y as given by Eq. (7) does satisfy the differential 
equation (2), and the proof of Theorem 3.2.2 is complete. 

Theorem 3.2.2 states that, beginning with only two solutions of Eq. (2), we can 
construct an infinite family of solutions by means of Eq. (7). The next question is 
whether all solutions of Eq. (2) are included in Eq. (7) or whether there may be 
other solutions of a different form. We begin to address this question by examining 
whether the constants ci and c 2 in Eq. (7) can be chosen so as to satisfy the initial 
conditions (3). These initial conditions require c\ and c 2 to satisfy the equations 


ciyi(to) + c 2 y 2 (t 0 ) = y 0 , 
ciyi(to) + c 2 y 2 (t 0 ) = y' 0 . 


The determinant of coefficients of the system (8) is 


W = 


yi(to) y 2 (fo) 
y[(t 0 ) y' 2 (t 0 ) 


= yi(to)y 2 (to ) - y[(to)y 2 (to). 


( 8 ) 


( 9 ) 


If W / 0, then Eqs. (8) have a unique solution (ci, c 2 ) regardless of the values of yo 
and y' 0 . This solution is given by 


_ yoy 2 (to) - y' 0 y 2 (to) _ -yoy[(to) + y' 0 yi(t 0 ) 

Cl yi(to)y 2 (to) - y[(to)y 2 (.to) ’ ° 2 yi(t 0 )y 2 (t 0 ) -yi(f 0 )y 2 (?o) ’ 

or, in terms of determinants, 


yo yi(to) 


0 

£ 

yo y '2 (fo) 


yi(fo) yo 


yt(fo) 

y 2 (to) 

c 2 - - 

yt (fo) 

y 2 (to) 

y[(t 0 ) 

y 2 (to) 


y[(to) 

y 2 (to) 


( 10 ) 


( 11 ) 


With these values for c\ and c 2 , the expression (7) satisfies the initial conditions (3) 
as well as the differential equation (2). 

On the other hand,if W = 0, then Eqs. (8) have no solution unless yo and y' 0 satisfy 
a certain additional condition; in this case there are infinitely many solutions. 
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The determinant W is called the Wronskian * * 3 determinant, or simply the Wron- 
skian, of the solutions yq and y 2 . Sometimes we use the more extended notation 
W(y 1 ,y 2 )(t 0 ) to stand for the expression on the right side of Eq. (9), thereby empha- 
sizing that the Wronskian depends on the functions yq and y 2 , and that it is evaluated 
at the point to. The preceding argument establishes the following result. 


Suppose that yq and y 2 are two solutions of Eq. (2) 

L[y ] = y" + pit)/ + q(t)y = o, 

and that the initial conditions (3) 

y(to) = yo, /(to) = y' 0 

are assigned. Then it is always possible to choose the constants c 1 ,C 2 so that 

y — ciyq(f) + c 2 y 2 {t) 

satisfies the differential equation (2) and the initial conditions (3) if and only if the 
Wronskian 

W = yi/ 2 - y\y 2 

is not zero at t 0 - 


In Example 2 of Section 3.1 we found that yq(f) = e 2t and y 2 (t) = e 3 ' are solutions of the 
differential equation 

/ + 5/ + 6y = 0. 


Find the Wronskian of yq and y 2 . 

The Wronskian of these two functions is 


W = 




— e 


—5t 


Since W is nonzero for all values of t, the functions yq and y 2 can be used to construct solutions 
of the given differential equation, together with initial conditions prescribed at any value of t. 
One such initial value problem was solved in Example 3 of Section 3.1. 


The next theorem justifies the term “general solution” that we introduced in Sec- 
tion 3.1 for the linear combination ciyq + c 2 y 2 - 


Suppose that yq and y 2 are two solutions of the differential equation (2), 

L[y] = y" + pit)/ + qit)y = 0. 

Then the family of solutions 

y = ciyq(f) + C 2 y 2 {t) 

with arbitrary coefficients ci and c 2 includes every solution of Eq. (2) if and only if 
there is a point to where the Wronskian of yq and y 2 is not zero. 


3 Wronskian determinants are named for Josef Maria Hoene-Wronski (1776-1853), who was born in 
Poland but spent most of his life in France. Wronski was a gifted but troubled man, and his life was 
marked by frequent heated disputes with other individuals and institutions. 
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Let </> be any solution of Eq. (2). To prove the theorem, we must determine whether 
<p is included in the linear combinations ci_yi + c 2 y 2 ■ That is, we must determine 
whether there are values of the constants ci and C 2 that make the linear combination 
the same as </;. Let t 0 be a point where the Wronskian of yi and y 2 is nonzero. Then 
evaluate cp and (p ' at this point and call these values yo and y' Q , respectively; thus 

yo = <P(to), y' 0 = </>'(*<))• 

Next, consider the initial value problem 

y" + p(t)y' + q(t)y = 0, y(t 0 ) = y 0 , y'{t 0 ) = y' 0 . (12) 

The function cp is certainly a solution of this initial value problem. Further, since we 
are assuming that W iyi,y 2 ){tQ) is nonzero, then it is possible (by Theorem 3.2.3) to 
choose ci and C 2 so that y = c\y\{t) + c 2 y 2 (t) is also a solution of the initial value 
problem (12). In fact, the proper values of ci and C 2 are given by Eqs. (10) or (11). 
The uniqueness part of Theorem 3.2.1 guarantees that these two solutions of the same 
initial value problem are actually the same function; thus, for the proper choice of ci 
and c 2 , 

(p(t) = ciyiit) + c 2 y 2 (t), (13) 

and therefore cp is included in the family of functions c \ y i + c 2 y 2 . Finally, since (p 
is an arbitrary solution of Eq. (2), it follows that every solution of this equation is 
included in this family. 

Now suppose that there is no point t 0 where the Wronskian is nonzero. Thus 
W(y] ,y 2 )(to) — Ono matter which point to is selected. Then (by Theorem 3.2.3) there 
are values of yo and y’ Q such that the system (8) has no solution for c\ and C 2 . Select 
a pair of such values and choose the solution cp (t) of Eq. (2) that satisfies the initial 
condition (3). Observe that such a solution is guaranteed to exist by Theorem 3.2.1. 
Flowever, this solution is not included in the family y = c\y\ + cyjy ■ Thus this linear 
combination does not include all solutions of Eq. (2) if W(yi , y 2 ) — 0. This completes 
the proof of Theorem 3.2.4. 

Theorem 3.2.4 states that, if and only if the Wronskian of y i and y 2 is not everywhere 
zero, then the linear combination c\y\ + c 2 y 2 contains all solutions of Eq. (2). It is 
therefore natural (and we have already done this in the preceding section) to call the 
expression 

y = ciyiit) + c 2 y 2 {t) 

with arbitrary constant coefficients the general solution of Eq. (2). The solutions y\ 
and y 2 are said to form a fundamental set of solutions of Eq. (2) if and only if their 
Wronskian is nonzero. 

We can restate the result of Theorem 3.2.4 in slightly different language: to find the 
general solution, and therefore all solutions, of an equation of the form (2), we need 
only find two solutions of the given equation whose Wronskian is nonzero. We did 
precisely this in several examples in Section 3.1, although there we did not calculate 
the Wronskians. You should now go back and do that, thereby verifying that all the 
solutions we called “general solutions” in Section 3.1 do satisfy the necessary Wron- 
skian condition. Alternatively, the following example includes all those mentioned 
in Section 3.1, as well as many other problems of a similar type. 
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Suppose that y\{t) = e ri 'andy 2(0 = e r2 ' are two solutions of an equation of the form (2). Show 
that they form a fundamental set of solutions if r\ ^ r 2 . 

We calculate the Wronskian of y\ and y 2 : 


e n< 

e nt 

rie n ‘ 

r 2 e‘ 


(r 2 - ri)exp[(ri + r 2 )t]. 


Since the exponential function is never zero, and since we are assuming that 
r 2 — ri / 0, it follows that W is nonzero for every value of t. Consequently, y i and y 2 form a 
fundamental set of solutions. 


Show that y\(t) = f 1/2 and y 2 (t) — t 1 form a fundamental set of solutions of 

2t 2 y" + 3ty' — y = 0, t > 0. (14) 

We will show how to solve Eq. (14) later (see Problem 34 in Section 3.3). However, at 
this stage we can verify by direct substitution that yi and y 2 are solutions of the differential 
equation. Since y[(t) = if -1 / 2 and y'[(t) = — \t~ 3/2 , we have 

2 t 2 (-ir 3/2 ) + 3t(ir 1/2 ) - r 1/2 «(-! + §- l)f 1/2 = 0. 

Similarly, yj (0 = — t~ 2 andyjCO = 2f^ 3 ,so 

2f 2 (2r 3 ) + 3t(-t~ 2 ) - r 1 = ( 4 - 3 - l)f _1 = 0. 


Next we calculate the Wronskian W of yq and y 2 : 


W = 


f l/2 

1.— 1/2 
2 l 



3 .- 3/2 
2 l 


(15) 


Since W 7 ^ 0 for t > 0, we conclude that yi and y 2 form a fundamental set of solutions there. 


In several cases we have been able to find a fundamental set of solutions, and 
therefore the general solution, of a given differential equation. However, this is 
often a difficult task, and the question arises as to whether a differential equation 
of the form (2) always has a fundamental set of solutions. The following theorem 
provides an affirmative answer to this question. 


Consider the differential equation (2) 

L[y] = y" + pit)/ + q(t)y = 0, 

whose coefficients p and q are continuous on some open interval 1. Choose some 
point to in 1. Let y i be the solution of Eq. (2) that also satisfies the initial conditions 

y(t 0 ) = 1, / (to) = 0, 

and let y 2 be the solution of Eq. (2) that satisfies the initial conditions 

y(to) = 0, y'(t 0 ) = 1. 


Then y\ and y 2 form a fundamental set of solutions of Eq. (2). 
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First observe that the existence of the functions y \ and y 2 is ensured by the existence 
part of Theorem 3.2.1. To show that they form a fundamental set of solutions, we 
need only calculate their Wronskian at to: 


W(yi,y 2 )(t 0 ) 


yiito) y 2 (to) 


1 0 

Ti(fo) y 2 (.t 0 ) 


0 1 


Since their Wronskian is not zero at the point to, the functions y\ and y 2 do form a 
fundamental set of solutions, thus completing the proof of Theorem 3.2.5. 

Note that the potentially difficult part of this proof, demonstrating the existence 
of a pair of solutions, is taken care of by reference to Theorem 3.2.1. Note also that 
Theorem 3.2.5 does not address the question of how to find the solutions y i and y 2 
by solving the specified initial value problems. Nevertheless, it may be reassuring to 
know that a fundamental set of solutions always exists. 


EXAMPLE 

6 


Find the fundamental set of solutions specified by Theorem 3.2.5 for the differential equation 

y"-y = o, (16) 

using the initial point t 0 = 0. 

In Section 3.1 we noted that two solutions of Eq. (16) are y\{t) = e‘ and y 2 (t) = e~'. The 
Wronskian of these solutions is W {yi,y 2 )(t) = — 2 / 0, so they form a fundamental set of so- 
lutions. However, they are not the fundamental solutions indicated by Theorem 3.2.5 because 
they do not satisfy the initial conditions mentioned in that theorem at the point t = 0. 

To find the fundamental solutions specified by the theorem, we need to find the solutions 
satisfying the proper initial conditions. Let us denote by y 2 (t) the solution of Eq. (16) that 
satisfies the initial conditions 

y(0) = 1, /(0) = 0. (17) 

The general solution of Eq. (16) is 

y = He' + c 2 e~‘, (18) 

and the initial conditions (17) are satisfied if c\ = 1/2 and c 2 = 1/2. Thus 

y 2 (t) = be' + \e~‘ = coshr. 

Similarly, if y 4 (t) satisfies the initial conditions 

y(0) = 0, y' (0) = 1, (19) 


then 


y 4 (t) = \e‘ — \e~‘ = sinhf. 
Since the Wronskian of y 2 and y 4 is 

W(y 3 ,y 4 )(t) = cosh 2 1 — sinh 2 1 = 1, 


these functions also form a fundamental set of solutions, as stated by Theorem 3.2.5. Therefore, 
the general solution of Eq. ( 16) can be written as 

y — k 4 cosh t + k 2 sinh t, (20) 

as well as in the form (18). We have used k\ and k 2 for the arbitrary constants in Eq. (20) 
because they are not the same as the constants c\ and c 2 in Eq. (18). One purpose of this 
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example is to make clear that a given differential equation has more than one fundamental 
set of solutions; indeed, it has infinitely many; see Problem 21. As a rule, you should choose 
the set that is most convenient. 


Now let us examine further the properties of the Wronskian of two solutions of 
a second order linear homogeneous differential equation. The following theorem, 
perhaps surprisingly, gives a simple explicit formula for the Wronskian of any two 
solutions of any such equation, even if the solutions themselves are not known. 


Theorem 3.2.6 (Abel’s Theorem ) 4 

If y i and y 2 are solutions of the differential equation 

L[y] = y" + p(t)y ' + q(t)y = 0, (21) 


where p and q are continuous on an open interval /, then the Wronskian W (yj , y 2 ) (f) 
is given by 


W(y 1 ,y 2 )(t) = cexp 



( 22 ) 


where c is a certain constant that depends on y i and y 2 , but not on f. Further, 
W(yi,y 2 )(t) either is zero for all t in / (if c = 0) or else is never zero in / (if c ^ 0). 


To prove Abel’s theorem, we start by noting that yi and y 2 satisfy 

y” + p(t)y\ + q(t)y\ = 0, 

y'i + p(0y' 2 + q(0y 2 = o. 

If we multiply the first equation by — y 2 , multiply the 
resulting equations, we obtain 

( yiy'z - y'[yi) +p(t)(yiy 2 - y'iyi) 

Next, we let W(t ) = W(yi,y 2 )(f) and observe that 

W' = yty 2 - y"y 2 . (25) 

Then we can write Eq. (24) in the form 

W' + p(t)W = 0. (26) 


(23) 

second by y\, and add the 
= 0. (24) 


4 The result in Theorem 3.2.6 was derived by the Norwegian mathematician Niels Henrik Abel (1802-1829) 
in 1827 and is known as Abel’s formula. Abel also showed that there is no general formula for solving a 
quintic, or fifth degree, polynomial equation in terms of explicit algebraic operations on the coefficients, 
thereby resolving a question that had been open since the sixteenth century. His greatest contributions, 
however, were in analysis, particularly in the study of elliptic functions. Unfortunately, his work was 
not widely noticed until after his death. The distinguished French mathematician Legendre called it a 
“monument more lasting than bronze.” 


154 


Chapter 3. Second Order Linear Equations 


Equation (26) can be solved immediately since it is both a first order linear equation 
(Section 2.1) and a separable equation (Section 2.2). Thus 


W(t) = cexp 



p(t) dt , 


(27) 


where c is a constant. The value of c depends on which pair of solutions of Eq. (21) 
is involved. Elowever, since the exponential function is never zero, W(t) is not 
zero unless c = 0, in which case W (t) is zero for all f , which completes the proof of 
Theorem 3.2.6. 

Note that the Wronskians of any two fundamental sets of solutions of the same 
differential equation can differ only by a multiplicative constant, and that the Wron- 
skian of any fundamental set of solutions can be determined, up to a multiplicative 
constant, without solving the differential equation. Further, since under the condi- 
tions of Theorem 3.2.6 the Wronskian W is either always zero or never zero, you can 
determine which case actually occurs by evaluating W at any single convenient value 
of t. 


EXAMPLE 

7 


In Example 5 we verified that yi(t) — t 1/2 and y 2 {t) = t 1 are solutions of the equation 

2 t 2 y" + 3ty' - y = 0, t > 0. (28) 


Verify that the Wronskian of yi and y 2 is given by Eq. (22). 

From the example just cited we know that W(y 2 ,y 2 )(t) = —(3/2 )r~ 3/2 . To use Eq. (22), we 
must write the differential equation (28) in the standard form with the coefficient of y" equal 
to 1. Thus we obtain 


f+ y~w y= °' 


so p(t) — 3/2 1. Hence 


W(y u y 2 )(t) = c exp | - / —dt 


f/ 


= c exp ( — - In t 


(29) 


Equation (29) gives the Wronskian of any pair of solutions of Eq. (28). For the particular 
solutions given in this example we must choose c = —3/2. 


Summary. We can summarize the discussion in this section as follows: to find the 
general solution of the differential equation 

y" + p(t)y’ + q(t)y = 0, a < t < 

we must first find two functions y\ and y 2 that satisfy the differential equation in 
a < t < ft. Then we must make sure that there is a point in the interval where the 
Wronskian W of y\ and y 2 is nonzero. Under these circumstances y\ and y 2 form a 
fundamental set of solutions and the general solution is 

y = ciyiit) + c 2 y2{t), 

where c\ and c 2 are arbitrary constants. If initial conditions are prescribed at a point 
in a < t < then c\ and c 2 can be chosen so as to satisfy these conditions. 
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PROBLEMS In each of Problems 1 through 6 find the Wronskian of the given pair of functions. 

1. e 2t , e~ 3,/2 2. cos t, sinf 

3. e~ 2t , te~ 2 ' 4. x, xe x 

5. e'sinf, e'cos t 6. cos 2 0, 1 + cos 29 

In each of Problems 7 through 12 determine the longest interval in which the given initial value 
problem is certain to have a unique twice differentiable solution. Do not attempt to find the 
solution. 

7. ty" + 3 y = t, y( 1) = 1, /(l) = 2 

8. (f - l)y" - 3 ty' + 4 y = sin f, y(- 2) = 2, /(- 2) = 1 

9. t(t - 4 )y" + 3 ty' + 4y = 2, y(3) = 0, /(; 3) = -1 

10. y" + (cos t)y' + 3 (In \t\)y = 0, y(2) = 3, y'{ 2) = 1 

11. (jt-3)/' + x/ + (ln|x|)y = 0, y(l) = 0, /( 1) = 1 

12. {x - 2)y" + / + {x - 2) (tan x)y = 0, y(3) = 1, y'( 3) = 2 

13. Verify that yi(t) — t 2 and y 2 (r) = t _1 are two solutions of the differential equation 
t 2 y" — 2y = 0 for t > 0. Then show that y = cq/ 2 + c 2 / _1 is also a solution of this equa- 
tion for any ci and c 2 . 

14. Verify that y\(t) = 1 and y 2 (t) = f 1/2 are solutions of the differential equation 

yy" + (y') 2 = 0 for t > 0. Then show that y = ci + c 2 t 1/2 is not, in general, a solution of 
this equation. Explain why this result does not contradict Theorem 3.2.2. 

15. Show that if y = /(f) is a solution of the differential equation y" + p(t)y' + <7(f)y = g(t), 
where g(t) is not always zero, then y — c<p(t), where c is any constant other than 1, is not a 
solution. Explain why this result does not contradict the remark following Theorem 3.2.2. 

16. Can y = sin(f 2 ) be a solution on an interval containing t = 0 of an equation 
y" + p{t)y' + q(t)y = 0 with continuous coefficients? Explain your answer. 

17. If the Wronskian W of / and g is 3e 4 ', and if f{t) = e 2 ', find g(t). 

18. If the Wronskian W of / and g is t 2 e', and if /(f) = f, find g(t). 

19. If W(f,g) is the Wronskian of / and g, and if u = 2/ — g, v = / 4- 2g. find the Wronskian 
W(u, v) of u and v in terms of W(f,g). 

20. If the Wronskian of / and g is t cos t — sin t, and if u = / + 3g, v = / — g, find the Wronskian 
of u and v. 

21. Assume that y 1 and y 2 are a fundamental set of solutions of y" + p(t)y' + q(t)y = 0 and 

let _y3 = ci\yi + a 2 y 2 and v 4 = £>i_yi + ft 2 y 2 , where a\, o 2 , bi, and b 2 are any constants. Show 

that 

W(y 3 ,y 4 ) = {a\b 2 - a 2 b 1 )W(y 1 ,y 2 ). 

Are y 3 and y 4 also a fundamental set of solutions? Why or why not? 

In each of Problems 22 and 23 find the fundamental set of solutions specified by Theorem 3.2.5 
for the given differential equation and initial point. 

22. y" + / - 2y = 0, t 0 = 0 

23. y" + 4/ + 3y = 0, t 0 = 1 

In each of Problems 24 through 27 verify that the functions y 2 and y 2 are solutions of the given 
differential equation. Do they constitute a fundamental set of solutions? 

24. y" + 4y = 0; y\(t) = cos 2 1, y 2 {t) = sin 2 1 

25. y" - 2/ + y = 0; yi (/) = e\ y 2 (t) = te< 
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26. x 2 y" — x(x + 2 )y' + (x + 2 )y = 0, x > 0; yi(x) = x, y 2 (x) = xe* 

27. (1 — xcotx)y" — xy' + y = 0, 0 < x < jr; y x (x)=x, yi(x) = sinx 

28. Consider the equation y" — y' — 2y = 0. 

(a) Show that yi(i) = e~‘ and yiil) = e 2t form a fundamental set of solutions. 

(b) Let y 3 (t) = -2e 2 \ y 4 (t) = yi(f) + 2y 2 (f), and y 5 (t) = 2yi(f) - 2y 3 (0- Are y 3 (t), y 4 (t ), 

and y 3 (t) also solutions of the given differential equation? 

(c) Determine whether each of the following pairs forms a fundamental set of solutions: 
[yi(t),y 3 {t)\, [y 2 (t),y 3 (t)]; lyi(t),y 4 (t)l, \y 4 (t),y 5 {ty\. 

In each of Problems 29 through 32 find the Wronskian of two solutions of the given 
differential equation without solving the equation. 

29. t 2 y" - t(t + 2 )y' + it + 2)y = 0 30. (cos t)y" + (sin t)y' - ty = 0 

31. x 2 y" + xy' + (x 2 — v 2 )y = 0, Bessel’s equation 

32. (1 — x 2 )y" — 2xy' + a(a + l)y = 0, Legendre’s equation 

33. Show that if p is differentiable and p (f) > 0, then the Wronskian W ( t ) of two solutions 
of [p(f)y']' + q{t)y = 0 is W{t) = c/pit), where c is a constant. 

34. If yi and y 2 are a fundamental set of solutions of ty" + 2y' + te'y = 0 and if 
IV (y 1 ,y 2 )(l) = 2, find the value of W (y i , y 2 ) (5) . 

35. If yi and y 2 are a fundamental set of solutions of t 2 y" — 2 y' + (3 + t)y = 0 and if 
W(y,,y 2 )(2) = 3, find the value of W(yi,y2)(4). 

36. If the Wronskian of any two solutions of y" + p(t)y' + q(t)y = 0 is constant, what does 
this imply about the coefficients p and i ql 

37. If/, g, and h are differentiable functions, show that W(fg,fh ) = f 2 W(g,h). 

In Problems 38 through 40 assume that p and q are continuous and that the functions yi and 
y 2 are solutions of the differential equation y" + p{t)y' + q(t)y = 0 on an open interval I. 

38. Prove that if yi and y 2 are zero at the same point in 7, then they cannot be a fundamental 
set of solutions on that interval. 

39. Prove that if y i and y 2 have maxima or minima at the same point in 7, then they cannot 
be a fundamental set of solutions on that interval. 

40. Prove that if y 3 and y 2 have a common point of inflection to in 7, then they cannot be 
a fundamental set of solutions on 7 unless both p and q are zero at to- 
il. Exact Equations. The equation P(x)y" + Q(x)y' + R(x)y = 0 is said to be exact if it can 

be written in the form \P{x)y'\ + [/(x)y]' = 0, where fix) is to be determined in terms of 
P{x), Q{x), and R(x). The latter equation can be integrated once immediately, resulting 
in a first order linear equation for y that can be solved as in Section 2.1. By equating the 
coefficients of the preceding equations and then eliminating /(x), show that a necessary 
condition for exactness is P"(x) — Q'(x) + R(x) = 0. It can be shown that this is also a 
sufficient condition. 

In each of Problems 42 through 45 use the result of Problem 41 to determine whether the 
given equation is exact. If so, solve the equation. 

42. y" + xy' + y = 0 43. y" + 3x 2 y' + xy = 0 

44. xy" — (cosx)y' + (sinx)y = 0, x > 0 45. x 2 y" + xy' — y = 0, x > 0 

46. The Adjoint Equation. If a second order linear homogeneous equation is not exact, it 
can be made exact by multiplying by an appropriate integrating factor /x(x). Thus we 
require that q(x) be such that p(x)P{x)y" + /r (x)Q(x)y' + p.{x)R{x)y = 0 can be written 
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in the form \p(x)P(x)y']' + [f(x)y ]' = 0. By equating coefficients in these two equations 
and eliminating /(x), show that the function /x must satisfy 

P/x" + (2 P’ - Q)p' + (P" - Q' + R)p = 0. 

This equation is known as the adjoint of the original equation and is important in the 
advanced theory of differential equations. In general, the problem of solving the ad- 
joint differential equation is as difficult as that of solving the original equation, so only 
occasionally is it possible to find an integrating factor for a second order equation. 

In each of Problems 47 through 49 use the result of Problem 46 to find the adjoint of the given 

differential equation. 

47. x 2 y" + xy' + (x 2 — v 2 )y = 0, Bessel’s equation 

48. (1 — x 2 )y" — 2xy' + a(a + l)y = 0, Legendre’s equation 

49. y" — xy = 0, Airy’s equation 

50. For the second order linear equation P(x)y" + Q(x)y' + R(x)y = 0, show that the adjoint 
of the adjoint equation is the original equation. 

51. A second order linear equation P(x)y" + Q(x)y' + R(x)y = 0 is said to be self-adjoint if 
its adjoint is the same as the original equation. Show that a necessary condition for this 
equation to be self-adjoint is that P'(x) = Q(x). Determine whether each of the equations 
in Problems 47 through 49 is self-adjoint. 


3.3 Complex Roots of the Characteristic Equation 

We continue our discussion of the equation 

ay" + by' + cy = 0, (1) 

where a, b, and c are given real numbers. In Section 3.1 we found that if we seek 
solutions of the form y = e rt , then r must be a root of the characteristic equation 

ar 2 + br + c = 0. (2) 

If the roots r\ and r 2 are real and different, which occurs whenever the discriminant 
b 2 — 4 ac is positive, then the general solution of Eq. (1) is 

y — cie nl + c 2 e , - t . (3) 

Suppose now that b 2 — 4 ac is negative. Then the roots of Eq. (2) are conjugate 
complexnumbers; we denote them by 

n = k + ip, r 2 = X - ip, (4) 

where X and p are real. The corresponding expressions for y are 

yi{t) = exp[(k + ip)t], y 2 {t) = exp[(A. — ip)t], (5) 

Our first task is to explore what is meant by these expressions, which involve evaluat- 
ing the exponential function for a complex exponent. For example, if 
X = —1, p = 2, and t — 3, then from Eq. (5), 

Pi(3) = e -3+6 '. 


( 6 ) 
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What does it mean to raise the number e to a complex power? The answer is provided 
by an important relation known as Euler’s formula. 


Euler’s Formula. To assign a meaning to the expressions in Eqs. (5), we need to give 
a definition of the complex exponential function. Of course, we want the definition 
to reduce to the familiar real exponential function when the exponent is real. There 
are several ways to discover how this extension of the exponential function should 
be defined. Here we use a method based on infinite series; an alternative is outlined 
in Problem 28. 

Recall from calculus that the Taylor series for e‘ about t = 0 is 

e 


°° f n 
t _ l _ 

~ n\ ’ 


— OO < t < OO. 


(7) 


n=0 


If we now assume that we can substitute it for t in Eq. (7), then we have 


e 


it 


E 


n = 0 


atr 

n\ 


OO 


= E 

n = 0 


(-1 ) n t 2n 
(2 n)\ 


OO 

+, ‘E 


n = 1 


(- 1 )”— 1 1 2 ”— 1 
(2n — 1 )! 


( 8 ) 


where we have separated the sum into its real and imaginary parts, making use of the 
fact that i 2 = —1, i 3 — —i, i 4 = 1, and so forth. The first series in Eq. (8) is precisely 
the Taylor series for cos t about t = 0, and the second is the Taylor series for sin t 
about t — 0. Thus we have 

e n = cos t + i sin t. (9) 


Equation (9) is known as Euler’s formula and is an extremely important mathe- 
matical relationship. Although our derivation of Eq. (9) is based on the unverified 
assumption that the series (7) can be used for complex as well as real values of the 
independent variable, our intention is to use this derivation only to make Eq. (9) 
seem plausible. We now put matters on a firm foundation by adopting Eq. (9) as the 
definition of e“. In other words, whenever we write e“, we mean the expression on 
the right side of Eq. (9). 

There are some variations of Euler’s formula that are also worth noting. If we 
replace t by — t in Eq. (9) and recall that cos(— t) = cos t and sin(— t) = — sinf, then 
we have 

e~ lt — cost — /sin t. (10) 

Further, if t is replaced by /it in Eq. (9), then we obtain a generalized version of 
Euler’s formula, namely, 

e mt — cos pt + i sin pt. (11) 


Next, we want to extend the definition of the exponential function to arbitrary com- 
plex exponents of the form (X + i/i)t. Since we want the usual properties of the expo- 
nential function to hold for complex exponents, we certainly want 
exp[(A + ip)t] to satisfy 

= e u e i»t (12 ) 
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Then, substituting for e lllt from Eq. (11), we obtain 

e a+ ‘ ll)t = e xt (cos pt + i sin pt) 

= e xt cos pt + ie xt sin pt. (13) 

We now take Eq. (13) as the definition of exp[(A. + ip)t]. The value of the exponential 
function with a complex exponent is a complex number whose real and imaginary 
parts are given by the terms on the right side of Eq. (13). Observe that the real 
and imaginary parts of cxp[(/, + ip)t] are expressed entirely in terms of elementary 
real-valued functions. For example, the quantity in Eq. (6) has the value 

e -3+6r = e -3 cos6 + ie -3 sin6 ^ 0.0478041 - 0.0139113/. 

With the definitions (9) and (13) it is straightforward to show that the usual laws of 
exponents are valid for the complex exponential function. You can also use Eq. (13) 
to verify that the differentiation formula 

-f- (e rt ) = re n (14) 

at 

holds for complex values of r. 


EXAMPLE 

1 


Find the general solution of the differential equation 

y" + y' + 9.25y = 0, 
Also find the solution that satisfies the initial conditions 


v(0) = 2, y (0) = 8, 


(15) 

(16) 


and draw its graph. 

The characteristic equation for Eq. (15) is 

r 2 + r + 9.25 = 0 

so its roots are 

n = — \ + 3i, r 2 = - 1 - 3 i. 

Therefore two solutions of Eq. (15) are 

yi(f) = exp[(— 1 + 3/)/] = e~ ,/2 (cos3t + /sin3t) (17) 

and 

y 2 (t) = exp[(— 1 — 3 i)t] = e“' /2 (cos3 1 — /sin3f). (18) 

You can verify that the Wronskian W(y±,y 2 )(t ) = — 6/e~', which is not zero, so the general 
solution of Eq. (15) can be expressed as a linear combination of yi(f) and y 2 (t) with arbitrary 
coefficients. 

However, rather than using the complex- valued solutions jvi (f) and y 2 (t), let us seek instead 
a fundamental set of solutions of Eq. (15) that are real-valued. From Theorem 3.2.2 we 
know that any linear combination of two solutions is also a solution, so let us form the linear 
combinations y\(t) + y 2 (t) and y\(t) — y 2 {t). In this way we obtain from Eqs. (17) and (18) 

yi (t) + y 2 (t) = 2e _,/2 cos 3 1, jq (f) — y 2 (t) = 2 ie~‘ 12 sin 3 1. 
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FIGURE 3.3.1 Solution of the initial value problem 
y" + / + 9.25 y = 0, y(0) = 2, /(0) = 8 


10 t 


Dropping the multiplicative constants 2 and 2 i for convenience, we are left with 

u(t) = e~ ,/2 cos 3 1, v (t) = e~ l/2 sin3t (19) 

as real-valued solutions of Eq. (15). [If you are not completely sure that u(t) and v(t) are 
solutions of the given differential equation, you should substitute these functions into Eq. (15) 
and confirm that they satisfy it.] On calculating the Wronskian of u(t) and v(t), we find 
that W(u, v)(t) = 3e - '; thus u(t) and v{t) form a fundamental set of solutions and the general 
solution of Eq. (15) can be written as 

y = Ciu(t) + c 2 v(t) = e~ t,2 (c\ cos 3 1 + c 2 sin 3f), (20) 

where c\ and c 2 are arbitrary constants. 

To satisfy the initial conditions (16), we first substitute t = 0 and y = 2 in Eq. (20) with 
the result that c 2 = 2. Then by differentiating Eq. (20), setting t = 0, and y' = 8, we obtain 
— jC i + 3 c 2 = 8, so that c 2 = 3. Thus the solution of the initial value problem (15), (16) is 

y = e -t/2 (2cos3f + 3 sin3t). (21) 

The graph of this solution is shown in Figure 3.3.1. 

From the graph we see that the solution of this problem is a decaying oscillation. The sine 
and cosine factors control the oscillatory nature of the solution, while the negative exponential 
factor in each term causes the magnitude of the oscillations to diminish as time increases. 


Complex Roots; The General Case. The functions yi(f) and yylt)- given by Eqs. (5) and 
with the meaning expressed by Eq. (13), are solutions of Eq. (1) when the roots 
of the characteristic equation (2) are complex numbers X ± i/x. Unfortunately, the 
solutions yi and y 2 are complex-valued functions, whereas in general we would prefer 
to have real- valued solutions, if possible, because the differential equation itself has 
real coefficients. We can proceed just as in Example 1 to find a fundamental set of 
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real-valued solutions. In particular, let us form the sum and then the difference of yq 
and_V 2 . We have 

y ] (t) + y 2 (t ) = e xt (cos fit + i sin fit) + e^cos fit — i sin fit) 

= 2e xt cos fit 

and 

yi(t) — yi(t) = e w (cos fit + i sin fit) — e Xt (cos fit — i sin fit) 

= 2ie' J sin fit. 

Hence, neglecting the constant multipliers 2 and 2 i, respectively, we have obtained a 
pair of real-valued solutions 

u(t) = e xt cos fit, v(t) = e At sin fit. (22) 

Observe that u and v are simply the real and imaginary parts, respectively, of y i . 

By direct computation you can show that the Wronskian of u and v is 

W(u,v)(t) = fie 2xt . (23) 

Thus, as long as fi ^ 0, the Wronskian W is not zero, so u and v form a fundamental 
set of solutions. (Of course, if fi = 0, then the roots are real and the discussion in this 
section is not applicable.) Consequently, if the roots of the characteristic equation 
are complex numbers X ± ifi, with fi / 0, then the general solution of Eq. (1) is 

y = c\e xt cos fit + c 2 e xt sin fit, (24) 

where c\ and C 2 are arbitrary constants. Note that the solution (24) can be written 
down as soon as the values of X and fi are known. Let us now consider some further 
examples. 


EXAMPLE 

2 


Find the solution of the initial value problem 

16 y" - 8/ + 145y = 0, y(0) = -2, /( 0) = 1. (25) 

The characteristic equation is 16r 2 — 8r + 145 = 0 and its roots are r = 1/4 ± 3 i. Thus the 
general solution of the differential equation is 

y = Cie' /4 cos3f + C 2 « ,/4 sin3f. (26) 

To apply the first initial condition, we set t = 0 in Eq. (26); this gives 

y( 0) = ci = - 2 . 

For the second initial condition we must differentiate Eq. (26) and then set t = 0. In this way 
we find that 

/( 0) = \ci + 3c 2 = 1, 

from which c 2 = 1/2. Using these values of c\ and c 2 in Eq. (26), we obtain 

y = -2e ,/4 cos 3r + 1 e' /4 sin 3 1 (27) 

as the solution of the initial value problem (25). The graph of this solution is shown in Fig- 
ure 3.3.2. 
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EXAMPLE 

3 


y 

10 


A 

y = -2e M cos 3 1 + e m sin 3 1 

/ \ 

5 


A / \ 

^ / \ 11 

A. . 

-5 
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\ / \ / 
v / 
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FIGURE 3.3.2 Solution of 16/' - 8/ + 145y = 0, y( 0) = —2, y'(O) = 1. 


In this case we observe that the solution is a growing oscillation. Again the trigonometric 
factors in Eq. (27) determine the oscillatory part of the solution, while the exponential factor 
(with a positive exponent this time) causes the magnitude of the oscillation to increase with 
time. 


Find the general solution of 

y" + 9y = 0. (28) 

The characteristic equation is r 2 + 9 = 0 with the roots r = ±3i; thus X = 0 and /r = 3. The 
general solution is 

y — ci cos 3 1 + c 2 sin 3f; (29) 
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note that if the real part of the roots is zero, as in this example, then there is no exponential 
factor in the solution. Figure 3.3.3 shows the graph of two typical solutions of Eq. (28). In each 
case the solution is a pure oscillation whose amplitude is determined by the initial conditions. 
Since there is no exponential factor in the solution (29), the amplitude of each oscillation 
remains constant in time. 


PROBLEMS 


In each of Problems 1 through 6 use Euler’s formula to write the given expression in the form 
a + ib. 

1. exp(l + 2 i) 2. exp(2 — 3 i) 


3. e i7t 
5. 2 1-i 


4_ e 2-Cr/2 )i 

6. n~ 1+2i 


In each of Problems 7 through 16 find the general solution of the given differential equation. 
7. y" - 2/ + 2y = 0 8. y" - 2/ + 6y = 0 

9. y" + 2/ - 8y = 0 10. y" + 2y' + 2y = 0 

11. y" + 6/ + 13y = 0 12. 4y" + 9y = 0 

13. y" + 2/ + 1.25y = 0 14. 9 y" + 9/ - 4y = 0 

15. y" + / + 1.25y = 0 16. y" + 4 y' + 6.25y = 0 


In each of Problems 17 through 22 find the solution of the given initial value problem. Sketch 
the graph of the solution and describe its behavior for increasing t. 

17. y" + 4 y = 0, y(0) = 0, y'(0) = 1 

18. y" + 4/ + 5y = 0, y(0) = 1, y'( 0) = 0 

19. y" - 2 y + 5y = 0, y{: r/2) = 0, y'(: r/2) = 2 

20. y" + y = 0, y( tt/ 3) = 2, y' ( tt/3) = -4 

21. y" + / + 1.25y = 0, y(0) = 3, /(0) = 1 

22. y" + 2y' + 2y = 0, y(: r/4) = 2, y'(7r/4) = -2 

23. Consider the initial value problem 


3 u" — u' + 2 u = 0, u(0) = 2, «'(0) = 0. 


(a) Find the solution u(t) of this problem. 

(b) For t > 0 find the first time at which \u(t)\ = 10. 

24. Consider the initial value problem 

5u" + 2«' + lu = 0, u( 0) = 2, m'(0) = 1. 


(a) Find the solution u(t ) of this problem. 

(b) Find the smallest T such that \u(t)\ < 0.1 for all t > T. 

25. Consider the initial value problem 

y" + 2y' + 6y = 0, y(0) = 2, y'(0) = a > 0. 

(a) Find the solution y(f) of this problem. 

(b) Find a so that y = 0 when t = 1. 

(c) Find, as a function of a, the smallest positive value of t for which y = 0. 

(d) Determine the limit of the expression found in part (c) as a — > oo. 
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26. Consider the initial value problem 

y" + lay' + ( a 2 + 1 )y = 0, y(0) = 1, /(0) = 0. 

(a) Find the solution y(t) of this problem. 

(b) For a = 1 find the smallest T such that \y(t)\ < 0.1 for t > T. 

(c) Repeat part (b) for a = 1/4, 1/2, and 2. 

(d) Using the results of parts (b) and (c), plot T versus a and describe the relation between 
T and a. 

27. Show that W (e Ar cos /it, e xt sin /it) = fie 22 ' . 

28. In this problem we outline a different derivation of Euler’s formula. 

(a) Show that yi(f) = cosr and y 2 (/) = sinf are a fundamental set of solutions of 
y" + y = 0; that is, show that they are solutions and that their Wronskian is not zero. 

(b) Show (formally) that y = e“ is also a solution of y" + y = 0. Therefore 

e l> = ci cos t + C 2 sin t (i) 


for some constants ci and C 2 . Why is this so? 

(c) Set t = 0 in Eq. (i) to show that ci = 1. 

(d) Assuming that Eq. ( 14) is true, differentiate Eq. (i) and then set t = 0 to conclude that 
C 2 = i- Use the values of ci and C 2 in Eq. (i) to arrive at Euler’s formula. 

29. Using Euler’s formula, show that 

cos t = (e“ + e~“) /2, sin t = ( e “ - e~ u ) /2 i. 

30. If e rt is given by Eq. (13), show that e (n +r2)t = e n 'e r2 ' for any complex numbers ry and r 2 . 

31. If e n is given by Eq. (13), show that 


for any complex number r. 

32. Let the real-valued functions p and q be continuous on the open interval 7, and let 
y = 4>(t ) = u(t) + iv(t) be a complex-valued solution of 

y" + p(t)y' + q(t)y = 0, (i) 

where u and v are real-valued functions. Show that u and v are also solutions of Eq. (i). 
Hint: Substitute y = <f>{t) in Eq. (i) and separate into real and imaginary parts. 

33. If the functions y i and y 2 are a fundamental set of solutions of y" + p(t)y' + q(t)y = 0, 
show that between consecutive zeros of yi there is one and only one zero of y 2 . Note 
that this result is illustrated by the solutions yi(f) = cos t and y 2 (t) = sin t of the equation 
y" + y = 0. 

Hint: Suppose that t\ and t 2 are two zeros of y\ between which there are no zeros of y 2 . 
Apply Rolle’s theorem to y\/y 2 to reach a contradiction. 


Change of Variables. Sometimes a differential equation with variable coefficients, 


y" +p(t)y' + q{t)y = 0, 


(i) 


can be put in a more suitable form for finding a solution by making a change of the independent 
variable. We explore these ideas in Problems 34 through 46. In particular, in Problem 34 we 
show that a class of equations known as Euler equations can be transformed into equations 
with constant coefficients by a simple change of the independent variable. Problems 35 through 


3.3 Complex Roots of the Characteristic Equation 


165 


42 are examples of this type of equation. Problem 43 determines conditions under which the 
more general Eq. (i) can be transformed into a differential equation with constant coefficients. 
Problems 44 through 46 give specific applications of this procedure. 

34. Euler Equations. An equation of the form 

‘ 2 ^ +at t + Py = °’ f> °' (H) 

where a and p are real constants, is called an Euler equation. 

(a) Let x = Inf and calculate dy/dt and d 2 y/dt 2 in terms of dy/dx and d 2 y/dx 2 . 

(b) Use the results of part (a) to transform Eq. (ii) into 

<■> 

Observe that Eq. (iii) has constant coefficients. Ifyi(x) andy 2 00 form a fundamental set 
of solutions of Eq. (iii), then yi(lnf) and y->(lnt) form a fundamental set of solutions of 
Eq. (ii). 


In each of Problems 35 through 42 use the method of Problem 34 to solve the given equation 
for t > 0. 


35. t 2 y" + ty' + y = 0 
37. t 2 y" + 3fy' + 1.25y = 0 
39. t 2 y" - My' + 6y = 0 

41. t 2 y" + 3ty' — 3y = 0 


36. t 2 y" + 4fy' + 2y = 0 
38. t 2 y" — 4ty' — 6y = 0 
40. t 2 y" - ty' + 5y = 0 

42. t 2 y" + Ity' + lOy = 0 


43. In this problem we determine conditions on p and q that enable Eq. (i) to be transformed 
into an equation with constant coefficients by a change of the independent variable. Let 
x — u(t ) be the new independent variable, with the relation between x and t to be specified 
later. 

(a) Show that 

dy dx dy d 2 y / dx\ 2 d 2 y d 2 x dy 

dt dt dx ’ dt 2 \dt ) dx 2 dt 2 dx 


(b) Show that the differential equation (i) becomes 


/ dx\ 2 d 2 y 
\dt ) dx 2 ~*~ 


/ d 2 x 
[it 2 


dx\ 
+ P(t) dt) 


dy 

dx 


+ q(t)y = 0. 


(iv) 


(c) In order for Eq. (iv) to have constant coefficients, the coefficients of d 2 y/dx 2 and of y 
must be proportional. If q(t) > 0, then we can choose the constant of proportionality to 
be 1; hence 

x = u(t) = J [q(tj] 112 dt. (v) 


(d) With x chosen as in part (c), show that the coefficient of dy/dx in Eq. (iv) is also a 
constant, provided that the expression 


q'(t) + 2p(t)q(t) 
2[q(t)]V 2 


(vi) 


is a constant. Thus Eq. (i) can be transformed into an equation with constant coefficients 
by a change of the independent variable, provided that the function ( q ' + 2pq)/q 3/2 is a 
constant. How must this result be modified if q(t) < 0? 
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In each of Problems 44 through 46 try to transform the given equation into one with constant 
coefficients by the method of Problem 43. If this is possible, find the general solution of the 
given equation. 

44. y" + ty' + e~' 2 y = 0, — oo < t < oo 

45. y" + 3 ty 1 + t 2 y = 0, -oo < t < oo 

46. ty" + ( t 2 — 1 )y' + t 3 y = 0, 0 < t < oo 


3.4 Repeated Roots; Reduction of Order 

In earlier sections we showed how to solve the equation 

ay" + by' + cy = 0 (1) 

when the roots of the characteristic equation 

ar 2 + br + c — 0 (2) 

either are real and different or are complex conjugates. Now we consider the third 
possibility, namely, that the two roots r\ and ri are equal. This case is transitional 
between the other two and occurs when the discriminant b 2 — 4 ac is zero. Then it 
follows from the quadratic formula that 

r i = r 2 = -b/2a. (3) 

The difficulty is immediately apparent; both roots yield the same solution 

yi (t) = e~ bt ' 2a (4) 

of the differential equation (1), and it is not obvious how to find a second solution. 


Solve the differential equation 

EXAMPLE y" + 4/ + 4y = 0. (5) 

The characteristic equation is 

r 2 + 4r + 4 = (r + 2) 2 = 0, 

so r\ = r 2 = —2. Therefore one solution of Eq. (5) is yifi) = e~ 2t . To find the general solution 
of Eq. (5), we need a second solution that is not a multiple of yq. This second solution can 
be found in several ways (see Problems 20 through 22); here we use a method originated by 
D’Alembert 5 in the eighteenth century. Recall that since y\(t) is a solution of Eq. (1), so is 
cyi{t) for any constant c. The basic idea is to generalize this observation by replacing c by a 


Dean d’Alembert (1717-1783), a French mathematician, was a contemporary of Euler and Daniel 
Bernoulli and is known primarily for his work in mechanics and differential equations. D'Alembert’s 
principle in mechanics and d’Alembert’s paradox in hydrodynamics are named for him, and the wave 
equation first appeared in his paper on vibrating strings in 1747. In his later years he devoted himself 
primarily to philosophy and to his duties as science editor of Diderot’s Encyclopedie. 
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function v(t ) and then trying to determine v(t) so that the product v(t)y\ (t) is also a solution 
of Eq. (1). 

To carry out this program, we substitute y = v(t)yi (t) in Eq. (5) and use the resulting equation 
to find v(t). Starting with 

y = uCOyiCO = v(t)e~ 2 ‘, (6) 

we have 

y' = i/(r)e -2 ' — 2v{t)e~ 2t (7) 


and 


y" = v"(t)e 2 ‘ — 4 v'(t)e 21 + 4 v(t)e 2t . 


( 8 ) 


By substituting the expressions in Eqs. (6), (7), and (8) in Eq. (5) and collecting terms, we 
obtain 


[v"(t) - 4v'(t) + 4 v(t) + 4 v\t) - 8 v(t) + 4v(t)]e~ 2 ‘ = 0, 


which simplifies to 
Therefore 

and 


v"{t) = 0 . 


v\t) = Cl 


( 9 ) 


v{t) = C\t + c 2 , (10) 

where ci and c? are arbitrary constants. Finally, substituting for v(t) in Eq. (6), we obtain 

y = Cite -2 ' + C 2 e -2 '. (11) 


The second term on the right side of Eq. (11) corresponds to the original solution 
yi(r) = exp(— 2t), but the first term arises from a second solution, namely, y 2 (t) = texp(— 2t). 
We can verify that these two solutions form a fundamental set by calculating their Wronskian: 


W(yi,y 2 )(t) 


e -2 ' te -2 ' 

— 2e -2 ' (1 - 2 t)e~ 2 ‘ 


= e -4 ' - 2fe -4 ' + 2fe -4 ' = e -4 ' £ 0. 


Therefore 

yi(t) = e -2 ', y 2 (t) = te~ 2 ‘ (12) 

form a fundamental set of solutions of Eq. (5), and the general solution of that equation is 
given by Eq. (11). Note that both y\(t) and y 2 (t) tend to zero as t — > oo; consequently, all 
solutions of Eq. (5) behave in this way. The graph of a typical solution is shown in Figure 3.4.1. 


y 

2 


1 


I I I I 

0.5 1 1.5 2 

FIGURE 3.4.1 A typical solution of y" + 4 y’ + 4y = 0. 
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The procedure used in Example 1 can be extended to a general equation whose 
characteristic equation has repeated roots. That is, we assume that the coefficients 
in Eq. (1) satisfy b 2 — 4 ac = 0, in which case 

yiit) = e- bt ' 2a 


is a solution. To find a second solution, we assume that 

y = v(t)yi(t ) = v(t)e~ b,/2a 

and substitute for y in Eq. (1) to determine v(t). We have 


and 


/ = v'(t)e- b,/2a v(t)e~ b,/2a 

2 a 


y" = i >"{t)e~ bt/2a - -v'(t)e~ bt/2a + ^v{t)e~ bt/2a . 
a 4 a 1 


(13) 

(14) 

(15) 


Then, by substituting in Eq. (1), we obtain 

b 


b b 2 


v'(t) - —v(t) 
2 a 


+ cv(t) \e~ bt/2a = 0. (16) 


Canceling the factor exp(— bt/2a), which is nonzero, and rearranging the remaining 
terms, we find that 


' b 2 b 2 

av"{t) + {—b + b)v'(t) + ( — - — + c ) v(t) = 0. 


(17) 


The term involving v'(t) is obviously zero. Further, the coefficient of v(t) is 
c — (h 2 /4n), which is also zero because b 2 — 4nc = 0 in the problem that we are 
considering. Thus, just as in Example 1, Eq. (17) reduces to 


v"(t) = 0; 


therefore 


Hence, from Eq. (13), we have 


V(t) = Cl + C2t. 


y = cie fo / 2a + c 2 te~ bt/2a . 

Thus y is a linear combination of the two solutions 

yi(t) = e~ bt ' 2a , yiit) = te~ bt/2a . 
The Wronskian of these two solutions is 

e -bt/la te -bt/2a 


WiyuyiXt) = 


b 


2 a 


bt \ 


e ~bt/2a I | | e -bt/la 


2 a 1 


= e~ b,la . 


(18) 

(19) 


( 20 ) 


Since W (yi,yi){t) is never zero, the solutions y i and y 2 given by Eq. (19) are a 
fundamental set of solutions. Further, Eq. (18) is the general solution of Eq. (1) 
when the roots of the characteristic equation are equal. In other words, in this case 
there is one exponential solution corresponding to the repeated root, and a second 
solution that is obtained by multiplying the exponential solution by t. 
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EXAMPLE 

2 


Find the solution of the initial value problem 

/' ~y' + 0.25y = 0, y( 0) = 2, /(0) = |. (21) 

The characteristic equation is 

r - r + 0.25 = 0, 

so the roots are r\ = n = 1 /2. Thus the general solution of the differential equation is 

y = C\e’ 12 + c 2 fe ,/2 . (22) 

The first initial condition requires that 

y(0) = ci = 2. 

To satisfy the second initial condition, we first differentiate Eq. (22) and then set t = 0. This 
gives 

y (0) = \ci +c 2 = 

so c 2 — —2/3. Thus the solution of the initial value problem is 

y = 2e ,/2 - |fe' /2 . (23) 

The graph of this solution is shown in Figure 3.4.2. 


y 

4 

y'(0) = 2: y = 2e tl2 + te t/2 

3 


2 

y(0) = | : y = 2e tl2 - |fe i/2 

1 

1 1 . 


1 2 3\ t 

-1 



FIGURE 3.4.2 Solutions of y" - y' + 0.25y = 0, >’(()) = 2, 

withy'(0) = 1/3 and withy'(0) = 2, respectively. 


Let us now modify the initial value problem (21) by changing the initial slope; to be specific, 
let the second initial condition be /( 0) = 2. The solution of this modified problem is 

y = 2e' /2 + te 1 ' 2 , 

and its graph is also shown in Figure 3.4.2. The graphs shown in this figure suggest that there is a 
critical initial slope, with a value between | and 2, that separates solutions that grow positively 
from those that ultimately grow negatively. In Problem 16 you are asked to determine this 
critical initial slope. 
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The geometrical behavior of solutions is similar in this case to that when the roots 
are real and different. If the exponents are either positive or negative, then the 
magnitude of the solution grows or decays accordingly; the linear factor t has little 
influence. A decaying solution is shown in Figure 3.4.1 and growing solutions in 
Figure 3.4.2. However, if the repeated root is zero, then the differential equation is 
y" — 0 and the general solution is a linear function of t. 


Summary. We can now summarize the results that we have obtained for second order 
linear homogeneous equations with constant coefficients 

ay" + by’ + cy = 0. (1) 

Let r\ and r 2 be the roots of the corresponding characteristic polynomial 

ar 2 + br + c = 0. (2) 

If r\ and r 2 are real but not equal, then the general solution of the differential 
equation (1) is 

y = Cl e nt + c 2 e rit . (24) 

If r\ and r 2 are complex conjugates X ± iji, then the general solution is 

y — c\e Xt cos/xf + c 2 e xt sinpt. (25) 

If n = r 2 , then the general solution is 

y = C\e r '‘ + c 2 te nt . (26) 

Reduction of Order. It is worth noting that the procedure used in this section for equa- 
tions with constant coefficients is more generally applicable. Suppose that we know 
one solution not everywhere zero, of 

f + P(t)? + q(t)y = 0. (27) 

To find a second solution, let 

y = v(t)y 1 (t); (28) 

then 

/ = v\t)yi(t) + v(t)y\(t) 

and 

y" = v"(t)yi(t) + 2v'(t)y[(t) + v(t)y'((t). 

Substituting for y, y', and y" in Eq. (27) and collecting terms, we find that 

yiv" + (2 y\ + pyi)v' + (y'{ +py[ + qy{)v = 0. (29) 

Since y i is a solution of Eq. (27), the coefficient of v in Eq. (29) is zero, so that Eq. (29) 
becomes 

yw" + (2y\ + pyi)v’ = 0. (30) 

Despite its appearance, Eq. (30) is actually a first order equation for the function v' 
and can be solved either as a first order linear equation or as a separable equation. 
Once v' has been found, then v is obtained by an integration. Finally, y is determined 
from Eq. (28). This procedure is called the method of reduction of order, because the 
crucial step is the solution of a first order differential equation for v' rather than the 
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EXAMPLE 

3 


PROBLEMS 


original second order equation for y. Although it is possible to write down a formula 
for v(t), we will instead illustrate how this method works by an example. 

Given that y\{t) — r * 1 11 is a solution of 

2 t 2 y" + 3fy' - y = 0, t> 0, (31) 

find a fundamental set of solutions. 

We set y = v(t)t _1 ; then 

/ = v'r 1 - vt~ 2 , y" = v"r l - 2 v'r 2 + 2 Vt~ 3 . 

Substituting for y, y', and y" in Eq. (31) and collecting terms, we obtain 

2t 2 (v"r l — 2vt~ 2 + 2 vt~ 3 ) + 3t(v’t~ 1 — vt~ 2 ) — vt _1 
= 2 tv" + (-4 + 3)v’ + (4r 1 - 3 r 1 - r')v 
= 2 tv" -v' = 0. (32) 

Note that the coefficient of v is zero, as it should be; this provides a useful check on our algebra. 
Separating the variables in Eq. (32) and solving for i /(f), we find that 

v'(t) = ct 1 ' 2 ; 

then 

v(t) = |cf 3/2 + k. 

It follows that 

y=lct 1/2 + kt~ 1 , (33) 

where c and k are arbitrary constants. The second term on the right side of Eq. (33) is a 
multiple of y\(t) and can be dropped, but the first term provides a new solution y 2 (t) = t 1/2 . 
You can verify that the Wronskian of y \ and y 2 is 

W{yx,y 2 m = |r 3/2 , t > 0. (34) 

Consequently, y i and y 2 form a fundamental set of solutions of Eq. (31). 


In each of Problems 1 through 10 find the general solution of the given differential equation. 

1. y" - 2/ + y = 0 2. 9 y" + 6/ + y = 0 

3. 4 y" - 4 y - 3y = 0 4. 4/' + 12/ + 9y = 0 

5. y" - 2 y' + lOy = 0 6. y" - 6/ + 9y = 0 

7. 4y" + 17/ + 4y = 0 8. 16/' + 24/ + 9y = 0 

9. 25/' - 20/ + 4y = 0 10. 2/' + 2/ + y = 0 


In each of Problems 11 through 14 solve the given initial value problem. Sketch the graph of 
the solution and describe its behavior for increasing t. 

11. 9y" - 12/ + 4y = 0, y(0) = 2, y'(0) = -1 

12. y" - 6/ + 9y = 0, y(0) = 0, y'(0) = 2 

13. 9y" + 6/ + 82y = 0, y(0) = -1, y'(0) = 2 

14. y" + 4/ + 4y = 0, y ( — 1) = 2, y'(-l) = 1 
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15. Consider the initial value problem 

4 y" + 12/ + 9y = 0, y( 0) = 1, /(()) = -4. 

(a) Solve the initial value problem and plot its solution for 0 < t < 5. 

(b) Determine where the solution has the value zero. 

(c) Determine the coordinates (to,yo) of the minimum point. 

(d) Change the second initial condition to y'( 0) = b and find the solution as a function 
of b. Then find the critical value of b that separates solutions that always remain positive 
from those that eventually become negative. 

16. Consider the following modification of the initial value problem in Example 2: 

y" - / + 0.25 y = 0, y(0) = 2, /(0) = b. 

Find the solution as a function of b and then determine the critical value of b that separates 
solutions that grow positively from those that eventually grow negatively. 

17. Consider the initial value problem 

4/' + 4/ +y = 0, y( 0) = 1, /(0) = 2. 

(a) Solve the initial value problem and plot the solution. 

(b) Determine the coordinates ((mJm) of the maximum point. 

(c) Change the second initial condition to /(0) = b > 0 and find the solution as a function 
of b. 

(d) Find the coordinates ( t M ,yM ) of the maximum point in terms of b. Describe the 
dependence of t M and y M on b as b increases. 

18. Consider the initial value problem 

9/' + 12/ + 4y = 0, y(0) = a > 0, /( 0) = — 1. 

(a) Solve the initial value problem. 

(b) Find the critical value of a that separates solutions that become negative from those 
that are always positive. 

19. If the roots of the characteristic equation are real, show that a solution of 

ay" + by' + cy = 0 is either everywhere zero or else can take on the value zero at most 
once. 

Problems 20 through 22 indicate other ways of finding the second solution when the charac- 
teristic equation has repeated roots. 

20. (a) Consider the equation y" + 2 ay' + a 2 y = 0. Show that the roots of the characteristic 
equation are n = r 2 = —a, so that one solution of the equation is e~ at . 

(b) Use Abel’s formula [Eq. (22) of Section 3.2] to show that the Wronskian of any two 
solutions of the given equation is 

W(t) = yi(t)y' 2 (t) - y[(t)y 2 (t) = cie“ 2 ", 

where ci is a constant. 

(c) Let yi (t) = e~ at and use the result of part (b) to obtain a differential equation satisfied 
by a second solution y 2 (t). By solving this equation, show that y 2 (t) = te~“‘. 

21. Suppose that r\ and r 2 are roots of ar 2 + br + c = 0 and that r\ / r 2 , then exp/it) 
and expert) are solutions of the differential equation ay" + by ' + cy = 0. Show that 
<p(t;r u r 2 ) = [exp (r 2 t) — exp(rif)]/(r 2 — ti) is also a solution of the equation for r 2 / r\. 
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Then think of ri as fixed and use L’Hospital’s rule to evaluate the limit of <t>{t\r\,r 2 ) as 
ri —■ > r\, thereby obtaining the second solution in the case of equal roots. 

22. (a) If ar 1 + br + c = 0 has equal roots r\, show that 

L[e rt ] = a(e rt )" + b(e rt )' + ce rt = a(r — r\) 2 e n . (i) 


Since the right side of Eq. (i) is zero when r = r lt it follows that exp fat) is a solution of 
L[y ] = ay" + by' + cy = 0. 

(b) Differentiate Eq. (i) with respect to r and interchange differentiation with respect to 
r and with respect to t, thus showing that 


^L[e rt ] = L 
dr 



= L[te n ] = ate r ‘(r — r\) 2 + 2ae"{r — ri). 


(ii) 


Since the right side of Eq . (ii) is zero when r = r\, conclude that t exp fa t) is also a solution 
of L[y] = 0. 


In each of Problems 23 through 30 use the method of reduction of order to find a second 
solution of the given differential equation. 


23. 

t 2 y" 

- 4 ty' + 6y = 0, 

t > 0; 

yi(t) = t 2 

24. 

t 2 y" 

+ 2 ty' - 2y = 0, 

t > 0; 

yi(t) = t 

25. 

t 2 y" 

+ 3 ty' + y = 0, 

t > 0; 

yi(t) = r 1 

26. 

t 2 y" 

1 

+ 

■77 

+ 

+• 2 )y = 0, 

t > 0; yi(f) = t 

27. 

xy" 

- / + 4x 3 y = 0, 

x > 0; 

yj(x) = sinx 2 

28. 

(x- 

1 )y" - xy'+y = 

0, x > 1 

; yi(x) = e* 

29. 

x 2 y" 

- (x - 0.1875 )y 

= 0, x > 

0; yi (x) = x 1/A e 2 ^ 

30. 

x 2 y" 

+ xy' + (x 2 - 0.25 )y = 0, 

x > 0; yi(x) = x~ 1/2 sinx 

31. 

The 

differential equation 



xy" ~(x + N)y' + Ny = 0, 


where IV is a nonnegative integer, has been discussed by several authors. 6 One reason 
why it is interesting is that it has an exponential solution and a polynomial solution. 

(a) Verify that one solution is yi(x) = e x . 

(b) Show that a second solution has the form y 2 (v) = ce* J x N e~ x dx. Calculate yifa) for 
N — 1 and N = 2; convince yourself that, with c = — 1/AH, 


y 2 (x) = 1 + 


2 N 

X X X 

v. + 2\ + "' + m 


Note that y 2 (x) is exactly the first N + 1 terms in the Taylor series about x = 0 for e x . that 
is, for y\(x). 

32. The differential equation 

y" + 8(xy' + y) = 0 


arises in the study of the turbulent flow of a uniform stream past a circular cylinder. Verify 
that y\(x) = exp(— &x 2 /2) is one solution and then find the general solution in the form of 
an integral. 


6 T. A. Newton, “On Using a Differential Equation to Generate Polynomials,” American Mathematical 
Monthly 81 (1974), pp. 592-601. Also see the references given there. 
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33. The method of Problem 20 can be extended to second order equations with variable 
coefficients. If yi is a known nonvanishing solution of y" + p(t)y' + q(t)y = 0, show that 
a second solution y 2 satisfies (yi/yiY = W(y\,y 2 )/y\, where W(yi,y 2 ) is the Wronskian of 
yi and y 2 . Then use Abel’s formula [Eq. (22) of Section 3.2] to determine y 2 . 

In each of Problems 34 through 37 use the method of Problem 33 to find a second independent 
solution of the given equation. 

34. t 2 y" + 3ty' + y = 0, t > 0; y x (t) = r 1 

35. ty" — y' + 4t 3 y = 0, t > 0; yi(t) = sin(f 2 ) 

36. (x-l)y"-x/+y = 0, x > 1; yi ( X ) = e x 

37. x 2 y" + xy' + (x 2 — 0.25)y = 0, x > 0; yi(x) = x _1/2 sinx 

Behavior of Solutions as t -*• oo. Problems 38 through 40 are concerned with the behavior 
of solutions as t -> oo. 

38. If n, b. and c are positive constants, show that all solutions of ay" + by' + cy = 0 approach 
zero as t -> oo. 

39. (a) If a > 0 and c > 0, but b = 0, show that the result of Problem 38 is no longer true, but 
that all solutions are bounded as t — > oo. 

(b) If a > 0 and b > 0, but c = 0, show that the result of Problem 38 is no longer true, but 
that all solutions approach a constant that depends on the initial conditions as t — > oo. 
Determine this constant for the initial conditions y(0) = yo>/(0) = y' a . 

40. Show that y = sin t is a solution of 

y" + ( k sin 2 t)y' + (1 — k cos t sin t)y = 0 

for any value of the constant k. If 0 < k < 2, show that 1 — k cos t sin t > 0 and k sin 2 1 > 0. 
Thus observe that even though the coefficients of this variable-coefficient differential equa- 
tion are nonnegative (and the coefficient of y' is zero only at the points t = 0,n,2n , . . .), 
it has a solution that does not approach zero as t — >■ oo. Compare this situation with the 
result of Problem 38. Thus we observe a not unusual situation in the study of differential 
equations: equations that are apparently very similar can have quite different properties. 

Euler Equations. In each of Problems 41 through 46 use the substitution introduced in Prob- 
lem 34 in Section 3.3 to solve the given differential equation. 

41. f-y" - 3 ty' + 4y = 0, t > 0 

42. t 2 y" + 2 ty' + 0.25y = 0, t > 0 

43. 2 t 2 y" - 5 ty' + 5y = 0, t > 0 

44. t 2 y" + 3ty' + y = 0, / > 0 

45. 4 t 2 y" - 8ty' + 9y = 0, t > 0 

46. t 2 y" + 5 ty' + 13y = 0, t > 0 


3.5 Nonhomogeneous Equations; Method of Undetermined Coefficients 

We now return to the nonhomogeneous equation 

L[y] = y" + p(t)y' + q(t)y = g(t), (1) 

where p, q, and g are given (continuous) functions on the open interval 1. The 
equation 


L\y] = y" +p(t)y' + q(t)y = 0, 


( 2 ) 
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Theorem 3.5.1 


Theorem 3.5.2 


in which g(t ) = 0 and p and q are the same as in Eq. (1), is called the homogeneous 
equation corresponding to Eq. (1). The following two results describe the structure 
of solutions of the nonhomogeneous equation (1) and provide a basis for constructing 
its general solution. 


If Y\ and Y 2 are two solutions of the nonhomogeneous equation (1), then their 
difference Y\ — Y 2 is a solution of the corresponding homogeneous equation (2). 
If, in addition, y\ and y 2 are a fundamental set of solutions of Eq. (2), then 

Yi(t) - Y 2 {t) = c\y\(t) + c 2 y 2 (0, (3) 

where c\ and c 2 are certain constants. 


To prove this result, note that Y\ and E 2 satisfy the equations 
L[Y 1 m=g(t), L[Y 2 m=g(t). 

Subtracting the second of these equations from the first, we have 
L[Y<m - L[Y 2 Kt) = git ) - git) = 0. 


However, 


L[Y 1 ] - L[Y 2 ] = L[Y\ - y 2 ], 


(4) 

(5) 


so Eq. (5) becomes 


L[Y 1 -Y 2 m = 0. 


( 6 ) 


Equation (6) states that Y\ — Y 2 is a solution of Eq. (2). Finally, since all solutions of 
Eq. (2) can be expressed as linear combinations of a fundamental set of solutions by 
Theorem 3.2.4, it follows that the solution Y\ — T 2 can be so written. Hence Eq. (3) 
holds and the proof is complete. 


The general solution of the nonhomogeneous equation (1) can be written in the 
form 

y = 0(0 = ciyiit) + c 2 yiit) + Yit), (7) 

where y 1 and y 2 are a fundamental set of solutions of the corresponding homoge- 
neous equation (2), c\ and c 2 are arbitrary constants, and Y is some specific solution 
of the nonhomogeneous equation (1). 


The proof of Theorem 3.5.2 follows quickly from the preceding theorem. Note 
that Eq. (3) holds if we identify Y\ with an arbitrary solution cp of Eq. (1) and Y 2 with 
the specific solution Y. From Eq. (3) we thereby obtain 

(pit) - Y it) = ciyi it) + c 2 _y 2 it), (8) 

which is equivalent to Eq. (7). Since cp is an arbitrary solution of Eq. (1), the expres- 
sion on the right side of Eq. (7) includes all solutions of Eq. (1); thus it is natural to 
call it the general solution of Eq. (1). 
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EXAMPLE 

1 


In somewhat different words. Theorem 3.5.2 states that to solve the nonhomoge- 
neous equation (1), we must do three things: 

1. Find the general solution ciyift) + c 2 y 2 {t ) of the corresponding homogeneous equation. 
This solution is frequently called the complementary solution and may be denoted by y c (t). 

2. Find some single solution Y ( t ) of the nonhomogeneous equation. Often this solution is 
referred to as a particular solution. 

3. Add together the functions found in the two preceding steps. 

We have already discussed how to find y c (t), at least when the homogeneous equa- 
tion (2) has constant coefficients. Therefore, in the remainder of this section and in 
the next, we will focus on finding a particular solution Y ( t ) of the nonhomogeneous 
equation (1). There are two methods that we wish to discuss. They are known as the 
method of undetermined coefficients (discussed here) and the method of variation 
of parameters (see Section 3.6), respectively. Each has some advantages and some 
possible shortcomings. 

Method of Undetermined Coefficients. The method of undetermined coefficients requires 
us to make an initial assumption about the form of the particular solution Y ( t ), but 
with the coefficients left unspecified. We then substitute the assumed expression into 
Eq. (1) and attempt to determine the coefficients so as to satisfy that equation. If we 
are successful, then we have found a solution of the differential equation (1) and can 
use it for the particular solution Y(t). If we cannot determine the coefficients, then 
this means that there is no solution of the form that we assumed. In this case we may 
modify the initial assumption and try again. 

The main advantage of the method of undetermined coefficients is that it is straight- 
forward to execute once the assumption is made as to the form of Y(t). Its major 
limitation is that it is useful primarily for equations for which we can easily write 
down the correct form of the particular solution in advance. For this reason, this 
method is usually used only for problems in which the homogeneous equation has 
constant coefficients and the nonhomogeneous term is restricted to a relatively small 
class of functions. In particular, we consider only nonhomogeneous terms that con- 
sist of polynomials, exponential functions, sines, and cosines. Despite this limitation, 
the method of undetermined coefficients is useful for solving many problems that 
have important applications. However, the algebraic details may become tedious, 
and a computer algebra system can be very helpful in practical applications. We will 
illustrate the method of undetermined coefficients by several simple examples and 
then summarize some rules for using it. 


Find a particular solution of 

y" - 3/ -4 y = 3e 2 '. (9) 

We seek a function Y such that the combination Y"(t) — 3 Y'(t) — 4 Y (t) is equal to 3e 2 '. 
Since the exponential function reproduces itself through differentiation, the most plausible 
way to achieve the desired result is to assume that Y ft) is some multiple of e 2t , that is, 


Y ft) = Ae 2 ', 
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EXAMPLE 

2 


where the coefficient A is yet to be determined. To find A , we calculate 

Y'(t) = 2 Ae 2 ', Y"(t) = 4 Ae 2t , 

and substitute for y, y ' , and y" in Eq. (9). We obtain 

(4 A -6 A- 4A)e 2! = 3e 2 '. 

Hence — 6Ae 2 ' must equal 3e 2 ', so A = —1/2. Thus a particular solution is 

Y(f) = -\e 2t . (10) 


Find a particular solution of 

y" — 3/ — 4y = 2 sin t. (11) 

By analogy with Example 1, let us first assume that Y (t) = A sin f, where A is a constant to 
be determined. On substituting in Eq. (11) and rearranging the terms, we obtain 

— 5A sin t — 3 A cos t = 2 sin t. 


or 

(2 + 5/1) sin t + 3A cos t = 0. (12) 

We want Eq. (12) to hold for all t. Thus it must hold for two specific points, such as t = 0 
and t = jr/2. At these points Eq. (12) reduces to 3A = 0 and 2 + 5A = 0, respectively. These 
contradictory requirements mean that there is no choice of the constants that makes Eq. (12) 
true for t = 0 and t = n / 2, much less for all t. Thus we conclude that our assumption concerning 
Y ( t ) is inadequate. The appearance of the cosine term in Eq. (12) suggests that we modify our 
original assumption to include a cosine term in Y (f); that is, 

Y(t) = A sin t + B cos t, 

where A and B are to be determined. Then 

Y\t) = A cost — Bsinf, Y"(t) = — Asint — Boost. 

By substituting these expressions for y , /, and y" in Eq. (11 ) and collecting terms, we obtain 

(—A + 3 B — 4 A) sin t + (— B — 3A — 4B) cos t = 2 sin t . (13) 

To satisfy Eq. (13), we must match the coefficients of sin t and cos t on each side of the equation; 
thus A and B must satisfy the equations 

—5 A + 3B = 2, -3 A -5B = 0. 

Hence A = —5/17 and B = 3/17, so a particular solution of Eq. (11) is 

Y(0 = -^sinr+ ^ cos t. 


The method illustrated in the preceding examples can also be used when the right 
side of the equation is a polynomial. Thus, to find a particular solution of 

y" - 3/ - 4 y = 4 1 2 - 1, (14) 

we initially assume that Y (t) is a polynomial of the same degree as the nonhomoge- 
neous term, that is, Y ( t ) = At 2 + Bt + C. 
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EXAMPLE 

4 


To summarize our conclusions up to this point: if the nonhomogeneous term g(t ) 
in Eq. (1) is an exponential function e at , then assume that Y ( t ) is proportional to 
the same exponential function; if g(t) is sin fit or cos fit, then assume that Y(t) is a 
linear combination of sin fit and cos fit; if g{t) is a polynomial, then assume that Y ( t ) 
is a polynomial of like degree. The same principle extends to the case where g(t ) is 
a product of any two, or all three, of these types of functions, as the next example 
illustrates. 


Find a particular solution of 

y" — 3 y' — 4 y = —8e ! cos 2 1. 


(15) 


In this case we assume that Y (t) is the product of e' and a linear combination of cos 2 1 and 
sin 2f , that is, 


Y (f) = Ae‘ cos 2 1 + Be' sin 2 1. 

The algebra is more tedious in this example, but it follows that 

Y'(t) = {A + 2 B)e' cos 2 1 + (—2 A + B)e' sin 2 1 


and 

Y"(f) = (— 3/1 + AB)e‘ cos2r + (— AA — 3B)e‘ sin2f. 

By substituting these expressions in Eq. (15), we find that A and B must satisfy 


1QA + 2B = 8, 2 A- 10 B = 0. 


Hence A = 10/13 and B = 2/13; therefore a particular solution of Eq. (15) is 

Y (t) = f|e r cos 2 1 + j^e' sin2t. 


Now suppose that g(t) is the sum of two terms, g(t) = gi(t) + g 2 (t ), and suppose 
that Yi and Y 2 are solutions of the equations 

ay" + by' + cy = gi{t) (16) 

and 

ay" + by' + cy = g 2 (t), (17) 

respectively. Then Y\ + Y 2 is a solution of the equation 

ay" + by' + cy = g{t). (18) 

To prove this statement, substitute Yi(t) + Y 2 {t) for y in Eq. (18) and make use 
of Eqs. (16) and (17). A similar conclusion holds if g(t) is the sum of any finite 
number of terms. The practical significance of this result is that for an equation 
whose nonhomogeneous function g(t) can be expressed as a sum, one can consider 
instead several simpler equations and then add the results together. The following 
example is an illustration of this procedure. 


Find a particular solution of 

y" — 3 y' — 4 y = 3e 2 ' + 2 sin t — 8e' cos 2 1. (19) 

By splitting up the right side of Eq. (19), we obtain the three equations 

y" - 3/ -4 y = 3e 2 ', 
y" — 3 y' — Ay = 2 sin f. 


3.5 Nonhomogeneous Equations; Method of Undetermined Coefficients 


179 


and 


y" — 3/ — Ay = — 8e' cos 2 1. 

Solutions of these three equations have been found in Examples 1, 2, and 3, respectively. 
Therefore a particular solution of Eq. (19) is their sum, namely, 


Y(t) = — \e 2 ‘ + T cos t — Yj sinf + cos 2? + sin 2?. 


The procedure illustrated in these examples enables us to solve a fairly large class 
of problems in a reasonably efficient manner. However, there is one difficulty that 
sometimes occurs. The next example illustrates how it arises. 


EXAMPLE 

5 


Find a particular solution of 

y" - 3/ - Ay = 2e~‘. (20) 

Proceeding as in Example 1, we assume that Y (t) = Ae~‘. By substituting in Eq. (20), we 
then obtain 

(A + 3A - AA)e~‘ = 2e~' . (21) 

Since the left side of Eq. (21) is zero, there is no choice of A that satisfies this equation. 
Therefore, there is no particular solution of Eq. (20) of the assumed form. The reason for this 
possibly unexpected result becomes clear if we solve the homogeneous equation 

y" - 3/ - Ay = 0 (22) 

that corresponds to Eq. (20). A fundamental set of solutions of Eq. (22) is yi(t) = e~‘ and 
y 2 (t) — e 4 '. Thus our assumed particular solution of Eq. (20) is actually a solution of the 
homogeneous equation (22); consequently, it cannot possibly be a solution of the nonhomo- 
geneous equation (20). To find a solution of Eq. (20), we must therefore consider functions of 
a somewhat different form. 

At this stage, we have several possible alternatives. One is simply to try to guess the proper 
form of the particular solution of Eq. (20). Another is to solve this equation in some different 
way and then to use the result to guide our assumptions if this situation arises again in the 
future; see Problems 27 and 33 for other solution methods. Still another possibility is to 
seek a simpler equation where this difficulty occurs and to use its solution to suggest how we 
might proceed with Eq. (20). Adopting the latter approach, we look for a first order equation 
analogous to Eq. (20). One possibility is the linear equation 

y' + y = 2e~‘. (23) 

If we try to find a particular solution of Eq. (23) of the form Ae~‘, we will fail because e~‘ is 
a solution of the homogeneous equation / + y = 0. However, from Section 2.1 we already 
know how to solve Eq. (23). An integrating factor is /r(r) = e', and by multiplying by /z(f) and 
then integrating both sides, we obtain the solution 

y = 2te~‘ + ce~‘ . (24) 


The second term on the right side of Eq. (24) is the general solution of the homogeneous 
equation y' + y = 0, but the first term is a solution of the full nonhomogeneous equation (23). 
Observe that it involves the exponential factor e - ' multiplied by the factor t. This is the clue 
that we were looking for. 

We now return to Eq. (20) and assume a particular solution of the form Y (f) = Ate~‘. Then 


Y'(t) = Ae~‘ - Ate~‘, 


(25) 


Y"(t) = -2 Ae~' +Ate~‘. 
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Substituting these expressions for y, y' , and y" in Eq. (20), we obtain 
(-2 A - 3 A)e~‘ + (A + 3A - 4A)te~' = 2 e -‘ . 

Hence — 5zt = 2, so A = —2/5. Thus a particular solution of Eq. (20) is 

Y(t) = -\te-‘. (26) 


The outcome of Example 5 suggests a modification of the principle stated pre- 
viously: if the assumed form of the particular solution duplicates a solution of the 
corresponding homogeneous equation, then modify the assumed particular solution 
by multiplying it by t. Occasionally, this modification will be insufficient to remove 
all duplication with the solutions of the homogeneous equation, in which case it is 
necessary to multiply by t a second time. For a second order equation, it will never 
be necessary to carry the process further than this. 


Summary. We now summarize the steps involved in finding the solution of an initial 
value problem consisting of a nonhomogeneous equation of the form 

ay” + by' + cy = g(t), (27) 

where the coefficients a , b , and c are constants, together with a given set of initial 
conditions: 

1. Find the general solution of the corresponding homogeneous equation. 

2 . Make sure that the function g(t) in Eq. (27) belongs to the class of functions discussed 
in this section; that is, be sure it involves nothing more than exponential functions, sines, 
cosines, polynomials, or sums or products of such functions. If this is not the case, use the 
method of variation of parameters (discussed in the next section). 

3 . If g(t) = gi {t) + • • • + g n (t), that is, if g(t) is a sum of n terms, then form n subproblems, 
each of which contains only one of the terms g\(t), . . . ,g„(t). The ith subproblem consists 
of the equation 

ay" + by' + cy = g,-(r), 

where i runs from 1 to n. 

4 . For the ith subproblem assume a particular solution Y,(f) consisting of the appropriate 
exponential function, sine, cosine, polynomial, or combination thereof. If there is any 
duplication in the assumed form of Y; (t) with the solutions of the homogeneous equation 
(found in step 1), then multiply Y ; (r) by f, or (if necessary) by t 2 , so as to remove the 
duplication. See Table 3.5.1. 

5 . Find a particular solution Y,(t) for each of the subproblems. Then the sum 

Yi(f) + ■ ■ ■ + Y n (t) is a particular solution of the full nonhomogeneous equation (27). 

6. Form the sum of the general solution of the homogeneous equation (step 1) and the 
particular solution of the nonhomogeneous equation (step 5). This is the general solution 
of the nonhomogeneous equation. 

7 . Use the initial conditions to determine the values of the arbitrary constants remaining in 
the general solution. 

For some problems this entire procedure is easy to carry out by hand, but in many 
cases it requires considerable algebra. Once you understand clearly how the method 
works, a computer algebra system can be of great assistance in executing the details. 
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TABLE 3.5.1 The Particular Solution of ay" + by' + cy = gi(t) 


gift) 

Yj(t) 

P„(t) = a 0 t n + flif" -1 -I 1 - a n 

f(A 0 t n +A 1 t n ~ 1 + ■ 

■ ■ + A n ) 

P n (t)e°" 

t s (A 0 r + A x t n ~ x + • 

■■+A n )e al 

t { sin Bt 

Pn(t)e at 

d[ (A 0 t n + A x t n ~ l + 

■ ■ ■ + A n )e a ‘ cos f)t 

1 cos f)t 

+ (B 0 t n + B x t n 

- 1 + • • • + B n )e at sin /St] 


Notes. Here s is the smallest nonnegative integer (s = 0, 1, or 2) that will ensure that no 
term in y,(t) is a solution of the corresponding homogeneous equation. Equivalently, 
for the three cases, s is the number of times 0 is a root of the characteristic equation, a is 
a root of the characteristic equation, and a + if) is a root of the characteristic equation, 
respectively. 


The method of undetermined coefficients is self-correcting in the sense that if you 
assume too little for Y (f), then a contradiction is soon reached that usually points the 
way to the modification that is needed in the assumed form. On the other hand, if you 
assume too many terms, then some unnecessary work is done and some coefficients 
turn out to be zero, but at least the correct answer is obtained. 

Proof of the Method of Undetermined Coefficients. In the preceding discussion we have de- 
scribed the method of undetermined coefficients on the basis of several examples. 
To prove that the procedure always works as stated, we now give a general argu- 
ment, in which we consider several cases corresponding to different forms for the 
nonhomogeneous termg(f). 

git) = P„{t) = + flif" -1 + • • • + a„. In this case Eq. (27) becomes 

ay" + by ' + cy = aot" + a\t"~ l + ••• + «„. (28) 

To obtain a particular solution, we assume that 

Y (t) = Aot" A\t n 1 + ■ ■ • + An-jt" + A n -\t + A n . (29) 

Substituting in Eq. (28), we obtain 

a[n(n — Y)Aot" " + ••■ + 2A n _2\ + b(nAot" 1 + ■ • • + A n - 1 ) 

+ c(Aot” A\t n 1 + • • • -Y A n ) — a$t n + • • • + a n . (30) 

Equating the coefficients of like powers of t gives 

cA 0 = a 0 , 
cA\ + nbAo = fli, 


cAyi T bA n -\ -f* 2aA n —2 — . 

Provided that c ^ 0, the solution of the first equation is Aq = ao/c, and the remaining 
equations determine A\,...,A n successively. Ifc = 0butb^0, then the polynomial 
on the left side of Eq. (30) is of degree n — 1, and we cannot satisfy Eq. (30). To be 
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sure that aY"(t ) + bY'(t ) is a polynomial of degree n, we must choose Y(t) to be a 
polynomial of degree n + 1. Hence we assume that 

Y (?) = t(Aot n + • • • + A n ). 

There is no constant term in this expression for Y (?), but there is no need to include 
such a term since a constant is a solution of the homogeneous equation when c = 0. 
Since b ^ 0, we have Aq — ao/b(n + 1), and the other coefficients A\, ...,A n can be 
determined similarly. If both c and b are zero, we assume that 

Y ( t ) = t~(Aot n + • ■ ■ + A n ). 

The term aY"(t) gives rise to a term of degree n, and we can proceed as before. 
Again the constant and linear terms in Y ( t ) are omitted, since in this case they are 
both solutions of the homogeneous equation. 

g(t) = e at P„(t). The problem of determining a particular solution of 

ay" + by' + cy = e at P„(t) (31) 

can be reduced to the preceding case by a substitution. Let 

Y{t) = e at u{t)\ 

then 

Y'(t) — e a, [u'(t) + a«(()] 

and 

Y"(t) = e at [u"(t) + 2a u!{t) + a 2 u(f)]. 

Substituting for y,y', and y" in Eq. (31), canceling the factor e at , and collecting terms, 
we obtain 

au"(t) + (2 aa + b)u'{t) + (aa 2 + ba + c)u(t ) = P n (t). (32) 

The determination of a particular solution of Eq. (32) is precisely the same problem, 
except for the names of the constants, as solving Eq. (28). Therefore, if aa 1 + ba + c 

is not zero, we assume that u{t) — Aot n H + A n \ hence a particular solution of 

Eq. (31) is of the form 

Y(t) = e at (A 0 f + AY"- 1 + • • • + A n ). (33) 

On the other hand, if aa 2 + ba + c is zero but 2 aa + b is not, we must take u(t ) 

to be of the form t(Aot " H +A„). The corresponding form for Y(t) is t times 

the expression on the right side of Eq. (33). Note that if aa 2 + ba + c is zero, then 
e at is a solution of the homogeneous equation. If both aa 2 + ba + c and 2 aa + b 
are zero (and this implies that both e at and te at are solutions of the homogeneous 
equation), then the correct form for u(t) is t 2 (A 0 t n + • • • + A n ). Hence Y (?) is t 2 times 
the expression on the right side of Eq. (33). 

g(t) = e a, P„(t) cos fit or e at P„(t) sin /it. These two cases are similar, so we con- 
sider only the latter. We can reduce this problem to the preceding one by noting that, 
as a consequence of Euler’s formula, sin jit = (e‘P‘ — e~'^)/2i. Hence g(t) is of the 
form 

e (a+ift)t _ e (a—ift)t 

g(t) = Pn(t) 


2 i 
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and we should choose 

Y(t) = e (a+ip) \A Q t n + ...+A n ) + e^- m \B 0 t n + ■ ■ • + B„), 
or, equivalently, 

Y ( t ) = e at (A 0 t n + f- A n ) cos fit + e^iBot" H + B„) sin fit. 

Usually, the latter form is preferred. If a ± if satisfy the characteristic equation 
corresponding to the homogeneous equation, we must, of course, multiply each of 
the polynomials by t to increase their degrees by one. 

If the nonhomogeneous function involves both cos ft and sin ft, it is usually con- 
venient to treat these terms together, since each one individually may give rise to the 
same form for a particular solution. For example, if g(t) — t sin t + 2 cos t, the form 
for Y (t) would be 

Y(t) = ( Aot + A\) sinf + ( Btf + B\) cos t, 
provided that sin t and cos t are not solutions of the homogeneous equation. 


PROBLEMS 


In each of Problems 1 through 12 find the general solution of the given differential equation. 

1. y" - 2 y -3 y = 3e 2t 2. y" + 2/ + 5y = 3 sin2 1 

3. y" - 2yf -3 y= -3 te~‘ 4. y" + 2/ = 3 + 4 sin 2 1 

5. y" + 9 y = t 2 e 3 ' + 6 6. y" + 2y' + y = 2e~‘ 

7. 2y" + 3/ + y = t 2 + 3 sin t 8. y" + y = 3 sin 2 1 + t cos 2t 

9. ll" + (WqM = COSwf, CD 2 7 ^ COq 10. ll" + OJqM = COS (Dot 


11. y" + y' + 4y = 2 sinh t 

12. y" —y' — 2y = cosh 2 1 


Hint: sinhf = (e ( — e ')/2 
Hint: coshf = (e' + e~‘)/2 


In each of Problems 13 through 18 find the solution of the given initial value problem. 

13. y" +y' - 2y = 2 1, y(0) = 0, / (0) = 1 

14. y" + 4 y = t 2 4- 3e', y(0) = 0, /( 0) = 2 

15. y" — 2y'+y = te‘ + 4, y(0) = 1, /(0) = 1 

16. y" - 2 y -3 y = 3 te 2 ‘, y( 0) = 1, y'(0) = 0 

17. y" + Ay = 3 sin 2 1, y( 0) = 2, /(0) = -1 

18. y" + 2 / + 5y = 4e~‘ cos 2 1, y (0) = 1, / (0) = 0 

In each of Problems 19 through 26: 

(a) Determine a suitable form for Y ( t ) if the method of undetermined coefficients is to be 
used. 

(b) Use a computer algebra system to find a particular solution of the given equation. 

19. y" + 3/ = 2r 4 + t 2 e~ 3 ‘ + sin3t 

20. y" + y = t (1 + sin t) 

21. y" — 5 y' +6y — e‘ cos 2 1 + e 2, (3 1 + 4) sin t 

22. y" + 2/ + 2y — 3e~‘ + 2<U cos t + 4e~'f 2 sin t 

23. y" - 4 ■/ + 4 y = 2 1 2 + 4te 2 ' + t sin 2 1 

24. y" + 4 y = t 1 sin 2 1 + (6 1 + 7) cos 2 1 

25. y" + 3/ + 2y = e'(t 2 + 1) sin 2f + 3e - ' cos t + 4e' 
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26. y" + 2/ + 5y = 3 te ' cos 2? — 2 te 2 ‘ cos ? 

27. Consider the equation 

y" - 3 / - 4 y = 2e~‘ (i) 

from Example 5 . Recall that vi (?) = e~‘ and y 2 (t) = e 4 ' are solutions of the corresponding 
homogeneous equation. Adapting the method of reduction of order (Section 3.4), seek a 
solution of the nonhomogeneous equation of the form Y (t) = v(t)yi(t) = v(t)e~', where 
v (?) is to be determined. 

(a) Substitute Y(?), Y'(r), and Y"(?) into Eq. (i) and show that v(t) must satisfy 
v" - 5v' = 2. 

(b) Let w(t) = v'(t) and show that w(t) must satisfy w' — 5w — 2. Solve this equation 
for w(t). 

(c) Integrate w(t) to find v(t) and then show that 

Y (?) = - + ^cie 4 ' + c 2 e~‘. 

The first term on the right side is the desired particular solution of the nonhomogeneous 
equation. Note that it is a product of ? and e~‘. 

28. Determine the general solution of 


N 

y" + k 2 y — ^2 a m sin mnt, 

m = 1 


where k > 0 and k / mn for m — 1, . . . , N . 

29. In many physical problems the nonhomogeneous term may be specified by different for- 
mulas in different time periods. As an example, determine the solution y = 0 (?) of 


y" + y = 



0 < t < n, 
t > n. 


satisfying the initial conditions y(0) = 0 and y'(0) = 1. Assume that y and y’ are also 
continuous at ? = n. Plot the nonhomogeneous term and the solution as functions of time. 
Hint: First solve the initial value problem for t <: r; then solve for ? > jr, determining the 
constants in the latter solution from the continuity conditions at t = it. 

30. Follow the instructions in Problem 29 to solve the differential equation 


? + 2? + 5y= 


0 < ? < jr/2, 
t > n /2 


with the initial conditions y(0) = 0 and y'(0) = 0. 

Behavior of Solutions as t — ► oo. In Problems 31 and 32 we continue the discussion started 
with Problems 38 through 40 of Section 3.4. Consider the differential equation 


ay" + by' + cy = g(?). 


(i) 


where a, b , and c are positive. 

31. If YT (?) and Y 2 (?) are solutions of Eq. (i), show that Yi(?) — Y 2 (?) — > 0 as ? — ► oo. Is this 
result true if b = 0? 

32. If g(?) = d, a constant, show that every solution of Eq. (i) approaches d/c as t -> oo. What 
happens if c = 0? What if b = 0 also? 
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33. In this problem we indicate an alternative procedure 7 for solving the differential equation 

y" + by' + cy = (D 2 + bD + c)y = g(f), (i) 

where b and c are constants, and D denotes differentiation with respect to t. Let r\ and r 2 
be the zeros of the characteristic polynomial of the corresponding homogeneous equation. 
These roots may be real and different, real and equal, or conjugate complex numbers. 

(a) Verify that Eq. (i) can be written in the factored form 

( D - n)(D - r 2 )y = g(t), 


where r\ + r 2 = — b and r\r 2 = c. 

(b) Let u = (D — r 2 )y. Then show that the solution of Eq (i) can be found by solving the 
following two first order equations: 

CD - n)u = g(t), (D - r 2 )y = u{t). 


In each of Problems 34 through 37 use the method of Problem 33 to solve the given differential 
equation. 

34. y" — 3/ — 4y = 3e 2 ' (see Example 1) 

35. 2y" + 3/ + y = t 1 + 3 sin t (see Problem 7) 

36. y" + 2/ + y = 2e~' (see Problem 6) 

37. y" + 2/ = 3 + 4 sin 2 1 (see Problem 4) 


3.6 Variation of Parameters 

In this section we describe another method of finding a particular solution of a non- 
homogeneous equation. This method, known as variation of parameters, is due to 
Lagrange and complements the method of undetermined coefficients rather well. 
The main advantage of variation of parameters is that it is a general method ; in 
principle at least, it can be applied to any equation, and it requires no detailed as- 
sumptions about the form of the solution. In fact, later in this section we use this 
method to derive a formula for a particular solution of an arbitrary second order 
linear nonhomogeneous differential equation. On the other hand, the method of 
variation of parameters eventually requires us to evaluate certain integrals involving 
the nonhomogeneous term in the differential equation, and this may present diffi- 
culties. Before looking at this method in the general case, we illustrate its use in an 
example. 

Find a particular solution of 

I EXAMPLE y" + 4y = 3 esc t. (1) 

1 Observe that this problem is not a good candidate for the method of undetermined coeffi- 

cients, as described in Section 3.5, because the nonhomogeneous term g(t) = 3 esc t involves 


7 R. S. Luthar, “Another Approach to a Standard Differential Equation," Two Year College Mathematics 
Journal 10 (1979), pp. 200-201; also see D. C. Sandell and F. M. Stein, “Factorization of Operators of 
Second Order Linear Homogeneous Ordinary Differential Equations,” Two Year College Mathematics 
Journal 8 (1977), pp. 132-141, for a more general discussion of factoring operators. 
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a quotient (rather than a sum or a product) of sin t or cos t. Therefore, we need a different 
approach. Observe also that the homogeneous equation corresponding to Eq. (1) is 

y" + 4v = 0, (2) 

and that the general solution of Eq. (2) is 

y c ( t ) = cj cos 2 1 + c 2 sin 2 1. (3) 

The basic idea in the method of variation of parameters is to replace the constants cq and c 2 
in Eq. (3) by functions and u 2 {t), respectively, and then to determine these functions so 
that the resulting expression 


y = u\ (t) cos 2t + u 2 (t) sin 2t (4) 

is a solution of the nonhomogeneous equation (1). 

To determine and u 2 , we need to substitute for y from Eq. (4) in Eq. (1). However, even 
without carrying out this substitution, we can anticipate that the result will be a single equation 
involving some combination of u\, u 2 , and their first two derivatives. Since there is only one 
equation and two unknown functions, we can expect that there are many possible choices of 
ui and u 2 that will meet our needs. Alternatively, we may be able to impose a second condition 
of our own choosing, thereby obtaining two equations for the two unknown functions Ui and 
u 2 . We will soon show (following Lagrange) that it is possible to choose this second condition 
in a way that makes the computation markedly more efficient. 

Returning now to Eq. (4), we differentiate it and rearrange the terms, thereby obtaining 

y' = — 2«i ( t ) sin 2? + 2u 2 ( t ) cos 2 1 + u[ ( t ) cos 2 1 + u 2 ( t ) sin 2t . (5) 

Keeping in mind the possibility of choosing a second condition on ui and u 2 , let us require the 
sum of the last two terms on the right side of Eq. (5) to be zero; that is, we require that 

m'i ( t ) cos 2 1 + u 2 (t) sin 2t = 0. (6) 

It then follows from Eq. (5) that 

y' = —2 ui (0 sin 2 1 + 2 u 2 (t) cos 2 1. (7) 


Although the ultimate effect of the condition (6) is not yet clear, at the very least it has simplified 
the expression for y ' . Further, by differentiating Eq. (7), we obtain 


y" = — 4ni (r) cos 2 1 — 4«2 (f) sin 2 1 — 2 u\ (t) sin 2 1 + 2 u' 2 (t) cos 2 1. (8) 

Then, substituting for y and y" in Eq. (1) from Eqs. (4) and (8), respectively, we find that u 2 
and u 2 must satisfy 

—2 u\ (r) sin 2 1 + 2 u 2 (/) cos 2r = 3 esc t . (9) 


Summarizing our results to this point, we want to choose ii\ and u 2 so as to satisfy Eqs. (6) 
and (9). These equations can be viewed as a pair of linear algebraic equations for the two 
unknown quantities u\ (t) and u' 2 (t). Equations (6) and (9) can be solved in various ways. For 
example, solving Eq. (6) for u 2 (t ), we have 


u 2 (t) = —u[(t) 


cos 2 1 
sin 2 1 


( 10 ) 


Then, substituting for u 2 (t) in Eq. (9) and simplifying, we obtain 

3 esc t sin 2 1 


u[(t) = 


2 


= —3 cost. 


( 11 ) 
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Further, putting this expression for u[ ( t ) back in Eq. (10) and using the double-angle formulas, 
we find that 


, 3 cost cos 2t 3(1 — 2 sin 2 1) 3 

u 2 (t) = — = — = - esc f — 3 sin f . 

sin2f 2sinr 2 


( 12 ) 


Flaving obtained nj(t) and u' 2 (t), we next integrate so as to find wi(t) and U 2 (t). The result is 


hi (t) = —3 sinf + ci 


and 


u 2 (t) = | In | csct — cot t\ + 3 cost + c 2 . 

On substituting these expressions in Eq. (4), we have 

y = —3 sin t cos 2t + | In | esc t — cot t \ sin 2t + 3 cos t sin 2 1 
+ ci cos 2 1 + c 2 sin 2 1. 


(13) 

(14) 


Finally, by using the double-angle formulas once more, we obtain 

y = 3 sin t 4- | In | esc t — cot t\ sin 2 1 + ci cos 2 1 + c 2 sin 2 1. (15) 

The terms in Eq. (15) involving the arbitrary constants ci and c 2 are the general solution of the 
corresponding homogeneous equation, while the remaining terms are a particular solution of 
the nonhomogeneous equation (1). Thus Eq. (15) is the general solution of Eq. (1). 


In the preceding example the method of variation of parameters worked well in 
determining a particular solution, and hence the general solution, of Eq. (1). The next 
question is whether this method can be applied effectively to an arbitrary equation. 
Therefore we consider 

y" + p(t)y' + q(t)y = g(t), (16) 

where p , q, and g are given continuous functions. As a starting point, we assume that 
we know the general solution 

y c (t) = c\y\(t) + c 2 y 2 (0 (17) 

of the corresponding homogeneous equation 

f+p(t)y' + q(t)y = 0. (18) 

This is a major assumption because so far we have shown how to solve Eq. (18) only if 
it has constant coefficients. If Eq. (18) has coefficients that depend on t, then usually 
the methods described in Chapter 5 must be used to obtain y c (t). 

The crucial idea, as illustrated in Example 1, is to replace the constants c\ and c 2 
in Eq. (17) by functions u\(t) and m 2 (0, respectively; this gives 

y = «i (t)y\ (t) + U 2 (t)y 2 (t). (19) 

Then we try to determine u\ ( t ) and u 2 ( t ) so that the expression in Eq. (19) is a solution 
of the nonhomogeneous equation (16) rather than the homogeneous equation (18). 
Thus we differentiate Eq. (19), obtaining 

y 1 = «j (t)y\ it) + u\ (f)yj it) + u’ 2 (t)y 2 {t) + u 2 (t)y 2 (t). (20) 

As in Example 1, we now set the terms involving u\ (t) and u' 2 (t) in Eq. (20) equal to 
zero; that is, we require that 


u\(t)yi (t) + u' 2 (t)y 2 (t) = 0. 


( 21 ) 
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Then, from Eq. (20), we have 

/ = Mi(0/i(0 + u 2 (t)y' 2 (t). (22) 

Further, by differentiating again, we obtain 

y" = u\ (t)y\ (t) + u 2 (t)y"(t) + u' 2 (t)y 2 (t) + u 2 (t)y 2 (t). (23) 

Now we substitute for y , y ' , and y" in Eq. (16) from Eqs. (19), (22), and (23), 
respectively. After rearranging the terms in the resulting equation, we find that 

ui(t)\y"(t) + p(t)y[(t) + qiOy^t)] 

+ u 2 (t)[y 2 {t) + p(t)y 2 (t) + q{t)y 2 {t)] 

+ u'\ (t)y\ (t) + u 2 (t)y' 2 (t) = g(t). (24) 

Each of the expressions in square brackets in Eq. (24) is zero because both y i and y 2 
are solutions of the homogeneous equation (18). Therefore Eq. (24) reduces to 

u\ (O/i (f) + u' 2 (t)y' 2 {t) = g(t). (25) 


Equations (21) and (25) form a system of two linear algebraic equations for the 
derivatives u\ (t) and u' 2 (t) of the unknown functions. They correspond exactly to 
Eqs. (6) and (9) in Example 1. 

By solving the system (21), (25) we obtain 


u\(t) = - 


yiiOgjt) 

W(yi,y 2 )(0 ’ 


u 2 (t) 


yi(t)g(t) 
W(yi,y 2 )(t) ’ 


(26) 


where W(yi,y 2 ) is the Wronskian of vi and y 2 . Note that division by W is permissible 
since y\ and y 2 are a fundamental set of solutions, and therefore their Wronskian is 
nonzero. By integrating Eqs. (26), we find the desired functions u\(t) and u 2 {t), 
namely, 


itiit) = 


i 


yi(t)g{t) 

W(y u y 2 )(t) 


dt T c \ , 


u 2 (t) — 


i 


yi(t)g{t) 

W{yi,y 2 )(t) 


dt + c 2 . 


(27) 


If the integrals in Eqs. (27) can be evaluated in terms of elementary functions, then we 
substitute the results in Eq. (19), thereby obtaining the general solution of Eq. (16). 
More generally, the solution can always be expressed in terms of integrals, as stated 
in the following theorem. 


If the functions p , q , and g are continuous on an open interval /, and if the functions 
yi and y 2 are a fundamental set of solutions of the homogeneous equation (18) 
corresponding to the nonhomogeneous equation (16) 

y" + p{t)y' + q(t)y = g(t). 


then a particular solution of Eq. (16) is 

y 2 (s)g(s) 


Y(t ) = -yi(f) 


f 

J tf) 


t 0 W(y l ,y 2 )(.'0 


ds + y 2 (t) 


f 

J to 


yi(s)g(s) 
to W(y 1 ,y 2 )(s) 


ds , 


(28) 
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where to is any conveniently chosen point in /. The general solution is 

y = ciyi (f) + c 2 y 2 (0 + Y (0, (29) 

as prescribed by Theorem 3.5.2. 


By examining the expression (28) and reviewing the process by which we derived 
it, we can see that there may be two major difficulties in using the method of variation 
of parameters. As we have mentioned earlier, one is the determination of y\(t) and 
y 2 (t), a fundamental set of solutions of the homogeneous equation (18), when the 
coefficients in that equation are not constants. The other possible difficulty lies in 
the evaluation of the integrals appearing in Eq. (28). This depends entirely on the 
nature of the functions y\, y 2 , and g. In using Eq. (28), be sure that the differential 
equation is exactly in the form (16); otherwise, the nonhomogeneous term g(t) will 
not be correctly identified. 

A major advantage of the method of variation of parameters is that Eq. (28) pro- 
vides an expression for the particular solution Y (t) in terms of an arbitrary forcing 
function g{t). This expression is a good starting point if you wish to investigate the 
effect of variations in the forcing function, or if you wish to analyze the response of 
a system to a number of different forcing functions. 


PROBLEMS 


In each of Problems 1 through 4 use the method of variation of parameters to find a particular 
solution of the given differential equation. Then check your answer by using the method of 
undetermined coefficients. 

1. y" - 5/ + 6 y = 2e‘ 2. y" - / - 2y = 2e~‘ 

3. y" + 2/ + y = 3e~‘ 4. 4/' - 4 y' + y = 16e ,/z 


In each of Problems 5 through 12 find the general solution of the given differential equation. 
In Problems 11 and 12, g is an arbitrary continuous function. 


5. y" + y = tan t, 0 < t < n /2 
1. y" + 4/ + 4 y = r 2 e~ 2 ‘, t > 0 
9. Ay" + y = 2 sec(t/2), — n < t < n 

11. y" - 5/ + 6y = g{t) 


6. y" + 9y = 9 sec 2 3f, 0 < t < tc / 6 

8. y" + Ay = 3 esc 2 1, 0 < t < n/2 

10. y" -2 y' + y = e'/(l + t 2 ) 

12. y" + Ay = g(t) 


In each of Problems 13 through 20 verify that the given functions yi and y 2 satisfy the corre- 
sponding homogeneous equation; then find a particular solution of the given nonhomogeneous 
equation. In Problems 19 and 20, g is an arbitrary continuous function. 

13. t 2 y" — 2y = 3f 2 — 1, t > 0; yi(t) = t 2 , y 2 (t) = t~ l 

14. t 2 y" — t(t + 2)y' + (t + 2)y = 2t 3 , t > 0; y 1 (t) = t , y 2 (t) = te‘ 

15. ty" - (1 + t)y' + y = t 2 e 2 ‘, t > 0; yi(t) = 1 + t, y 2 (t ) = e' 

16. (1 - t)y" + ty' - y = 2(t - l) 2 e~', 0 < t < 1; y 1 (t) = e‘, y 2 (t) = t 

17. x 2 y" — 3x / + Ay = x 2 ln.r, x > 0; yi(x) = x 2 , y 2 {x) = x 2 lnx 

18. x 2 y" + xy' + (x 2 — 0.25)y = 3x 3/2 sinx, x > 0; 
yi(x) = x _1/2 sinx, y 2 (x) = x _1/2 cosx 

19. (1 - x)y" + xy' - y = g(x), 0 < x < 1; y 1 (x) = e x , y 2 (x) = x 

20. x 2 y" + xy' + (x 2 — 0.25)y = g(x), x > 0; yi(x) = x -1/2 sinx, y 2 (x) = x _1/2 cosx 
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21. Show that the solution of the initial value problem 

L[y] = y" + p(t)y' + q(t)y = g(f), y(!o) = yo, y'(fo) = yo (i) 

can be written as y = u(t) + v(t). where u and v are solutions of the two initial value 
problems 

L\u\ = 0, K(fo) = yo, u'(t 0 ) = y' 0 , (ii) 

L[v] = g(f), w(/ 0 ) = 0, t >'(,t 0 ) = 0, (hi) 

respectively. In other words, the nonhomogeneities in the differential equation and in 
the initial conditions can be dealt with separately. Observe that u is easy to find if a 
fundamental set of solutions of L[u\ = 0 is known. 

22. By choosing the lower limit of integration in Eq. (28) in the text as the initial point t 0 , 
show that Y(t) becomes 


Y(t) = 


I" yi(s)y 2 (t) -yi(t)y 2 (s) 

*t 0 yiWy-iO) -y[(s)y2(s) 


g(s) ds. 


Show that Y (/) is a solution of the initial value problem 

L[y] = g(t), y(k>) = 0, y'(fo) = 0. 

Thus y can be identified with v in Problem 21. 

23. (a) Use the result of Problem 22 to show that the solution of the initial value problem 


is 


y" +y = g(t), y(k ) = 0, y'(f 0 ) = 0 


(i) 


y= f si 

J to 


sin(f — s)g(s) ds. 


(ii) 


(b) Use the result of Problem 21 to find the solution of the initial value problem 

y" + y = g(t), y(0) = yo, y'(0) = y' 0 . 


24. Use the result of Problem 22 to find the solution of the initial value problem 

L[y] = (D — a)(D - b)y = g(t), y(t 0 ) = 0, /(to) = 0, 

where a and b are real numbers with a / b. 

25. Use the result of Problem 22 to find the solution of the initial value problem 

L[y] = [D 1 - 2 XD + (X 2 + fi 2 )]y = g(t), y(t 0 ) = 0, y'(f 0 ) = 0. 

Note that the roots of the characteristic equation are X ± ifi. 

26. Use the result of Problem 22 to find the solution of the initial value problem 

L[y] = (D - a) 2 y = g(t), y(t 0 ) = 0, y'(t 0 ) = 0, 


where a is any real number. 
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27. By combining the results of Problems 24 through 26, show that the solution of the initial 
value problem 

L\y\ = (D 2 + bD + c)y = g{t), yflo) = 0, y\t 0 ) = 0, 
where b and c are constants, has the form 

y — 0(0 — f K(t — s)g(s) ds. (i) 

JtQ 

The function K depends only on the solutions yi andy 2 of the corresponding homogeneous 
equation and is independent of the nonhomogeneous term. Once K is determined, all 
nonhomogeneous problems involving the same differential operator L are reduced to the 
evaluation of an integral. Note also that although K depends on both t and s, only the 
combination t — s appears, so K is actually a function of a single variable. When we think 
of g(t) as the input to the problem and of (t) as the output, it follows from Eq. (i) that the 
output depends on the input over the entire interval from the initial point t 0 to the current 
value t. The integral in Eq. (i) is called the convolution of K and g, and K is referred to as 
the kernel. 

28. The method of reduction of order (Section 3.4) can also be used for the nonhomogeneous 
equation 

y" + p(t)y' + q(t)y = g(f), (i) 

provided one solution y\ of the corresponding homogeneous equation is known. Let 
y = v(t)yi(t) and show that y satisfies Eq. (i) if v is a solution of 

yi(t)v” + [2 y'^t) + p(f)yi(0V = g(t). (ii) 

Equation (ii) is a first order linear equation for v'. Solving this equation, integrating the 
result, and then multiplying by y\ (t) lead to the general solution of Eq. (i). 

In each of Problems 29 through 32 use the method outlined in Problem 28 to solve the given 
differential equation. 

29. t 2 y" — 2ty’ + 2y — 4t 2 , t > 0; y, (t) = t 

30. t 2 y" + 7ty' + 5y = t, t > 0; yi{t) = r 1 

31. ty" — (1 + t)y' + y = t 2 e 2 ‘, t > 0; yi(t) = 1 + t (see Problem 15) 

32. (1 — t)y" + ty 1 — y = 2 (t — V) 2 e~‘, 0 < t < 1; yi(t) = e‘ (see Problem 16) 


3.7 Mechanical and Electrical Vibrations 


One of the reasons why second order linear equations with constant coefficients 
are worth studying is that they serve as mathematical models of some important 
physical processes. Two important areas of application are the fields of mechanical 
and electrical oscillations. For example, the motion of a mass on a vibrating spring, 
the torsional oscillations of a shaft with a flywheel, the flow of electric current in 
a simple series circuit, and many other physical problems are all described by the 
solution of an initial value problem of the form 

ay" + by' + cy = g(t), y(0) = y 0 , y'(0) = yo- 


( 1 ) 
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This illustrates a fundamental relationship between mathematics and physics: 
many physical problems may have the same mathematical model. Thus, once we 
know how to solve the initial value problem (1), it is only necessary to make appro- 
priate interpretations of the constants a, b, and c, and of the functions y and g, to 
obtain solutions of different physical problems. 

We will study the motion of a mass on a spring in detail because an understanding 
of the behavior of this simple system is the first step in the investigation of more 
complex vibrating systems. Further, the principles involved are common to many 
problems. Consider a mass m hanging on the end of a vertical spring of original 
length l, as shown in Figure 3.7.1. The mass causes an elongation L of the spring in 
the downward (positive) direction. There are two forces acting at the point where 
the mass is attached to the spring; see Figure 3.7.2. The gravitational force, or weight 
of the mass, acts downward and has magnitude mg, where g is the acceleration due to 
gravity. There is also a force F s , due to the spring, that acts upward. If we assume that 
the elongation L of the spring is small, the spring force is very nearly proportional 
to L; this is known as Flooke’s 8 law. Thus we write F, = — kL, where the constant of 
proportionality k is called the spring constant, and the minus sign is due to the fact 
that the spring force acts in the upward (negative) direction. Since the mass is in 
equilibrium, the two forces balance each other, which means that 


mg — kL = 0. (2) 

For a given weight w = mg, one can measure L and then use Eq. (2) to determine k. 
Note that k has the units of force/length. 



m ' r 
u 


FIGURE 3.7.1 A spring-mass system. 

In the corresponding dynamic problem, we are interested in studying the motion 
of the mass when it is acted on by an external force or is initially displaced. Let 
u{t), measured positive downward, denote the displacement of the mass from its 
equilibrium position at time f; see Figure 3.7.1. Then u(t) is related to the forces 
acting on the mass through Newton’s law of motion 

mu" {t) = f{t), (3) 


^Robert Hooke (1635-1703) was an English scientist with wide-ranging interests. His most important 
book, Micrographia, was published in 1665 and described a variety of microscopical observations. Hooke 
first published his law of elastic behavior in 1676 as ceiiinosssttuv ; in 1678 he gave the interpretation ut 
tensio sic vis , which means, roughly, “as the force so is the displacement.” 
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F s = -kL 


o 


w = mg 

FIGURE 3.7.2 Force diagram for a spring-mass system. 


where u" is the acceleration of the mass and / is the net force acting on the mass. 

Observe that both u and / are functions of time. In determining / there are four 

separate forces that must be considered: 

1. The weight u> = mg of the mass always acts downward. 

2. The spring force F s is assumed to be proportional to the total elongation L + u of the 
spring and always acts to restore the spring to its natural position. If L + u > 0, then the 
spring is extended, and the spring force is directed upward. In this case 

F s = —k(L + u). (4) 

On the other hand, if L + u < 0, then the spring is compressed a distance |L + u\, and 
the spring force, which is now directed downward, is given by F s = k\L + u\. However, 
when L + u < 0, it follows that \L + u\ = — (L + a), so F s is again given by Eq. (4). Thus, 
regardless of the position of the mass, the force exerted by the spring is always expressed 
by Eq. (4). 

3. The damping or resistive force F d always acts in the direction opposite to the direction of 
motion of the mass. This force may arise from several sources: resistance from the air or 
other medium in which the mass moves, internal energy dissipation due to the extension or 
compression of the spring, friction between the mass and the guides (if any) that constrain 
its motion to one dimension, or a mechanical device (dashpot) that imparts a resistive 
force to the mass. In any case, we assume that the resistive force is proportional to the 
speed \du/dt\ of the mass: this is usually referred to as viscous damping. If du/dt > 0,then 
u is increasing, so the mass is moving downward. Then F d is directed upward and is given 
by 

Fd(t) = —yu'{t), (5) 

where y is a positive constant of proportionality known as the damping constant. On 
the other hand, if du/dt < 0, then u is decreasing, the mass is moving upward, and F d is 
directed downward. In this case, F d = y\u'(t)\; since \u\t)\ = —u'(t), it follows that F d (t) 
is again given by Eq. (5). Thus, regardless of the direction of motion of the mass, the 
damping force is always expressed by Eq. (5). 

The damping force may be rather complicated, and the assumption that it is modeled 
adequately by Eq. (5) may be open to question. Some dashpots do behave as Eq. (5) 
states, and if the other sources of dissipation are small, it may be possible to neglect 
them altogether or to adjust the damping constant y to approximate them. An important 
benefit of the assumption (5) is that it leads to a linear (rather than a nonlinear) differential 
equation. In turn, this means that a thorough analysis of the system is straightforward, as 
we will show in this section and the next. 

4. An applied external force F(t) is directed downward or upward as F(t) is positive or 
negative. This could be a force due to the motion of the mount to which the spring is 
attached, or it could be a force applied directly to the mass. Often the external force is 
periodic. 
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Taking account of these forces, we can now rewrite Newton’s law (3) as 

mu"(t) = mg + F s (t) + F d (t) + F(t) 

= mg — k[L + «(/)] — yu'{t) + F{t). (6) 

Since mg — kL = 0 by Eq. (2), it follows that the equation of motion of the mass is 

mu" if) + yu'(t) + ku(t) = F(t), (7) 

where the constants m , y, and k are positive. Note that Eq. (7) has the same form as 
Eq. (1). 

It is important to understand that Eq. (7) is only an approximate equation for 
the displacement u(t). In particular, both Eqs. (4) and (5) should be viewed as 
approximations for the spring force and the damping force, respectively. In our 
derivation we have also neglected the mass of the spring in comparison with the 
mass of the attached body. 

The complete formulation of the vibration problem requires that we specify two 
initial conditions, namely, the initial position uq and the initial velocity i>o of the mass: 

u( 0) = m'( 0) = i>o- (8) 

It follows from Theorem 3.2.1 that these conditions give a mathematical problem that 
has a unique solution. This is consistent with our physical intuition that if the mass is 
set in motion with a given initial displacement and velocity, then its position will be 
determined uniquely at all future times. The position of the mass is given (approxi- 
mately) by the solution of Eq. (7) subject to the prescribed initial conditions (8). 


EXAMPLE 

1 


A mass weighing 4 lb stretches a spring 2 in. Suppose that the mass is displaced an additional 
6 in. in the positive direction and then released. The mass is in a medium that exerts a viscous 
resistance of 6 lb when the mass has a velocity of 3 ft/s. Under the assumptions discussed in 
this section, formulate the initial value problem that governs the motion of the mass. 

The required initial value problem consists of the differential equation (7) and initial condi- 
tions (8), so our task is to determine the various constants that appear in these equations. The 
first step is to choose the units of measurement. Based on the statement of the problem, it is 
natural to use the English rather than the metric system of units. The only time unit mentioned 
is the second, so we will measure t in seconds. On the other hand, both the foot and the inch 
appear in the statement as units of length. It is immaterial which one we use, but having made 
a choice, we must be consistent. To be definite, let us measure the displacement u in feet. 

Since nothing is said in the statement of the problem about an external force, we assume 
that F(t) = 0. To determine m, note that 

_ w _ 4 lb _ 1 lb-s 2 

m ~ ~g ~ 32ft7? “ 8 ft 

The damping coefficient y is determined from the statement that yu' is equal to 6 lb when u' 
is 3 ft/s. Therefore 

6 lb _ ? lb-s 
Y ~ 3 ft/s ~ 2 IT' 


The spring constant k is found from the statement that the mass stretches the spring by 2 in., 
or 1/6 ft. Thus 


k = 


41b lb 

= 24 — . 

1/6 ft ft 
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Consequently, Eq. (7) becomes 

| u " + 2m' + 24m = 0, 


or 

u" + 16 m' + 192m = 0. (9) 

The initial conditions are 

m(0) = m'(0) = 0. (10) 

The second initial condition is implied by the word “released” in the statement of the problem, 
which we interpret to mean that the mass is set in motion with no initial velocity. 


Undamped Free Vibrations. If there is no external force, then F(t) = 0 in Eq. (7). Let 
us also suppose that there is no damping, so that y — 0; this is an idealized config- 
uration of the system, seldom (if ever) completely attainable in practice. However, 
if the actual damping is very small, then the assumption of no damping may yield 
satisfactory results over short to moderate time intervals. In this case the equation 
of motion (7) reduces to 

mu" + ku — 0. (11) 

The general solution of Eq. (11) is 

u — A cos wot + B sin coot, (12) 

where 

w q = k/m. (13) 

The arbitrary constants A and B can be determined if initial conditions of the form 
(8) are given. 

In discussing the solution of Eq. (11), it is convenient to rewrite Eq. (12) in the 
form 

u = R cos(®of — <5), (14) 


or 


u = R cos 8 cos a>ot + R sin 8 sin wot . 


(15) 


By comparing Eq. (15) with Eq. (12), we find that A, B , R , and <5 are related by the 
equations 

A = Rcos8, B = l?sin<5. (16) 

Thus 

R = y/A 2 + B 2 , tan8 = B/A. (17) 


In calculating 8, we must take care to choose the correct quadrant; this can be done 
by checking the signs of cos 8 and sin 8 in Eqs. (16). 

The graph of Eq. (14), or the equivalent Eq. ( 12) , for a typical set of initial conditions 
is shown in Figure 3.7.3. The graph is a displaced cosine wave that describes a periodic, 
or simple harmonic, motion of the mass. The period of the motion is 


T = 



&>o 


/ m \ V 2 

\k) 


(18) 


The circular frequency wo = A /k/m , measured in radians per unit time, is called the 
natural frequency of the vibration. The maximum displacement R of the mass from 
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EXAMPLE 

2 


equilibrium is the amplitude of the motion. The dimensionless parameter <5 is called 
the phase, or phase angle, and measures the displacement of the wave from its normal 
position corresponding to 8 = 0. 


u 

_JL R 

R cos 8 






1 



-R 

5 

8 + n 

8 + 2n 

Q) 0 t 






FIGURE 3.7.3 Simple harmonic motion; it = R cos(a>of — 8). 


Note that the motion described by Eq. (14) has a constant amplitude that does not 
diminish with time. This reflects the fact that, in the absence of damping, there is no 
way for the system to dissipate the energy imparted to it by the initial displacement 
and velocity. Further, for a given mass m and spring constant k, the system always 
vibrates at the same frequency to o, regardless of the initial conditions. However, the 
initial conditions do help to determine the amplitude of the motion. Finally, observe 
from Eq. (18) that T increases as m increases, so larger masses vibrate more slowly. 
On the other hand, T decreases as k increases, which means that stiffer springs cause 
the system to vibrate more rapidly. 


Suppose that a mass weighing 10 lb stretches a spring 2 in. If the mass is displaced an additional 
2 in. and is then set in motion with an initial upward velocity of 1 ft/s, determine the position 
of the mass at any later time. Also determine the period, amplitude, and phase of the motion. 

The spring constant is k = 10 lb/2 in.= 60 lb/ft, and the mass is m = w/g = 10/32 lb-s 2 /ft. 
Hence the equation of motion reduces to 

u" + 192;/ = 0, (19) 


and the general solution is 

it = A cos(8-\/3f) + B sin(8\/3t). 

The solution satisfying the initial conditions i/(0) = 1/6 ft and zz'(0) = — 1 ft/s is 


;/ = - cos(8\/3r) sin(8\/3t). 

6 8^3 


( 20 ) 


The natural frequency is a>o = \/l92 = 13.856 rad/s, so the period is T — 2n/u>o = 0.45345 s. 
The amplitude R and phase 8 are found from Eqs. (17). We have 


1 


1 


19 


J^Z 

“ 36 192 “ 576’ 


so R = 0.18162 ft. 
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The second of Eqs. (17) yields tan <5 = — v / 3/4. There are two solutions of this equation, one 
in the second quadrant and one in the fourth. In the present problem cos 8 > 0 and sin 8 < 0, 
so 8 is in the fourth quadrant, namely, 

8 = — arctan(V3/4) = —0.40864 rad. 

The graph of the solution (20) is shown in Figure 3.7.4. 


u 

0.2 

R = 0.182 

A\ 
\ / \ 

u = 0.182 cos(8V3 t + 0.409) 

r\ r\ r\ 

\ \ \ 

1 \ i 

-0.2 

\ / 0 5 \ / V / \ 15 / \ * 

W \J \J XJ v 

I T= 0.453 | 


FIGURE 3.7.4 An undamped free vibration; u" + 192« = 0, w(0) = 1/6, u'( 0) = — 1. 


Damped Free Vibrations. If we include the effect of damping, the differential equation 
governing the motion of the mass is 

mu" + yu' + ku = 0. (21) 


We are especially interested in examining the effect of variations in the damping 
coefficient y for given values of the mass m and spring constant k. The roots of the 
corresponding characteristic equation are 





( 22 ) 


Depending on the sign of y 2 — 4 km, the solution u has one of the following forms: 

y 2 — 4km >0, u = Ae nt + Be r2t \ (23) 

y 2 - 4 km = 0, u — (A + Bt)e~ yt/2m \ (24) 

v/2\l/2 

y 2 — 4km < 0, u — e~ y ‘^ 2m (A cos jit + B sin fit), n = > 0. (25) 

2m 

Since m, y, and k are positive, y 2 — 4km is always less than y 2 . Hence, if 
y 2 — 4 km > 0, then the values of r t and r 2 given by Eq. (22) are negative. If 
y 2 — 4 km < 0, then the values of and r 2 are complex, but with negative real part. 
Thus, in all cases, the solution u tends to zero as t -> oo; this occurs regardless of the 
values of the arbitrary constants A and B , that is, regardless of the initial conditions. 
This confirms our intuitive expectation, namely, that damping gradually dissipates 
the energy initially imparted to the system, and consequently the motion dies out 
with increasing time. 
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The most important case is the third one, which occurs when the damping is small. 
If we let A — R cos S and B = R sin S in Eq. (25), then we obtain 

u — Re~ Ytllm cos(pt — 8). (26) 

The displacement u lies between the curves u = ±Re~ Yt/2m \ hence it resembles a 
cosine wave whose amplitude decreases as t increases. A typical example is sketched 
in Figure 3.7.5. The motion is called a damped oscillation or a damped vibration. 
The amplitude factor R depends on m,y, k , and the initial conditions. 


u 

\ 

R e -n/2 m 

R cos 5 

l 

i i i i 


S 8 + 71 8 + 2n \8+3n pt 


______ 


' X - R e - ytl2m 


FIGURE 3.7.5 Damped vibration; u = Re y,/2m cos (pt — 8). 


Although the motion is not periodic, the parameter /i determines the frequency 
with which the mass oscillates back and forth; consequently, p is called the quasi 
frequency. By comparing p with the frequency &>o of undamped motion, we find that 


p ( 4km — y 2 ) l/1 /2m ^ y 2 ^ ^ y 2 

co 0 y/k/m V 4km J 8km' 


(27) 


The last approximation is valid when y 2 /4km is small; we refer to this situation as 
“small damping.” Thus the effect of small damping is to reduce slightly the frequency 
of the oscillation. By analogy with Eq. (18), the quantity 7)/ = 2n / p is called the 
quasi period. It is the time between successive maxima or successive minima of 
the position of the mass, or between successive passages of the mass through its 
equilibrium position while going in the same direction. The relation between T c i and 
T is given by 


Td _ «o 



(28) 


where again the last approximation is valid when y 2 /4km is small. Thus small damp- 
ing increases the quasi period. 

Equations (27) and (28) reinforce the significance of the dimensionless ratio 
y 2 /4km. It is not the magnitude of y alone that determines whether damping is 
large or small, but the magnitude of y 2 compared to 4 km. When y 2 /4km is small, 
then damping has a small effect on the quasi frequency and quasi period of the mo- 
tion. On the other hand, if we want to study the detailed motion of the mass for all 
time, then we can never neglect the damping force, no matter how small. 
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As y 2 /4km increases, the quasi frequency // decreases and the quasi period Td 
increases. In fact, /r -* Oand Td -> ooasy — > 2 Vkm. As indicated by Eqs. (23), (24), 
and (25), the nature of the solution changes as y passes through the value 2 Vkm. 
This value is known as critical damping, while for larger values of y the motion is said 
to be overdamped. In these cases, given by Eqs. (24) and (23), respectively, the mass 
creeps back to its equilibrium position but does not oscillate about it, as for small y. 
Two typical examples of critically damped motion are shown in Figure 3.7.6, and the 
situation is discussed further in Problems 21 and 22. 



FIGURE 3.7.6 Critically damped motions: u" + u' + 0.25 u = 0; u = (A + Bt)e (/2 . 


EXAMPLE 

3 


The motion of a certain spring-mass system is governed by the differential equation 

u" + 0.125 u + u = 0, (29) 


where u is measured in feet and t in seconds. If w(0) = 2 and u'( 0) = 0, determine the position 
of the mass at any time. Find the quasi frequency and the quasi period, as well as the time 
at which the mass first passes through its equilibrium position. Also find the time r such that 
|w(t)l < 0.1 for all t > r. 

The solution of Eq. (29) is 


u = e 


— f/16 


A cos 


V255 

16 


t + B sin 


V255 ' 
16 1 


To satisfy the initial conditions, we must choose A = 2 and B = 2/V255; hence the solution of 
the initial value problem is 


u = e 


—f/16 


„ V255 

2 cos — — f + 
16 


2 . a/255 \ 

. sin t 

-v/255 16 ) 


32 

a/255 


e -'/ 16 


COS 



(30) 


where tan <5 = 1/V255, so <5 = 0.06254. The displacement of the mass as a function of time is 
shown in Figure 3.7.7. For purposes of comparison, we also show the motion if the damping 
term is neglected. 

The quasi frequency is /r = V255/16 = 0.998 and the quasi period is T d — 2n / /x = 6.295 s. 
These values differ only slightly from the corresponding values (1 and 2n, respectively) for 
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FIGURE 3.7.7 Vibration with small damping (solid curve) and with no damping (dashed 
curve). 



FIGURE 3.7.8 Solution of Example 3; determination of r . 


the undamped oscillation. This is evident also from the graphs in Figure 3.7.7, which rise 
and fall almost together. The damping coefficient is small in this example, only one-sixteenth 
of the critical value, in fact. Nevertheless, the amplitude of the oscillation is reduced rather 
rapidly. Figure 3.7.8 shows the graph of the solution for 40 < t < 60, together with the graphs 
of u = ±0.1. From the graph it appears that r is about 47.5, and by a more precise calculation 
we find that r = 47.5149 s. 

To find the time at which the mass first passes through its equilibrium position, we refer to 
Eq. (30) and set \/255f/16 — S equal to n/2, the smallest positive zero of the cosine function. 
Then, by solving for f, we obtain 

t = — (- + 5 ) = 1.637 s. 

x/255 V2 ) 
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Electric Circuits. A second example of the occurrence of second order linear differen- 
tial equations with constant coefficients is their use as a model of the flow of electric 
current in the simple series circuit shown in Figure 3.7.9. The current /, measured in 
amperes (A), is a function of time t. The resistance R in ohms (£2), the capacitance 
C in farads (F), and the inductance L in henrys (FI) are all positive and are assumed 
to be known constants. The impressed voltage E in volts (V) is a given function of 
time. Another physical quantity that enters the discussion is the total charge Q in 
coulombs (C) on the capacitor at time t. The relation between charge Q and cur- 
rent 1 is 

/ = dQ/dt. (31) 


Resistance R 



Capacitance C 



Inductance L 


Impressed voltage E{t) 

FIGURE 3.7.9 A simple electric circuit. 


The flow of current in the circuit is governed by Kirchhoffs 9 second law: In a 
closed circuit the impressed voltage is equal to the sum of the voltage drops in the rest 
of the circuit. 

According to the elementary laws of electricity, we know that 
The voltage drop across the resistor is IR. 

The voltage drop across the capacitor is Q/C. 

The voltage drop across the inductor is Ldl /dt. 

Flence, by Kirchhoffs law, 

dl i 

L— + RI + -Q = E(t). (32) 

dt C 

The units have been chosen so that 1 volt = 1 ohm • 1 ampere = 1 coulomb/1 farad 
= 1 henry • 1 ampere/1 second. 

Substituting for I from Eq. (31), we obtain the differential equation 

LQ" + RQ' + = E(t) (33) 

for the charge Q. The initial conditions are 

Q(k) = Qo, 8' (to) = I (to) = Io. (34) 


9 Gustav Kirchhoff (1824-1887), professor at Breslau, Heidelberg, and Berlin, was one of the leading 
physicists of the nineteenth century. He discovered the basic laws of electric circuits about 1845 while 
still a student at Konigsberg. He is also famous for fundamental work in electromagnetic absorption and 
emission and was one of the founders of spectroscopy. 
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Thus we must know the charge on the capacitor and the current in the circuit at some 
initial time to. 

Alternatively, we can obtain a differential equation for the current I by differenti- 
ating Eq. (33) with respect to t , and then substituting for dQ/dt from Eq. (31). The 
result is 

LI" + Rr+y = E'(t), (35) 

with the initial conditions 

I (to) — /o, I' (to) = I’o ■ (36) 

From Eq. (32) it follows that 

T , _ E(t 0 ) — Rio — (1/QQo 

■'o — ^ ■ (3i) 

Hence /( is also determined by the initial charge and current, which are physically 
measurable quantities. 

The most important conclusion from this discussion is that the flow of current in 
the circuit is described by an initial value problem of precisely the same form as 
the one that describes the motion of a spring-mass system. This is a good example 
of the unifying role of mathematics: Once you know how to solve second order 
linear equations with constant coefficients, you can interpret the results in terms of 
mechanical vibrations, electric circuits, or any other physical situation that leads to 
the same problem. 


PROBLEMS In each of Problems 1 through 4 determine a> 0 , R , and & so as to write the given expression in 

the form u = R cospao t — S ). 

1. u = 3 cos 2r + 4 sin 2t 2. u = — cos t + V3 sin t 

3. u = 4cos3t — 2sin3f 4. u — — 2 coin t — 3sin7rf 

4fl 5. A mass weighing 2 lb stretches a spring 6 in. If the mass is pulled down an additional 3 in. 

and then released, and if there is no damping, determine the position u of the mass at any 

time t. Plot u versus t. Find the frequency, period, and amplitude of the motion. 

6. A mass of 100 g stretches a spring 5 cm. If the mass is set in motion from its equilibrium posi- 
tion with a downward velocity of 10 cm/s, and if there is no damping, determine the position 
u of the mass at any time t. When does the mass first return to its equilibrium position? 

7. A mass weighing 3 lb stretches a spring 3 in. If the mass is pushed upward, contracting 
the spring a distance of 1 in., and then set in motion with a downward velocity of 2 ft/s, 
and if there is no damping, find the position u of the mass at any time t. Determine the 
frequency, period, amplitude, and phase of the motion. 

8. A series circuit has a capacitor of 0.25 x 10~ * 1 * * * * 6 7 8 9 F and an inductor of 1 H. If the initial charge 
on the capacitor is 10~ 6 C and there is no initial current, find the charge Q on the capacitor 
at any time t. 

9. A mass of 20 g stretches a spring 5 cm. Suppose that the mass is also attached to a vis- 
cous damper with a damping constant of 400 dyn-s/cm. If the mass is pulled down an 
additional 2 cm and then released, find its position u at any time t. Plot u versus t. Deter- 
mine the quasi frequency and the quasi period. Determine the ratio of the quasi period 
to the period of the corresponding undamped motion. Also find the time r such that 
\u{t)\ < 0.05 cm for all t > r. 
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4fl 10 - A mass weighing 16 lb stretches a spring 3 in. The mass is attached to a viscous damper 
with a damping constant of 2 lb-s/ft. If the mass is set in motion from its equilibrium 
position with a downward velocity of 3 in/s, find its position u at any time t. Plot u versus 
t. Determine when the mass first returns to its equilibrium position. Also find the time r 
such that \u{t)\ < 0.01 in for all t > r. 

11. A spring is stretched 10 cm by a force of 3 N. A mass of 2 kg is hung from the spring and 
is also attached to a viscous damper that exerts a force of 3 N when the velocity of the 
mass is 5 m/s. If the mass is pulled down 5 cm below its equilibrium position and given an 
initial downward velocity of 10 cm/s, determine its position u at any time t. Find the quasi 
frequency /r and the ratio of fi to the natural frequency of the corresponding undamped 
motion. 

12. A series circuit has a capacitor of 10 -5 F, a resistor of 3 x 10 2 £2, and an inductor of 0.2 H. 
The initial charge on the capacitor is 1CF 6 C and there is no initial current. Find the charge 
Q on the capacitor at any time t. 

13. A certain vibrating system satisfies the equation u" + yu' + u = 0. Find the value of the 
damping coefficient y for which the quasi period of the damped motion is 50% greater 
than the period of the corresponding undamped motion. 

14. Show that the period of motion of an undamped vibration of a mass hanging from a ver- 
tical spring is 2 n^L/g, where L is the elongation of the spring due to the mass and g is 
the acceleration due to gravity. 

15. Show that the solution of the initial value problem 

mu" + yu! + kit — 0, u(to) = u\ta) = u' 0 

can be expressed as the sum u = v + w, where v satisfies the initial conditions 
v(t 0 ) — uq, v'(to) — 0, w satisfies the initial conditions w(t 0 ) = 0, w'(t 0 ) — u' 0 , and both v 
and w satisfy the same differential equation as u. This is another instance of superposing 
solutions of simpler problems to obtain the solution of a more general problem. 

16. Show that A cos a> 0 t + B sin a> a t can be written in the form r sinfwot — 9). Determine r 
and 9 in terms of A and B. If R cos(a>of — 5) = r sin(a>ot — 9), determine the relationship 
among R, r, 5, and 9. 

17. A mass weighing 8 lb stretches a spring 1.5 in. The mass is also attached to a damper with 
coefficient y . Determine the value of y for which the system is critically damped; be sure 
to give the units for y. 

18. If a series circuit has a capacitor of C = 0.8 x 10~ 6 F and an inductor of L = 0.2 H, find 
the resistance R so that the circuit is critically damped. 

19. Assume that the system described by the equation mu" + yu' + ku = 0 is either critically 
damped or overdamped. Show that the mass can pass through the equilibrium position 
at most once, regardless of the initial conditions. 

Hint: Determine all possible values of t for which u = 0. 

20. Assume that the system described by the equation mu" + yu' + ku = 0 is critically damped 
and that the initial conditions are u( 0) = u 0 ,u'(0) = v 0 . If v 0 — 0, show that u — > 0 as 
t — > oo but that u is never zero. If uo is positive, determine a condition on vq that will 
ensure that the mass passes through its equilibrium position after it is released. 

21. Logarithmic Decrement, (a) For the damped oscillation described by Eq. (26), show 
that the time between successive maxima is T d = 2: r/fi. 

(b) Show that the ratio of the displacements at two successive maxima is given by 
exp(yTii/2m). Observe that this ratio does not depend on which pair of maxima is 
chosen. The natural logarithm of this ratio is called the logarithmic decrement and is 
denoted by A. 
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(c) Show that A = ny/mp. Since m,p, and A are quantities that can be measured easily 
for a mechanical system, this result provides a convenient and practical method for de- 
termining the damping constant of the system, which is more difficult to measure directly. 
In particular, for the motion of a vibrating mass in a viscous fluid, the damping constant 
depends on the viscosity of the fluid; for simple geometric shapes the form of this depen- 
dence is known, and the preceding relation allows the experimental determination of the 
viscosity. This is one of the most accurate ways of determining the viscosity of a gas at 
high pressure. 

22. Referring to Problem 21, find the logarithmic decrement of the system in Problem 10. 

23. For the system in Problem 17 suppose that A = 3 and Tj = 0.3 s. Referring to Problem 
21, determine the value of the damping coefficient y. 

24. The position of a certain spring-mass system satisfies the initial value problem 

|m" + ku = 0, u( 0) = 2, w'( 0) = v. 

If the period and amplitude of the resulting motion are observed to be n and 3, respectively, 
determine the values of k and v. 

25. Consider the initial value problem 

u" + yu' + u = 0, m(0) = 2, w'(0) = 0. 

We wish to explore how long a time interval is required for the solution to become “neg- 
ligible” and how this interval depends on the damping coefficient y. To be more precise, 
let us seek the time r such that |w(f)| < 0.01 for all t > r. Note that critical damping for 
this problem occurs for y = 2. 

(a) Let y = 0.25 and determine r, or at least estimate it fairly accurately from a plot of 
the solution. 

(b) Repeat part (a) for several other values of y in the interval 0 < y < 1.5. Note that r 
steadily decreases as y increases for y in this range. 

(c) Create a graph of r versus y by plotting the pairs of values found in parts (a) and (b). 
Is the graph a smooth curve? 

(d) Repeat part (b) for values of y between 1.5 and 2. Show that r continues to de- 
crease until y reaches a certain critical value y 0 , after which r increases. Find y 0 and the 
corresponding minimum value of r to two decimal places. 

(e) Another way to proceed is to write the solution of the initial value problem in the 
form (26). Neglect the cosine factor and consider only the exponential factor and the 
amplitude R. Then find an expression for rasa function of y . Compare the approximate 
results obtained in this way with the values determined in parts (a), (b), and (d). 

26. Consider the initial value problem 

mu" + yu' + ku = 0, i/(0) = uo, u'( 0) = vo- 

Assume that y 2 < 4 km. 

(a) Solve the initial value problem. 

(b) Write the solution in the form u(t) = R exp(—yt/2m) cos (pt — S). Determine R in 
terms of m,y,k , no, and vo- 

(c) Investigate the dependence of R on the damping coefficient y for fixed values of the 
other parameters. 

27. A cubic block of side I and mass density p per unit volume is floating in a fluid of mass 
density p 0 per unit volume, where p 0 > p. If the block is slightly depressed and then re- 
leased, it oscillates in the vertical direction. Assuming that the viscous damping of the 
fluid and air can be neglected, derive the differential equation of motion and determine 
the period of the motion. 

Hint: Use Archimedes’s principle. An object that is completely or partially submerged in 
a fluid is acted on by an upward (buoyant) force equal to the weight of the displaced fluid. 


3. 7 Mechanical and Electrical Vibrations 


205 


28. The position of a certain undamped spring-mass system satisfies the initial value problem 

u" + 2 u = 0, u( 0) = 0, u'( 0) = 2. 

(a) Find the solution of this initial value problem. 

(b) Plot u versus t and u' versus t on the same axes. 

(c) Plot u! versus u; that is, plot u(t) and u'(t) parametrically with t as the parameter. This 
plot is known as a phase plot, and the uu' -plane is called the phase plane. Observe that 
a closed curve in the phase plane corresponds to a periodic solution u(t). What is the 
direction of motion on the phase plot as t increases? 

The position of a certain spring-mass system satisfies the initial value problem 
u" + \u' + 2 u = 0, u( 0) = 0, u'( 0) = 2. 

(a) Find the solution of this initial value problem. 

(b) Plot u versus t and u' versus t on the same axes. 

(c) Plot u! versus u in the phase plane (see Problem 28). Identify several corresponding 
points on the curves in parts (b) and (c). What is the direction of motion on the phase 
plot as t increases? 

In the absence of damping the motion of a spring-mass system satisfies the initial value 
problem 

mil" + ku = 0, u( 0) = a, i/'(0) = b. 

(a) Show that the kinetic energy initially imparted to the mass is mb 2 / 2 and that the po- 
tential energy initially stored in the spring is ka 2 /2, so that initially the total energy in the 
system is (ka 2 + mb 2 )/ 2. 

(b) Solve the given initial value problem. 

(c) Using the solution in part (b), determine the total energy in the system at any time t. 
Your result should confirm the principle of conservation of energy for this system. 

Suppose that a mass m slides without friction on a horizontal surface. The mass is at- 
tached to a spring with spring constant k , as shown in Figure 3.7.10, and is also subject 
to viscous air resistance with coefficient y. Show that the displacement u(t) of the mass 
from its equilibrium position satisfies Eq. (21). How does the derivation of the equation 
of motion in this case differ from the derivation given in the text? 

i u(t) 


k 



FIGURE 3.7.10 A spring-mass system. 

4fl 32. In the spring-mass system of Problem 31, suppose that the spring force is not given by 
Hooke’s law but instead satisfies the relation 

F s = — (ku + eu 3 ), 

where k > 0 and e is small but may be of either sign. The spring is called a hardening 
spring if e > 0 and a softening spring if e < 0. Why are these terms appropriate? 


4fl 29 - 


30. 


31. 
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(a) Show that the displacement u(t) of the mass from its equilibrium position satisfies the 
differential equation 

mu" + yu' + ku + cm 3 = 0. 

Suppose that the initial conditions are 

m( 0) = 0, n'(0) = 1. 

In the remainder of this problem assume that m = 1, k = 1, and y = 0. 

(b) Find u(t) when e = 0 and also determine the amplitude and period of the motion. 

(c) Let e = 0.1. Plot a numerical approximation to the solution. Does the motion appear 
to be periodic? Estimate the amplitude and period. 

(d) Repeat part (c) for e = 0.2 and e = 0.3. 

(e) Plot your estimated values of the amplitude A and the period T versus e. Describe 
the way in which A and T, respectively, depend on e. 

(f) Repeat parts (c), (d), and (e) for negative values of e. 
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We will now investigate the situation in which a periodic external force is applied 
to a spring-mass system. The behavior of this simple system models that of many 
oscillatory systems with an external force due, for example, to a motor attached to 
the system. We will first consider the case in which damping is present and will look 
later at the idealized special case in which there is assumed to be no damping. 

Forced Vibrations with Damping. The algebraic calculations can be fairly complicated in 
this kind of problem, so we will begin with a relatively simple example. 


EXAMPLE 

1 


Suppose that the motion of a certain spring-mass system satisfies the differential equation 

u" + u! + 1.25m = 3 cos t (1) 

and the initial conditions 

m(0) = 2, m'(0) = 3. (2) 

Find the solution of this initial value problem and describe the behavior of the solution for 
large t. 

The homogeneous equation corresponding to Eq. (1) has the characteristic equation 
r 2 + r + 1.25 = 0 with roots r = —0.5 ± i. Thus the general solution u c (t) of this homogeneous 
equation is 

u c (t) = cie -,/2 cos t + c 2 e _,/2 sin t. (3) 

A particular solution of Eq. (1) has the form U{t) = A cost + Rsinf, where A and B 
are found by substituting U(t) for u in Eq. (1). We have U'(t) = — A sin t + B cos t and 
U"(t) = —A cos t — B sin t. Thus, from Eq. (1) we obtain 

(0.25A + B) cos t + (—A + 0.25 B) sin t = 3 cos t. 

Consequently, A and B must satisfy the equations 


0.25A + B = 3, 


—A + 0.255 = 0, 
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with the result that A = 12/17 and B — 48/17. Therefore the particular solution is 

U(t) = cos t + Yj s in f , (4) 

and the general solution of Eq. (1) is 

u = u c (t) + U(t) = cie~ t/2 cos t + c 2 e~' 12 sin t + || cos t + sin t. (5) 

The remaining constants C\ and c 2 are determined by the initial conditions (2). From Eq. (5) 
we have 

;<(0) = Ci + Yj = 2, u'( 0) = - \c\ + c 2 + = 3, 

so Ci = 22/17 and c 2 — 14/17. Thus we finally obtain the solution of the given initial value 
problem, namely, 

u = Yje~' 12 cos t + Yje~‘ 12 sin t + cos t + sin t. (6) 

The graph of the solution (6) is shown by the black curve in Figure 3.8.1. 



FIGURE 3.8.1 Solution of the initial value problem (1), (2). 


It is important to note that the solution consists of two distinct parts. The first two terms on 
the right side of Eq. (6) contain the exponential factor e~' /2 \ as a result they rapidly approach 
zero. It is customary to call these terms transient. The remaining terms in Eq. (6) involve only 
sines and cosines, and therefore represent an oscillation that continues indefinitely. We refer to 
them as a steady state. The solid and dashed blue curves in Figure 3.8.1 show the transient and 
the steady state parts of the solution, respectively. The transient part comes from the solution 
of the homogeneous equation corresponding to Eq. (1) and is needed to satisfy the initial 
conditions. The steady state is the particular solution of the full nonhomogeneous equation. 
After a fairly short time the transient is vanishingly small and the full solution is essentially 
indistinguishable from the steady state. 
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The equation of motion of a general spring-mass system subject to an external 
force F(t) is [Eq. (7) in Section 3.7] 

mu" {t) + yu!(t) + ku(t) = F(t), (7) 

where m, y , and k are the mass, damping coefficient, and spring constant of the spring- 
mass system. Suppose now that the external force is given by Fq cos cot, where Fq and 
co are positive constants representing the amplitude and frequency, respectively, of 
the force. Then Eq. (7) becomes 

mu" + yu' + ku = Fq cos cot. (8) 

Solutions of Eq. (8) behave very much like the solution in the preceding example. 
The general solution of Eq. (8) must have the form 

u = c\u\(t) + C 2 U 2 (t) + A cos cot + B sin cot — u c (t ) + U(t). (9) 

The first two terms on the right side of Eq. (9) are the general solution u c (t ) of 
the homogeneous equation corresponding to Eq. (8), while the latter two terms 
are a particular solution U(t) of the full nonhomogeneous equation. The coeffi- 
cients A and B can be found, as usual, by substituting these terms into the differ- 
ential equation (8), while the arbitrary constants ci and C 2 are available to satisfy 
initial conditions, if any are prescribed. The solutions m(f) and 1 ( 2 ( 1 ) of the ho- 
mogeneous equation depend on the roots r\ and r 2 of the characteristic equation 
mr 2 + yr + k = 0. Since m, y, and k are all positive, it follows that r\ and r 2 either 
are real and negative or are complex conjugates with negative real part. In either 
case, both u\ (t) and / 12 (f) approach zero as t — >■ 00 . Since u c (t) dies out as t increases, 
it is called the transient solution. In many applications, it is of little importance and 
(depending on the value of y) may well be undetectable after only a few seconds. 

The remaining terms in Eq. (9), namely, U(t) = A cos cot + B sin cot, do not die out 
as t increases but persist indefinitely, or as long as the external force is applied. They 
represent a steady oscillation with the same frequency as the external force and 
are called the steady state solution or the forced response. The transient solution 
enables us to satisfy whatever initial conditions may be imposed; with increasing time, 
the energy put into the system by the initial displacement and velocity is dissipated 
through the damping force, and the motion then becomes the response of the system 
to the external force. Without damping, the effect of the initial conditions would 
persist for all time. 

It is convenient to express U (?) as a single trigonometric term rather than as a sum 
of two terms. Recall that we did this for other similar expressions in Section 3.7. 
Thus we write 

U (t) — R cos(cot — 8) . (10) 


The amplitude R and phase S depend directly on A and B and indirectly on the 
parameters in the differential equation (8). It is possible to show, by straightforward 
but somewhat lengthy algebraic computations, that 



cos 8 = 


m(col — co 2 ) 
A 


• 1 y w 
sina = — , 

A 


( 11 ) 


where 


A = 


m 2 (co q — co 2 ) 2 + y 2 co 2 and col = k/m. 


( 12 ) 
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Recall that co o is the natural frequency of the unforced system in the absence of 
damping. 

We now investigate how the amplitude R of the steady state oscillation depends on 
the frequency co of the external force. Substituting from Eq. (12) into the expression 
for R in Eq. (11) and executing some algebraic manipulations, we find that 



(13) 


where F = y 2 /mk. Observe that the quantity Rk/Fo is the ratio of the amplitude R 
of the forced response to F 0 /k , the static displacement of the spring produced by a 
force Ff). 

For low frequency excitation, that is, as co —*■ 0, it follows from Eq. (13) that 
Rk/Fo — »■ 1 or R —> Fo/k. At the other extreme, for very high frequency excita- 
tion, Eq. (13) implies that R -> 0 as co — »■ oo. At an intermediate value of co the 
amplitude may have a maximum. To find this maximum point, we can differentiate 
R with respect to co and set the result equal to zero. In this way we find that the 
maximum amplitude occurs when co = tt> max , where 



(14) 


Note that ® max < a> 0 and that ® max is close to ooq when y is small. The maximum 
value of R is 



(15) 


where the last expression is an approximation for small y. If y 2 /mk > 2, then <u max 
as given by Eq. (14) is imaginary; in this case the maximum value of R occurs for 
oo — 0, and R is a monotone decreasing function of co. Recall that critical damping 
occurs when y 2 /mk — 4. 

For small y it follows from Eq. (15) that R max = Fo/ycoo- Thus, for lightly damped 
systems, the amplitude R of the forced response when co is near coq is quite large 
even for relatively small external forces, and the smaller the value of y, the more 
pronounced is this effect. This phenomenon is known as resonance, and it is often an 
important design consideration. Resonance can be either good or bad, depending 
on the circumstances. It must be taken very seriously in the design of structures, 
such as buildings and bridges, where it can produce instabilities that might lead to 
the catastrophic failure of the structure. On the other hand, resonance can be put 
to good use in the design of instruments, such as seismographs, that are intended to 
detect weak periodic incoming signals. 

Figure 3.8.2 contains some representative graphs of Rk/Fo versus co/coq for several 
values of T = y 2 /mk. The graph corresponding to T = 0.015625 is included because 
this is the value of T that occurs in Example 2 below. Note particularly the sharp 
peak in the curve corresponding to T = 0.015625 near co/coo — 1. The limiting case 
as T — > 0 is also shown. It follows from Eq. (13), or from Eqs. (11) and (12), that 
R Fo/m\co q — co 2 | as y — > 0 and hence Rk/Fo is asymptotic to the vertical line 
co — u>o, as shown in the figure. As the damping in the system increases, the peak 
response gradually diminishes. 
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FIGURE 3.8.2 Forced vibration with damping: amplitude of steady state response versus 
frequency of driving force; r = y 2 /mk. 



FIGURE 3.8.3 Forced vibration with damping; phase of steady state response versus 
frequency of driving force; V = y 2 /mk. 

Figure 3.8.2 also illustrates the usefulness of dimensionless variables. You can 
easily verify that each of the quantities Rk/Fo , co/wo, and F is dimensionless. The 
importance of this observation is that the number of significant parameters in the 
problem has been reduced to three rather than the five that appear in Eq. (8). Thus 
only one family of curves, of which a few are shown in Figure 3.8.2, is needed to 
describe the response-versus-frequency behavior of all systems governed by Eq. (8). 

The phase angle <5 also depends in an interesting way on a>. For &> near zero, 
it follows from Eqs. (11) and (12) that cos S = 1 and sin <5 = 0. Thus <5 = 0, and 
the response is nearly in phase with the excitation, meaning that they rise and fall 
together and, in particular, assume their respective maxima nearly together and their 
respective minima nearly together. For co — a > o we find that cos 5 = 0 and sin S = 1, 
so 8 — jt /2. In this case the response lags behind the excitation by n/2\ that is, the 
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peaks of the response occur n /2 later than the peaks of the excitation, and similarly 
for the valleys. Finally, for co very large, we have cos 8 = — 1 and sin S = 0. Thus <5 = tt, 
so that the response is nearly out of phase with the excitation; this means that the 
response is minimum when the excitation is maximum, and vice versa. Figure 3.8.3 
shows the graphs of 8 versus co/coq for several values of T. For small damping, the 
phase transition from near S = 0 to near S — it occurs rather abruptly, whereas for 
larger values of the damping parameter, the transition takes place more gradually. 


EXAMPLE 

2 


Consider the initial value problem 

u" + 0.\25u' + u = 3coswt, n( 0) = 2, u'( 0) = 0. (16) 

Show plots of the solution for different values of the forcing frequency a>, and compare them 
with corresponding plots of the forcing function. 

For this system we have wo = landF = 1/64 = 0.015625. Its unforced motion was discussed 
in Example 3 of Section 3.7, and Figure 3.7.7 shows the graph of the solution of the unforced 
problem. Figures3. 8.4, 3.8.5, and 3.8.6 show the solution of the forced problem (16) for w = 0.3, 
co = l, and ai = 2, respectively. The graph of the corresponding forcing function is also shown 
in each figure. In this example the static displacement, F 0 /k , is equal to 3. 
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FIGURE 3.8.4 

Solution Forcing function 

A forced vibration with damping; solution of u" + 0.125 u' + u = 


m(0) = 2, h'(0) = 0. 


Figure 3.8.4 shows the low frequency case, at/wo = 0.3. After the initial transient response is 
substantially damped out, the remaining steady state response is essentially in phase with the 
excitation, and the amplitude of the response is somewhat larger than the static displacement. 
To be specific, R = 3.2939 and 5 = 0.041185. 

The resonant case, co/a > o = 1, is shown in Figure 3.8.5. Here the amplitude of the steady 
state response is eight times the static displacement, and the figure also shows the predicted 
phase lag of n/2 relative to the external force. 

The case of comparatively high frequency excitation is shown in Figure 3.8.6. Observe that 
the amplitude of the steady forced response is approximately one-third the static displacement 
and that the phase difference between the excitation and response is approximately n. More 
precisely, we find that R = 0.99655 and that 8 = 3.0585. 
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Forcing function Solution 

FIGURE 3.8.5 A forced vibration with damping; solution of u" + 0.125«' + u = 3 cos t, 

m(0) = 2, m'(0) = 0. 



Forcing function Solution 

FIGURE 3.8.6 A forced vibration with damping; solution of u" + 0.125 u! + u = 3 cos It, 

u( 0) = 2, m'(0) = 0. 


Forced Vibrations Without Damping. We now assume that y = 0 in Eq. (8), thereby ob- 
taining the equation of motion of an undamped forced oscillator 

mu" + ku — Fq cos cot. (17) 

The form of the general solution of Eq. (17) is different, depending on whether the 
forcing frequency co is different from or equal to the natural frequency &>o = yjk/m 
of the unforced system. First consider the case co / moi then the general solution of 
Eq. (17) is 

F 0 

m(col — co 2 ) 


u — ci cos coot + C 2 sin coot + 


cos cot. 


(18) 
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The constants c\ and t? are determined by the initial conditions. The resulting motion 
is, in general, the sum of two periodic motions of different frequencies (&>o and co) 
and different amplitudes as well. 

It is particularly interesting to suppose that the mass is initially at rest, so that the 
initial conditions are u(0) — 0 and u'( 0) = 0. Then the energy driving the system 
comes entirely from the external force, with no contribution from the initial con- 
ditions. In this case it turns out that the constants ci and ci in Eq. (18) are given 
by 

F 0 

c t= - 

and the solution of Eq. (17) is 


m(u>Q 


■co 2 ) 




it 


I 

m(col — co 2 ) 


(cos cot — coscoot). 


( 20 ) 


This is the sum of two periodic functions of different periods but the same amplitude. 
Making use of the trigonometric identities for cos(A ± B) with A — (coq + co)t / 2 and 
B = (a>o — co)t/2, we can write Eq. (20) in the form 


2Fn 


m(&>5 


■ co 2 ) 


sin 


(&>o — co)t 


sin 


(ct>o + co)t 


( 21 ) 


If |a>o — co\ is small, then coq + co is much greater than \coq — co\. Consequently, 
sin(&>o + co)t/ 2 is a rapidly oscillating function compared to sin(®o — co)t/2. Thus 
the motion is a rapid oscillation with frequency (a>o + co) /2 but with a slowly varying 
sinusoidal amplitude 

2Fo . (co o — co)t 
9 T sin ■ 

vn | coq — co 2 1 2 


This type of motion, possessing a periodic variation of amplitude, exhibits what is 
called a beat. For example, such a phenomenon occurs in acoustics when two tuning 
forks of nearly equal frequency are excited simultaneously. In this case the peri- 
odic variation of amplitude is quite apparent to the unaided ear. In electronics, the 
variation of the amplitude with time is called amplitude modulation. 


EXAMPLE 

3 


Solve the initial value problem 

u" + u = 0.5cos0.8f, u( 0) = 0, u'( 0) = 0, (22) 

and plot the solution. 

In this case a>o = 1, co = 0.8, and F 0 = 0.5, so from Eq. (21) the solution of the given problem 
is 

u = 2.77778(sin0.1f)(sin0.9f). (23) 


A graph of this solution is shown in Figure 3.8.7. The amplitude variation has a slow frequency 
of 0.1 and a corresponding slow period of 20^-. Note that a half-period of I0n corresponds to 
a single cycle of increasing and then decreasing amplitude. The displacement of the spring- 
mass system oscillates with a relatively fast frequency of 0.9, which is only slightly less than 
the natural frequency coq. 
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u — 2.77778(sin0.1/)(sin0.9/). 


Now imagine that the forcing frequency a> is further increased, say, to co = 0.9. Then the 
slow frequency is halved to 0.05, and the corresponding slow half-period is doubled to 207r. 
The multiplier 2.7778 also increases substantially, to 5.2632. However, the fast frequency is 
only marginally increased, to 0.95. Can you visualize what happens as a> takes on values closer 
and closer to the natural frequency a>o — 1? 



FIGURE 3.8.8 Resonance; solution of u" + u = 0.5 cos/, n(0) = 0, w'(0) = 0; 

u — 0.25/ sin /. 
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Now let us return to Eq. (17) and consider the case of resonance, where a> — co o; 
that is, the frequency of the forcing function is the same as the natural frequency of the 
system. Then the nonhomogeneous term Fq cos cot is a solution of the homogeneous 
equation. In this case the solution of Eq. (17) is 


u — c\ cos ooot + C 2 sin ooot + 


F 0 

2ma>o 


t sin coot. 


(24) 


Because of the term t sin (Dot. the solution (24) predicts that the motion will become 
unbounded as t —*■ oo regardless of the values of c i and ci', see Figure 3.8.8 for a 
typical example. Of course, in reality, unbounded oscillations do not occur. As soon 
as u becomes large, the mathematical model on which Eq. (17) is based is no longer 
valid, since the assumption that the spring force depends linearly on the displacement 
requires that u be small. As we have seen, if damping is included in the model, the 
predicted motion remains bounded; however, the response to the input function 
Fq cos ( ot may be quite large if the damping is small and co is close to a>o . 


PROBLEMS In each of Problems 1 through 4 write the given expression as a product of two trigonometric 
functions of different frequencies. 

1. cos 9? — cos It 2. sin7f — sin6f 

3. cosjrf + cos2jrt 4. sin3t + sin4f 

5. A mass weighing 4 lb stretches a spring 1.5 in. The mass is displaced 2 in. in the positive 
direction from its equilibrium position and released with no initial velocity. Assuming 
that there is no damping and that the mass is acted on by an external force of 2 cos 3f lb, 
formulate the initial value problem describing the motion of the mass. 

6. A mass of 5 kg stretches a spring 10 cm. The mass is acted on by an external force of 
10 sin(t/2) N (newtons) and moves in a medium that imparts a viscous force of 2 N when 
the speed of the mass is 4 cm/s. If the mass is set in motion from its equilibrium position 
with an initial velocity of 3 cm/s, formulate the initial value problem describing the motion 
of the mass. 

7. (a) Find the solution of Problem 5. 

(b) Plot the graph of the solution. 

(c) If the given external force is replaced by a force 4 sin cot of frequency a>, find the value 
of cd for which resonance occurs. 

8. (a) Find the solution of the initial value problem in Problem 6. 

(b) Identify the transient and steady state parts of the solution. 

(c) Plot the graph of the steady state solution. 

(d) If the given external force is replaced by a force of 2 cos cot of frequency co. find the 
value of co for which the amplitude of the forced response is maximum. 

9. If an undamped spring-mass system with a mass that weighs 6 lb and a spring constant 
1 lb/in is suddenly set in motion at t = 0 by an external force of 4 cos It lb, determine the 
position of the mass at any time and draw a graph of the displacement versus t. 

10. A mass that weighs 8 lb stretches a spring 6 in. The system is acted on by an external force 
of 8 sin 8r lb. If the mass is pulled down 3 in and then released, determine the position of 
the mass at any time. Determine the first four times at which the velocity of the mass is 


zero. 
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11. A spring is stretched 6 in by a mass that weighs 8 lb. The mass is attached to a dashpot 
mechanism that has a damping constant of 0.25 lb-s/ft and is acted on by an external force 
of 4 cos 2 1 lb. 

(a) Determine the steady state response of this system. 

(b) If the given mass is replaced by a mass m, determine the value of m for which the 
amplitude of the steady state response is maximum. 

12. A spring-mass system has a spring constant of 3 N/m. A mass of 2 kg is attached to the 
spring, and the motion takes place in a viscous fluid that offers a resistance numerically 
equal to the magnitude of the instantaneous velocity. If the system is driven by an external 
force of (3 cos 3t — 2 sin 3 t) N, determine the steady state response. Express your answer 
in the form R cos (cot — S). 

13. In this problem we ask you to supply some of the details in the analysis of a forced damped 
oscillator. 

(a) Derive Eqs. (10), (11), and (12) for the steady state solution of Eq. (8). 

(b) Derive the expression in Eq. (13) for Rk/F 0 . 

(c) Show that w^ ax and 7? max are given by Eqs. (14) and (15), respectively. 

14. Find the velocity of the steady state response given by Eq. (10). Then show that the 
velocity is maximum when co = co 0 . 

15. Find the solution of the initial value problem 


where 


F(t) = 


F(t ), 

m(0) = 0, u\ 0) = 

Fot, 

0 <t<n. 

F 0 (2n 

- t), n < t < 2 tc, 

0, 

2n < t. 


Flint: Treat each time interval separately, and match the solutions in the different intervals 
by requiring u and u' to be continuous functions of t. 

16. A series circuit has a capacitor of 0.25 x 10~ 6 F, a resistor of 5 x 10 3 Q, and an inductor 
of 1 H. The initial charge on the capacitor is zero. If a 12-volt battery is connected to the 
circuit and the circuit is closed at t = 0, determine the charge on the capacitor at t = 0.001 s, 
at t = 0.01 s, and at any time t. Also determine the limiting charge as t — > oo. 

17. Consider a vibrating system described by the initial value problem 

u" + \u + 2u = 2cosa>f, m(0) = 0, u'( 0) = 2. 

(a) Determine the steady state part of the solution of this problem. 

(b) Find the amplitude A of the steady state solution in terms of u>. 

(c) Plot A versus co. 

(d) Find the maximum value of A and the frequency to for which it occurs. 

(fo 18. Consider the forced but undamped system described by the initial value problem 

u" + u = 3 cos cot, u( 0) = 0, u'( 0) = 0. 


(a) Find the solution u(t) for co ^ 1. 

(b) Plot the solution u(t) versus t for co = 0.7, co = 0.8, and a> = 0.9. Describe how the 
response u(t) changes as co varies in this interval. What happens as co takes on values 
closer and closer to 1? Note that the natural frequency of the unforced system is co 0 = 1. 
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19. Consider the vibrating system described by the initial value problem 

u" + u = 3 cos cot, u(0) — 1 , u'( 0) = 1. 

(a) Find the solution for cd 1. 

(b) Plot the solution u(t) versus t for cd = 0.7, cd = 0.8, and id = 0.9. Compare the results 
with those of Problem 18; that is, describe the effect of the nonzero initial conditions. 

20. For the initial value problem in Problem 18 plot u' versus u for id = 0.7, id = 0.8, and 

cd = 0.9. Such a plot is called a phase plot. Use a t interval that is long enough so that the 

phase plot appears as a closed curve. Mark your curve with arrows to show the direction 
in which it is traversed as t increases. 

Problems 21 through 23 deal with the initial value problem 

u" + 0.125«' + 4 u = F(t), u(0) = 2, «'(0) = 0. 

In each of these problems: 

(a) Plot the given forcing function F(t) versus f, and also plot the solution u{t) versus t on the 
same set of axes. Use a t interval that is long enough so the initial transients are substantially 
eliminated. Observe the relation between the amplitude and phase of the forcing term and 
the amplitude and phase of the response. Note that cd 0 = J~k[m = 2. 

(b) Draw the phase plot of the solution; that is, plot u' versus u. 

21. F(t) = 3cos(t/4) 

4fl 22. F(t) = 3cos2f 
4fl 23. F(t) = 3 cos 6 1 

41 24 - A spring-mass system with a hardening spring (Problem 32 of Section 3.7) is acted on by 
a periodic external force. In the absence of damping, suppose that the displacement of 
the mass satisfies the initial value problem 

u" + it + \u? = cos cot, u(0) = 0, m'(0) = 0. 

(a) Let cd = 1 and plot a computer-generated solution of the given problem. Does the 
system exhibit a beat? 

(b) Plot the solution for several values of cd between 1/2 and 2. Describe how the solution 
changes as cd increases. 

25. Suppose that the system of Problem 24 is modified to include a damping term and that 
the resulting initial value problem is 

u" + lu + u + ir< 3 = cos cot, u( 0) = 0, u! (()) = 0. 

(a) Plot a computer-generated solution of the given problem for several values of cd 
between 1 /2 and 2, and estimate the amplitude R of the steady response in each case. 

(b) Using the data from part (a), plot the graph of R versus cd. For what frequency cd is 
the amplitude greatest? 

(c) Compare the results of parts (a) and (b) with the corresponding results for the linear 
spring. 


REFERENCES Coddington, E. A. .An Introduction to Ordinary Differential Equations (Englewood Cliffs, NJ: Prentice- 

Hall, 1961; New York: Dover, 1989). 

There are many books on mechanical vibrations and electric circuits. One that deals with both is 
Close, C. M., and Frederick, D. ^..Modeling and Analysis of Dynamic Systems (3rd ed.) (New York: Wiley, 
2001 ). 
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1985). 

A more recent intermediate-level book is 

Thomson, W. T., Theory of Vibrations with Applications (5th ed.) (Englewood Cliffs, NJ: Prentice-Hall, 
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An elementary book on electrical circuits is 
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CHAPTER 


Higher Order Linear 
Equations 


The theoretical structure and methods of solution developed in the preceding chapter 
for second order linear equations extend directly to linear equations of third and 
higher order. In this chapter we briefly review this generalization, taking particular 
note of those instances where new phenomena may appear, because of the greater 
variety of situations that can occur for equations of higher order. 


4.1 General Theory of nth Order Linear Equations 


An nth order linear differential equation is an equation of the form 

d n y d n ~ l y dy , 

P «(t)— + P l (t) df H _l + ' ' ' + Pn ~l (0^ + Pn(0y = G(t). (1) 

We assume that the functions Pq, . . . , P n , and G are continuous real-valued functions 
on some interval 7: a < t < p, and that Pq is nowhere zero in this interval. Then, 
dividing Eq. (1) by Po(t), we obtain 

d n y d n ~ l y dy 

Ply] = +Pl ^~^T + " ' +Pn(t)y = g(t). (2) 

The linear differential operator L of order n defined by Eq. (2) is similar to the second 
order operator introduced in Chapter 3. The mathematical theory associated with 
Eq. (2) is completely analogous to that for the second order linear equation; for this 
reason we simply state the results for the nth order problem. The proofs of most of 
the results are also similar to those for the second order equation and are usually left 
as exercises. 
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Theorem 4.1.1 


Since Eq. (2) involves the nth derivative of y with respect to t, it will, so to speak, 
require n integrations to solve Eq. (2). Each of these integrations introduces an 
arbitrary constant. Hence we can expect that, to obtain a unique solution, it is 
necessary to specify n initial conditions 

y(to) = yo, y'ito) =/ 0 , . . . , y ( " _1) (f 0 ) = (3) 

whereto maybe any point in the interval/ and yo,yo> ■ ■ • , >’f " 11 is any set of prescribed 
real constants. The following theorem, which is similar to Theorem 3.2.1, guarantees 
that the initial value problem (2), (3) has a solution and that it is unique. 


If the functions p\, p 2 , . . . . p n , and g are continuous on the open interval /, 
then there exists exactly one solution y = (p(t) of the differential equation (2) 
that also satisfies the initial conditions (3). This solution exists throughout the 
interval I. 


We will not give a proof of this theorem here. However, if the coefficients/?!, ■ ■ ■ • Pn 
are constants, then we can construct the solution of the initial value problem (2), (3) 
much as in Chapter 3; see Sections 4.2 through 4.4. Even though we may find a 
solution in this case, we do not know that it is unique without the use of Theo- 
rem 4.1.1. A proof of the theorem can be found in Ince (Section 3.32) or Coddington 
(Chapter 6). 

The Homogeneous Equation. As in the corresponding second order problem, we first 
discuss the homogeneous equation 

L[yi = / n) + Pl (f)y ( " _1) + ■ ■ • + p n - 1 (t)y' + p n (f)y = 0. (4) 

If the functions y\, . . . ,y n are solutions of Eq. (4), then it follows by direct com- 

putation that the linear combination 

y — ciyi(t) + C2y 2 (0 H b c n y n (t), (5) 

where c \, . . . , c„ are arbitrary constants, is also a solution of Eq. (4). It is then natural 
to ask whether every solution of Eq. (4) can be expressed as a linear combination 
of y\, . . . ,y n . This will be true if, regardless of the initial conditions (3) that are pre- 
scribed, it is possible to choose the constants c \, . . . , c„ so that the linear combination 
(5) satisfies the initial conditions. That is, for any choice of the point to in /, and for 
any choice of y 0 , y ' Q , . . . ,y ( Q~ l \ we must be able to determine <q , . . . , c n so that the 
equations 

ciyi(fo) H h c n y n (t 0 ) = yo 

ciy'x (to) + • • • + c n y' n (to) = y'o 

( 6 ) 

ciyi" _1, ( f o) H b c«y‘" _1, (to) = yo'” 1 ’ 

are satisfied. Equations (6) can be solved uniquely for the constants ci,...,c„, 
provided that the determinant of coefficients is not zero. On the other hand, if 
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Theorem 4.1.2 


EXAMPLE 

1 


the determinant of coefficients is zero, then it is always possible to choose values 
of yo, Yq, ■ ■ ■ ,y ( Q -1) so that Eqs. (6) do not have a solution. Hence a necessary and 
sufficient condition for the existence of a solution of Eqs. (6) for arbitrary values of 


yo,y' 0 ,- 

. ,y q" 11 is that the Wronskian 






yi 

y 2 

y n 



W (yi, . . . ,y ;1 ) = 

y'1 

y'i 

y'n 

(7) 



yf-b 

1 

Scj 

' ^ 

= = 

1 



is not zero at t = t 0 . Since to can be any point in the interval /, it is necessary and 
sufficient that W(yi,y 2 , ■ ■ ■ ,y n ) be nonzero at every point in the interval. Just as for 
the second order linear equation, it can be shown that if yi, y 2 , . . . ,y n are solutions 
of Eq. (4), then W(yi,y 2 , . . . ,y„) either is zero for every t in the interval I or else is 
never zero there; see Problem 20. Hence we have the following theorem. 


If the functions p i, p 2 , . . . ,p„ are continuous on the open interval 7, if the 
functions yi, y 2 , . . . ,y„ are solutions of Eq. (4), and if W(yi,y 2 , . . . ,y n )(f) Y 0 f° r 
at least one point in /, then every solution of Eq. (4) can be expressed as a linear 
combination of the solutions yi,y 2 , . . . ,y„. 


A set of solutions yi, . . . ,y„ of Eq. (4) whose Wronskian is nonzero is referred to 
as a fundamental set of solutions. The existence of a fundamental set of solutions can 
be demonstrated in precisely the same way as for the second order linear equation 
(see Theorem 3.2.5). Since all solutions of Eq. (4) are of the form (5), we use the 
term general solution to refer to an arbitrary linear combination of any fundamental 
set of solutions of Eq. (4). 

Linear Dependence and Independence. We now explore the relationship between funda- 
mental sets of solutions and the concept of linear independence, a central idea in the 
study of linear algebra. The functions / ,/ 2 , . . . ,/„ are said to be linearly dependent 
on an interval I if there exists a set of constants k\,k 2 ,..., k n , not all zero, such that 

k\fi (t) + k 2 f 2 (t) 4 f k n f n (t) = 0 (8) 

for all t in I. The functions ,f n are said to be linearly independent on / if they 
are not linearly dependent there. 


Determine whether the functions fi(t) = l,/ 2 (0 = t , and/ 3 (t) = t 2 are linearly independent or 
dependent on the interval I : — oo < f < oo. 

Form the linear combination 

£i/i(0 + k2f 2 (t) + £ 3 / 3(0 = £1 + k 2 t + k 3 t 2 , 

and set it equal to zero to obtain 


£1 + k 2 t + kit 2 = 0 . 


(9) 


222 


Chapter 4. Higher Order Linear Equations 


EXAMPLE 

2 


If Eq. (9) is to hold for all t in 7, then it must certainly be true at any three distinct points in I. 
Any three points will serve our purpose, but it is convenient to choose t = 0, t = 1 , and t = — 1 . 
Evaluating Eq. (9) at each of these points, we obtain the system of equations 

ki = 0 , 

k 1 + k 2 + k 3 = 0, (10) 

*i — k 2 + *3 = 0. 


From the first of Eqs. (10) we note that *i = 0; then from the other two equations it follows 
that* 2 = k 3 = Oaswell. Therefore there is no set of constants k 3 ,k 2 , k 3 , not all zero, for which 
Eq. (9) holds even at the three chosen points, much less throughout 7. Thus the given functions 
are not linearly dependent on 7, so they must be linearly independent. Indeed, they are linearly 
independent on any interval. This can be established just as in this example, possibly using a 
different set of three points. 


Determine whether the functions /!(?) = 1, f 2 { t ) = 2 + t , f 3 ( t ) = 3 — t 2 , and 

f 4 ( t ) = 4 1 + t 2 are linearly independent or dependent on any interval 7. 

Form the linear combination 

k -\ f \ (?) + *2/2(7) + k 3 f 3 ( t ) + *4/4(1) = *1 + fe(2 + f ) + *3(3 — r ) + *4(4? + t 2 ) 

= ( k \ + 2*2 + 3*3) + (*2 + 4*4)? + (—*3 T *4)?“. ( 31 ) 

This expression is zero throughout an interval provided that 


*4 T 2*2 T 3*3 — 0, *2 T 4*4 — 0, — *3 T *4 — 0. 


These three equations, with four unknowns, have many solutions. For instance, if * 4 = 1, then 
*3 = 1,*2 = — 4,and*i = 5. Thus the given functions are linearly dependent on every interval. 


The concept of linear independence provides an alternative characterization of 
fundamental sets of solutions of the homogeneous equation (4). Suppose that the 
functions y\,...,y n are solutions of Eq. ( 4) on an interval 7 and consider the equation 

k\y\ (t) 4 h k n y n (f) = 0. (12) 


By differentiating Eq. (12) repeatedly, we obtain the additional n — 1 equations 


kiy[(t) H b k n y' n (t) = 0, 




(13) 


The system consisting of Eqs. (12) and (13) is a system of n linear algebraic equations 
for the n unknowns * 1 , . . . , *„. The determinant of coefficients for this system is the 
Wronskian W(y 1 , . . . ,y n )(t) of yi, . . . ,y n - This leads to the following theorem. 
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Theorem 4.1.3 


If yi (t), . . . ,y„(t) is a fundamental set of solutions of Eq. (4) 

L[y] = y w + pi{t)y in ~ l) + • • • + (t)y' + p n {t)y = 0 

on an interval /, then yi (t), . . . ,y n (t ) are linearly independent on 1. Conversely, if 
,y n (t) are linearly independent solutions of Eq. (4) on /, then they form a 
fundamental set of solutions on I. 


To prove this theorem, first suppose that yi(t),... ,y„(t) is a fundamental set of 
solutions of Eq. (4) on /. Then the Wronskian W(y\, . . . ,y n )(t ) ^ 0 for every t in 1. 
Hence the system (12), (13) has only the solution k\ = • • • = k n = 0 for every t in I. 
Thus y i (t ) , . . . , y n (t) cannot be linearly dependent on I and must therefore be linearly 
independent there. 

To demonstrate the converse, let yi(f), . . . ,y„(f) be linearly independent on I. To 
show that they form a fundamental set of solutions, we need to show that their 
Wronskian is never zero in I. Suppose that this is not true; then there is at least 
one point to where the Wronskian is zero. At this point the system (12), (13) has a 
nonzero solution; let us denote it by k *, . . . , k*. Now form the linear combination 

4>(t) = k*y\ (f) + ■■ ■ + k*y n (t). (14) 

Then r/j (t) satisfies the initial value problem 

L[y] = 0, y(f 0 ) = 0, y'(f 0 ) = 0, ..., y ( ' I_1) (r 0 ) = 0. (15) 

The function 0 satisfies the differential equation because it is a linear combination of 
solutions; it satisfies the initial conditions because these are just the equations in the 
system (12), (13) evaluated at to. However, the function y(t) = 0 for all t in I is also 
a solution of this initial value problem and, by Theorem 4.1.1, the solution is unique. 
Thus <p(t) = 0 for all t in I. Consequently, yi (r), . . . ,y n (t) are linearly dependent on 
/, which is a contradiction. Hence the assumption that there is a point where the 
Wronskian is zero is untenable. Therefore the Wronskian is never zero on I, as was 
to be proved. 

Note that for a set of functions f\ , . . . ,/„ that are not solutions of Eq. (4) the con- 
verse part of Theorem 4.1.3 is not necessarily true. They may be linearly independent 
on I even though the Wronskian is zero at some points, or even every point, but with 
different sets of constants k\,...,k n at different points. See Problem 25 for an ex- 
ample. 

The Nonhomogeneoas Equation. Now consider the nonhomogeneous equation (2) 

L[y] = y (n) +pi(f)y «"- 1) + • ■ -+Pn(t)y = gif). 

If Y\ and Yi are any two solutions of Eq. (2), then it follows immediately from the 
linearity of the operator L that 

L[Y X - Y 2 ](.t) = L[Y{]{t) - L[Y 2 ](t) = g(t ) - g(t) = 0. 

Hence the difference of any two solutions of the nonhomogeneous equation (2) is a 
solution of the homogeneous equation (4). Since any solution of the homogeneous 
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PROBLEMS 


equation can be expressed as a linear combination of a fundamental set of solutions 
yi, . . . ,y„,it follows that any solution of Eq. (2) can be written as 


y = ciyi{t) + c 2 y 2 (t) H b c n y n (t) + Y (t), (16) 

where Y is some particular solution of the nonhomogeneous equation (2). The linear 
combination (16) is called the general solution of the nonhomogeneous equation (2). 

Thus the primary problem is to determine a fundamental set of solutions y \, . . . , y„ 
of the homogeneous equation (4). If the coefficients are constants, this is a fairly 
simple problem; it is discussed in the next section. If the coefficients are not constants, 
it is usually necessary to use numerical methods such as those in Chapter 8 or series 
methods similar to those in Chapter 5. These tend to become more cumbersome as 
the order of the equation increases. 

The method of reduction of order (Section 3.4) also applies to nth order linear 
equations. If y i is one solution of Eq. (4), then the substitution y — v{t)y\ ( t ) leads to 
a linear differential equation of order n — 1 for v' (see Problem 26 for the case when 
n = 3). However, if n > 3, the reduced equation is itself at least of second order, and 
only rarely will it be significantly simpler than the original equation. Thus, in practice, 
reduction of order is seldom useful for equations of higher than second order. 


In each of Problems 1 through 6 determine intervals in which solutions are sure to exist. 

1. y (4) + Ay"' + 3 y = t 2. ty"' + (sin t)y" + 3 y — cos t 

3. t(t - 1 )y (4) + e‘y" + At 2 y = 0 4. /" + ty" + t 2 y' + t 3 y = In t 

5. (x — l)y (4) + (x + l)y" + (tanx)y = 0 6. (x 2 - 4)y (6) + x 2 y'" + 9y = 0 

In each of Problems 7 through 10 determine whether the given set of functions is linearly 
dependent or linearly independent. If they are linearly dependent, find a linear relation 
among them. 

7. /i(0 = 2t - 3, f 2 (t) = t 2 + 1, / 3 (t) = 2t 2 -f 

8. fi(t) = 2t - 3, f 2 (t) = 2t 2 + h fiit) =3t 2 + t 

9-/i(0 = 2f-3, f 2 (t) = t 2 + 1, / 3 (f) = 2f 2 -f, / 4 (/) = f 2 + t + l 

10. /i(f) = 2t - 3, f 2 (t) = t 3 + 1, f 3 (t) = 2t 2 — t, f A {t) = t 2 + t + l 

In each of Problems 1 1 through 1 6 verify that the given functions are solutions of the differential 
equation, and determine their Wronskian. 

11. y"' + / = 0; 1, cost, sint 

12. y (4) + y" = 0; 1, t, cost, sint 

13. y'" + 2 y" - y' - 2 y = 0; e\ e~\ e~ 2t 

14. y (4) + 2y"' + y" — 0; 1, t, e~‘, te" r 

15. xy'" - y" = 0; 1, x, x 3 

16. x 3 y'" + x 2 y" - 2xy' + 2 y = 0; x, x 2 , 1/x 

17. Show that W(5,sin 2 f, cos2f) = 0 for all t. Can you establish this result without direct 
evaluation of the Wronskian? 

18. Verify that the differential operator defined by 


L[y] = y (n) + pi(t)y ( "“ * 1) 11 + • • • + p„{t)y 
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is a linear differential operator. That is, show that 

L[ciyi + c 2 y 2 ] = CiL[yi\ + c 2 L[y 2 ], 

where y 1 and y 2 are n times differentiable functions and c\ and c 2 are arbitrary constants. 
Hence, show that if y\, y 2 ,... , y„ are solutions of L\y ] = 0, then the linear combination 
Ciyi + • • • + c„y n is also a solution of L[y] = 0. 

19. Let the linear differential operator L be defined by 

L[y] = a 0 y (n) + «iy ( " _1) + • • • + a n y, 

where ao,a\,...,a n are real constants. 

(a) Find L\t n \. 

(b) Find L[e"]. 

(c) Determine four solutions of the equation y (4) — 5y" + 4 y — 0. Do you think the four 
solutions form a fundamental set of solutions? Why? 

20. In this problem we show how to generalize Theorem 3.2.6 (Abel’s theorem) to higher 
order equations. We first outline the procedure for the third order equation 

/" + Pi(t)y" + p 2 (t)y' + p 3 (t)y = 0. 


Let y\,y 2 , and y 3 be solutions of this equation on an interval I. 
(a) If W = W(yi,y 2 ,y 3 ), show that 


W' = 


y i yi L3 

y'i y 2 y'i 

y'i' y’z y 2 


Hint: The derivative of a 3-by-3 determinant is the sum of three 3-by-3 determinants 
obtained by differentiating the first, second, and third rows, respectively. 

(b) Substitute for y'",y 2 , and y" from the differential equation; multiply the first row by 
P 3 , multiply the second row by p 2 , and add these to the last row to obtain 


W' = -p\(t)W. 


(c) Show that 

W(yuy 2 ,y 3 )(t) = cexp 

It follows that W is either always zero or nowhere zero on I . 

(d) Generalize this argument to the nth order equation 

y w +Pi(t)/ n - 1) + ---+ P n(t)y = 0 


-/ 


Pi(t) dt 


with solutions y 3 , . . . ,y n . That is, establish Abel’s formula 
W(yi,...,y„)(0 = cexp 

for this case. 


[-/ 


- / Pi (t) dt 


In each of Problems 21 through 24 use Abel’s formula (Problem 20) to find the Wronskian of 
a fundamental set of solutions of the given differential equation. 

21. f + 2 y" -y' -3y = 0 22. y (4) + y = 0 

23. tyt" + 2y" - / + ty = 0 24. t 2 y (4) + ty"' + y" - 4y = 0 
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25. (a) Show that the functions f(t) = t 2 \t\ and g(t) = t 3 are linearly dependent on 
0 < t < 1 and on — 1 < t < 0. 

(b) Show that /(f) andg(f) are linearly independent on — 1 < f < 1. 

(c) Show that W(f,g)(t) is zero for all t in — 1 < t < 1. 

26. Show that if y\ is a solution of 

/" + Pi(t)y" + P 2 (t)y' +p 3 (t)y = 0, 

then the substitution y = yi(t)v(t) leads to the following second order equation for v'\ 
yw'" + (3yi + piyi)v" + (3 y'[ + 2p l y' l + p 2 yi)v' = 0. 

In each of Problems 27 and 28 use the method of reduction of order (Problem 26) to solve the 
given differential equation. 

27. (2 — t)y'" + (2f — 3 )y" — ty' + y = 0, t < 2; y\(t) = e‘ 

28. f 2 (f + 3)/" - 3f(f + 2)y" + 6(1 + t)y' - 6y = 0, f > 0; y t (t)=t 2 , y 2 (t) = f 3 


4.2 Homogeneous Equations with Constant Coefficients 

Consider the nth order linear homogeneous differential equation 

L[y ] = a 0 y (n) + a 1 y (n ~ 1) H h a n _iy' + a n y = 0, (1) 

where ao.fli, . . . ,a„ are real constants. From our knowledge of second order linear 
equations with constant coefficients, it is natural to anticipate that y = e rt is a solution 
of Eq. (1) for suitable values of r. Indeed, 

L[e rt ] = e rt (a 0 r n + air n ~ x H f a n -\r + a n ) = e rt Z(r ) (2) 

for all r. where 

Z(r) = a 0 r n + air" -1 H b a„_ir + a„. (3) 

For those values of r for which Z(r) = 0, it follows that L[e rt ] = 0 and y = e rt is a 
solution of Eq. (1). The polynomial Z(r) is called the characteristic polynomial, and 
the equation Z(r) = 0 is the characteristic equation of the differential equation (1). 
A polynomial of degree n has n zeros, 1 say, rj, r 2 , . . . , r n , some of which may be equal; 
hence we can write the characteristic polynomial in the form 

Z(r) = a 0 (r - ri)(r - r 2 ) ■ ■ • (r - r n ). (4) 


1 An important question in mathematics for more than 200 years was whether every polynomial equation 
has at least one root. The affirmative answer to this question, the fundamental theorem of algebra, was 
given by Carl Friedrich Gauss (1777-1855) in his doctoral dissertation in 1799, although his proof does 
not meet modern standards of rigor. Several other proofs have been discovered since, including three 
by Gauss himself. Today, students often meet the fundamental theorem of algebra in a first course on 
complex variables, where it can be established as a consequence of some of the basic properties of complex 
analytic functions. 
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Real and Unequal Roots. If the roots of the characteristic equation are real and no two 
are equal, then we have n distinct solutions e nt ,e r2t , . . . ,e r " 1 of Eq. (1). If these 
functions are linearly independent, then the general solution of Eq. (1) is 

y — c\e nt + c 2 e'' 2 ' + f c n e r "‘ . (5) 

One way to establish the linear independence of e nt , e rit , . . . , e rnt is to evaluate their 
Wronskian determinant. Another way is outlined in Problem 40. 


EXAMPLE 

1 


Find the general solution of 

y (4) + /" _ ly" — y' + 6y = 0. (6) 

Also find the solution that satisfies the initial conditions 

y( 0) = 1, /(0) = 0, /( 0) = -2, /'( 0) = -1 (7) 

and plot its graph. 

Assuming that y = e n , we must determine r by solving the polynomial equation 

r 4 + r 3 — 7r 2 — r + 6 = 0. (8) 

The roots of this equation are r\ = 1, r 2 = —1, r 3 = 2, and r 4 = —3. Therefore the general 
solution of Eq. (6) is 

y = c\e' + c 2 e _r + c 3 e 2 ' + c 4 e~ 3 '. (9) 

The initial conditions (7) require that ci, . . . , c 4 satisfy the four equations 

Ci + c 2 + C3 T c 4 = 1, 

Ci — Co + 2 C3 — 3c 4 = 0, 

(10) 

Ci + c 2 + 4c 3 + 9c 4 = —2, 
ci — c 2 + 8c 3 — 27c 4 = —1. 


By solving this system of four linear algebraic equations, we find that 


„ _ tl 
Cl - -3-, 


^ — 5 
C 2 - 12 . 


C3 3 9 

Therefore the solution of the initial value problem is 

v = — -i- T ^ 

y 8 e ' n t 

The graph of the solution is shown in Figure 4.2.1. 


c 4 — 


l e 2t ■ 


(ii) 


y 




l 





0.5 

l 

\ 

-l 





FIGURE 4.2.1 Solution of the initial value problem of Example 1. 
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As Example 1 illustrates, the procedure for solving an nth order linear differential 
equation with constant coefficients depends on finding the roots of a corresponding 
nth degree polynomial equation. If initial conditions are prescribed, then a system 
of n linear algebraic equations must be solved to determine the proper values of the 
constants c\,...,c n . Although each of these tasks becomes much more complicated as 
n increases, they can often be handled without difficulty with a calculator or computer. 

For third and fourth degree polynomials there are formulas, 2 analogous to the 
formula for quadratic equations but more complicated, that give exact expressions for 
the roots. Root-finding algorithms are readily available on calculators and computers. 
Sometimes they are included in the differential equation solver, so that the process of 
factoring the characteristic polynomial is hidden and the solution of the differential 
equation is produced automatically. 

If you are faced with the need to factor the characteristic polynomial by hand, here 
is one result that is sometimes helpful. Suppose that the polynomial 

a 0 r" + a\r n ~ l H b a n ^r + a n — 0 (12) 

has integer coefficients. If r = p/q is a rational root, where p and q have no common 
factors, then p must be a factor of a n , and q must be a factor of «o- For example, in 
Eq. (8) the factors of a o are ±1 and the factors of a n are ±1, ±2, ±3, and ±6. Thus 
the only possible rational roots of this equation are ±1, ±2, ±3, and ±6. By testing 
these possible roots, we find that 1, —1, 2, and —3 are actual roots. In this case there 
are no other roots, since the polynomial is of fourth degree. If some of the roots are 
irrational or complex, as is usually the case, then this process will not find them, but 
at least the degree of the polynomial can be reduced by dividing the polynomial by 
the factors corresponding to the rational roots. 

If the roots of the characteristic equation are real and different, we have seen that 
the general solution (5) is simply a sum of exponential functions. For large values of t 
the solution is dominated by the term corresponding to the algebraically largest root. 
If this root is positive, then solutions become exponentially unbounded, whereas if 
it is negative, then solutions tend exponentially to zero. Finally, if the largest root 
is zero, then solutions approach a nonzero constant as t becomes large. Of course, 
for certain initial conditions, the coefficient of the otherwise dominant term may be 
zero; then the nature of the solution for large t is determined by the next largest root. 

Complex Roots. If the characteristic equation has complex roots, they must occur in 
conjugate pairs, /, ± ip, since the coefficients ao,...,a n are real numbers. Provided 
that none of the roots is repeated, the general solution of Eq. (1) is still of the form 
(5). However, just as for the second order equation (Section 3.3), we can replace the 


2 The method for solving the cubic equation was apparently discovered by Scipione dal Ferro (1465— 
1526) about 1500, although it was first published in 1545 by Girolamo Cardano (1501-1576) in his Ars 
Magna. This book also contains a method for solving quartic equations that Cardano attributes to his 
pupil Ludovico Ferrari (1522-1565). The question of whether analogous formulas exist for the roots of 
higher degree equations remained open for more than two centuries, until 1826, when Niels Abel showed 
that no general solution formulas can exist for polynomial equations of degree five or higher. A more 
general theory was developed by Evariste Galois (1811-1832) in 1831, but unfortunately it did not become 
widely known for several decades. 
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EXAMPLE 

2 


complex-valued solutions e (A+lll)t and e (l ^ l,l)t by the real-valued solutions 

e Xt cos /it, e x, sm/it (13) 

obtained as the real and imaginary parts of Thus, even though some of the 

roots of the characteristic equation are complex, it is still possible to express the 
general solution of Eq. (1) as a linear combination of real-valued solutions. 


Find the general solution of 

yW-y = Q. (14) 

Also find the solution that satisfies the initial conditions 

y(0) = 7/2, /( 0) = -4, /'( 0) = 5/2, /"( 0) = -2 (15) 

and draw its graph. 

Substituting e" for y, we find that the characteristic equation is 

r 4 — 1 = (r 2 — l)(r 2 + 1) = 0. 


Therefore the roots are r = 1, —1, i, —i, and the general solution of Eq. (14) is 

y = C\e‘ + C2e~ ( + C3 cos t + C4 sin t. 

If we impose the initial conditions (15), we find that 

ci = 0, C 2 = 3, C 3 = 1/2, C 4 = —1; 

thus the solution of the given initial value problem is 

y = 3e~' + f cost — sint. (16) 

The graph of this solution is shown in Figure 4.2.2. 

Observe that the initial conditions (15) cause the coefficient ci of the exponentially growing 
term in the general solution to be zero. Therefore this term is absent in the solution (16), 
which describes an exponential decay to a steady oscillation, as Figure 4.2.2 shows. However, 
if the initial conditions are changed slightly, then ci is likely to be nonzero, and the nature of 
the solution changes enormously. For example, if the first three initial conditions remain the 
same, but the value of y"'(0) is changed from —2 to —15/8, then the solution of the initial value 
problem becomes 

y = ^e‘ + ^e~ l + l cost — j^sint. (17) 

The coefficients in Eq. (17) differ only slightly from those in Eq. (16), but the exponentially 
growing term, even with the relatively small coefficient of 1/32, completely dominates the 
solution by the time t is larger than about 4 or 5. This is clearly seen in Figure 4.2.3, which 
shows the graphs of the two solutions (16) and (17). 
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FIGURE 4.2.2 A plot of the solution (16). 



FIGURE 4.2.3 Plots of the solutions (16) (light curve) and (17) (heavy curve). 


Repeated Roots. If the roots of the characteristic equation are not distinct — that is, 
if some of the roots are repeated — then the solution (5) is clearly not the general 
solution of Eq. (1). Recall that if ri is a repeated root for the second order linear 
equation a^y" + a\y' + a^y — 0, then two linearly independent solutions are e nt and 
te nt . For an equation of order n, if a root of Z(r) — 0, say r = r\, has multiplicity 5 
(where s < n), then 

e nt , te n ‘, t 2 e nt , ..., (18) 

are corresponding solutions of Eq. (1); see Problem 41 for a proof of this statement. 

If a complex root X + ip, is repeated s times, the complex conjugate X — ip is 
also repeated s times. Corresponding to these 2s complex-valued solutions, we can 
find 2s real-valued solutions by noting that the real and imaginary parts of e {x+lll)t , 
t e (l+l,l> ‘ , . . . , p-tgP+'Mh are a ) so linearly independent solutions: 

e lt cos pt, e xt sin pt, te k ‘ cos pt, te u sin pt. 

..., f~ l e xt cos pt, f^e 11 sin pt. 

Hence the general solution of Eq. (1) can always be expressed as a linear combination 
of n real-valued solutions. Consider the following example. 


Find the general solution of 

EXAMPLE / 4) + 2y" + y = 0. (19) 

3 

The characteristic equation is 

r 4 + 2r 2 + 1 = (r 2 + 1 )(r 2 + 1) = 0. 

The roots are r = i,i, —i, — i , and the general solution of Eq. (19) is 
y = Ci cos t + C 2 sin t + c 3 t cos t + c 4 t sin t. 
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EXAMPLE 

4 


PROBLEMS 


In determining the roots of the characteristic equation, it may be necessary to 
compute the cube roots, the fourth roots, or even higher roots of a (possibly com- 
plex) number. This can usually be done most conveniently by using Euler’s formula 
e lt = cost + i sin t and the algebraic laws given in Section 3.3. This is illustrated in 
the following example. 


Find the general solution of 

y (4) + y = 0. (20) 

The characteristic equation is 

r 4 + 1 = 0. 


To solve the equation, we must compute the fourth roots of —1. Now —1, thought of as a 
complex number, is — 1 + 0/. It has magnitude 1 and polar angle n. Thus 

—1 = cosjt + isin7r = e m . 


Moreover, the angle is determined only up to a multiple of 2n . Thus 
— 1 = cos(tt + 2mn) + isin(7r + 2mn) = 


where m is zero or any positive or negative integer. Thus 

(_l)l/4 = e iW4 +ITO /2) = CQS Q + /sin g + . 

The four fourth roots of —1 are obtained by setting m = 0, 1, 2, and 3; they are 
1 T i — 1 T i — 1 — i 1 — i 

~7T’ ~7T’ "7T 

It is easy to verify that, for any other value of m, we obtain one of these four roots. For example, 
corresponding to m = 4, we obtain (1 + i)/V2. The general solution of Eq. (20) is 


y = e' /v ^ ( ci cos * 1 + C 2 sin — — 

Jl J2. 


+ e 




c 3 cos + c 4 sin — — 


( 21 ) 


In conclusion, we note that the problem of finding all the roots of a polynomial 
equation may not be entirely straightforward, even with computer assistance. For 
instance, it may be difficult to determine whether two roots are equal or merely 
very close together. Recall that the form of the general solution is different in these 
two cases. 

If the constants inEq. (1) are complex numbers, the solution of Eq. (1) 

is still of the form (4). In this case, however, the roots of the characteristic equation 
are, in general, complex numbers, and it is no longer true that the complex conjugate 
of a root is also a root. The corresponding solutions are complex-valued. 


In each of Problems 1 through 6 express the given complex number in the form 

.R(cos0 + / sin©) = Re ,e . 

1. 1 + i 2. -1 + V3 i 

3. -3 4. -i 

5. V3-i 6. -1-/ 
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41 

41 


41 

41 

41 

41 

41 

41 

41 

41 


In each of Problems 7 through 10 follow the procedure illustrated in Example 4 to determine 
the indicated roots of the given complex number. 

7. 1 1/3 8. (1 - i) 1/2 

9. 1 1/4 10. [2(cos7r/3 + isin7r/3)] 1/2 


In each of Problems 11 through 28 find the general solution of the given differential equation. 


11. y"'-y"-/+y = 0 

13. 2y'" - 4 y" - 2y' + 4y = 0 

15. y (6) + y = 0 

17. y (6) - 3y (4) + 3 y" -y = 0 

19. y (5) - 3 y® + 3 y"' - 3 y" + 2y' = 0 

21. y® + 8y (4 > + 16y = 0 

23. /" - 5 y" + 3/ + y = 0 

25. 18/" + 21/' + 14/ + 4y = 0 

27. 12/ 4 >+ 31/"+ 75/'+ 37/+ 5y = 0 


12. /" - 3/' + 3/ - y = 0 

14. y< 4 > - 4/" + 4y" = 0 

16. y (4) - 5y" + 4y = 0 

18. y (6) - y" = 0 

20. y (4) - 8/ = 0 

22. y (4 » + 2y" + y = 0 

24. /" + 5y" + 6/ + 2y = 0 

26. y (4) - 7/" + 6y" + 30y' - 36y = 0 

28. y (4 > + 6/" + 17y" + 22/ + 14y = 0 


In each of Problems 29 through 36 find the solution of the given initial value problem, and 
plot its graph. How does the solution behave as t — > oo? 


29. 

+ 

II 

0 

y(0) = 0, y'(0) = 

= 1, /'( 0) 

= 2 



30. 

0 

II 

+ 

y(0) = 0, y'(0) = 

= 0, y"(0) 

= -1, y 

"(0) 

= 0 

31. 

y(4) _ 4y"' + 4y" 

= 0; y(l) = - 

1, /( D = 

2, y"(l) 

= 0. 

, y"'(l) = 0 

32. 

/" -/' + /- y 

= 0; y(0) = 2. 

, y'(0) = - 

■1, /'( 0) 

= - 

-2 

33. 

2y< 4 ) _ y>" _ ()y" 

+ 4/ + 4y = 0; 

y (0) = -2 

/(0) = 

: 0, 

/'( 0) = -2, 


/"( 0) = 0 






34. 

4/" + y' + 5y = 

0; y (0) = 2, 

/(0) = 1, 

y"( 0) = - 

1 


35. 

6/" + 5y" + / = 

= 0; y(0) = -2 

, /( 0) = 2 

, y"(0) = 

: 0 


36. 

y(4) + 6/" + 17/ 

+ 22y' + 14y = 0 

; y(0) = 

1, /( 0) 

= - 

■2, y" (0) = ( 


y'"(0) = 3 






37. 

Show that the general solution of y (4) — y = 0 can be written 

as 


y = Ci cos t + C2 sin t + C3 cosh t + C4 sinh t . 


Determine the solution satisfying the initial conditions y (0) = 0, y'(0) = 0, y"(0) = 1, 
y"'(0) = 1. Why is it convenient to use the solutions cosh t and sinh t rather than e‘ 
and e~‘l 

38. Consider the equation y <4) — y = 0. 

(a) Use Abel’s formula [Problem 20(d) of Section 4.1] to find the Wronskian of a funda- 
mental set of solutions of the given equation. 

(b) Determine the Wronskian of the solutions e', e~‘, cos t, and sin t. 

(c) Determine the Wronskian of the solutions cosh t, sinh t, cos t, and sin t. 

39. Consider the spring-mass system, shown in Figure 4.2.4, consisting of two unit masses 
suspended from springs with spring constants 3 and 2, respectively. Assume that there is 
no damping in the system. 

(a) Show that the displacements u\ and u 2 of the masses from their respective equilibrium 
positions satisfy the equations 


u![ + 5mi = 2u 2 , 


u 2 + 2u 2 = 2mi. 


(i) 
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FIGURE 4.2.4 A two-spring, two-mass system. 


(b) Solve the first of Eqs. (i) for u 2 and substitute into the second equation, thereby 
obtaining the following fourth order equation for u\ : 

(4 4) + lu'l + 6«i = 0. (ii) 

Find the general solution of Eq. (ii). 

(c) Suppose that the initial conditions are 

!<i(0) = 1, «i(0) = 0, z/ 2 CO) = 2, u' 2 ( 0) = 0. (iii) 

Use the first of Eqs. (i) and the initial conditions (iii) to obtain values for m"( 0) and m"'( 0). 
Then show that the solution of Eq. (ii) that satisfies the four initial conditions on ui is 
iq(f) = cos t. Show that the corresponding solution u 2 is u 2 (t) = 2 cos t. 

(d) Now suppose that the initial conditions are 

u 1 ( 0) = — 2, Mj(0) = 0, u 2 ( 0) = 1, u' 2 (0) = 0. (iv) 

Proceed as in part (c) to show that the corresponding solutions are u\(t) = — 2cos\/6 1 
and u 2 (t) = cos *J 6 1 . 

(e) Observe that the solutions obtained in parts (c) and (d) describe two distinct modes 
of vibration. In the first, the frequency of the motion is 1, and the two masses move in 
phase, both moving up or down together; the second mass moves twice as far as the first. 
The second motion has frequency V6, and the masses move out of phase with each other, 
one moving down while the other is moving up, and vice versa. In this mode the first mass 
moves twice as far as the second. For other initial conditions, not proportional to either 
of Eqs. (iii) or (iv), the motion of the masses is a combination of these two modes. 

40. In this problem we outline one way to show that if r 2 , . . . , r n are all real and different, then 
e n ‘, . . . ,e rnt are linearly independent on — oo < t < oo. To do this, we consider the linear 
relation 


cie ri ' + • • • + c n e r "‘ = 0, — oo < t < oo (i) 

and show that all the constants are zero. 

(a) Multiply Eq. (i) by e~ n ‘ and differentiate with respect to r, thereby obtaining 
c 2 (r 2 - n)^-^ + • • • + c n (r n - n)e (r "- ri)f = 0. 
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(b) Multiply the result of part (a) by e ri)t and differentiate with respect to t to obtain 

C3O-3 - r 2 )(r 3 - rM n ~ ri)t + ■ ■ ■ + c n (r„ - r 2 )(r n - = 0. 

(c) Continue the procedure from parts (a) and (b), eventually obtaining 

c n ir n - r n _i) ■■■(?„ — r 1 )e (r "~ r "- l)t = 0. 

Hence c„ = 0, and therefore 


cie" 1 ' H h c n _ 1 e rn ~ 1 ' = 0. 

(d) Repeat the preceding argument to show that c„_i = 0. In a similar way it follows that 
c „_ 2 = • • • = Ci = 0. Thus the functions e n ‘, . . . , e r "' are linearly independent. 

41. In this problem we indicate one way to show that if r — r\ is a root of multiplicity s of the 
characteristic polynomial Z(r), then e n ‘, te n ‘ , . . . , f~ 1 e n ' are solutions of Eq. (1). This 
problem extends to nth order equations the method for second order equations given in 
Problem 22 of Section 3.4. We start from Eq. (2) in the text 

L[e n ] = e r, Z{r) (i) 


and differentiate repeatedly with respect to r , setting r — r t after each differentiation. 

(a) Observe that if ri is a root of multiplicity s, then Z(r) = (r — r\) s q(r), where q(r) is 
a polynomial of degree n — s and q(r\) ^ 0. Show that Z(r\), Z'(r {), . . . , Z (s_ 1 ) (r 1 ) are all 
zero, but Z (s) (ri) 7 ^ 0. 

(b) By differentiating Eq. (i) repeatedly with respect to r, show that 


^-L[e n ] = L 
dr 



L[te r, l 


gs-i 

9C" 1 


L[e rl ] = L[f~ l e n l 


(c) Show that e n \te nl , ...,f 1 e n ‘ are solutions of Eq. (1). 


4.3 The Method of Undetermined Coefficients 


A particular solution Y of the nonhomogeneous nth order linear equation with con- 
stant coefficients 

L[y] = a Q y (n) + aiy {n ~ l) H b a„_i/ + a n y = g(t) (1) 

can be obtained by the method of undetermined coefficients, provided that g(t) is of 
an appropriate form. Although the method of undetermined coefficients is not as 
general as the method of variation of parameters described in the next section, it is 
usually much easier to use when it is applicable. 
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Just as for the second order linear equation, when the constant coefficient linear 
differential operator L is applied to a polynomial Aot m + A\t' n ~ l H + A m . an ex- 

ponential function e at , a sine function sin fit, or a cosine function cos fit, the result 
is a polynomial, an exponential function, or a linear combination of sine and cosine 
functions, respectively. Hence, if g(t) is a sum of polynomials, exponentials, sines, and 
cosines, or products of such functions, we can expect that it is possible to find Y ( t ) 
by choosing a suitable combination of polynomials, exponentials, and so forth, mul- 
tiplied by a number of undetermined constants. The constants are then determined 
by substituting the assumed expression into Eq. (1). 

The main difference in using this method for higher order equations stems from 
the fact that roots of the characteristic polynomial equation may have multiplicity 
greater than 2. Consequently, terms proposed for the nonhomogeneous part of the 
solution may need to be multiplied by higher powers of t to make them different from 
terms in the solution of the corresponding homogeneous equation. The following 
examples illustrate this. In these examples we have omitted numerous straightfor- 
ward algebraic steps, because our main goal is to show how to arrive at the correct 
form for the assumed solution. 


EXAMPLE 

1 

so the general solution of the homogeneous equation is 

y c (t) = C\e ‘ + c 2 te‘ + c 3 tV . (3) 

To find a particular solution Y ( t ) of Eq. (2), we start by assuming that Y (t) = Ad. However, 
since e‘, te‘, and t 2 e' are all solutions of the homogeneous equation, we must multiply this 
initial choice by t 3 . Thus our final assumption is that Y(t) = At 3 e‘, where A is an undeter- 
mined coefficient. To find the correct value for A, we differentiate Y (t) three times, substitute 
for y and its derivatives in Eq. (2), and collect terms in the resulting equation. In this way 
we obtain 

6Ae‘ = 4e‘. 

Thus A — | and the particular solution is 

Y(t) = §fV. (4) 

The general solution of Eq. (2) is the sum of y c {t) from Eq. (3) and Y (t) from Eq. (4), that is, 

y = c\e‘ + c 2 te' + c 3 t 2 e‘ + | fV. 


Find the general solution of 

/" - 3y" + 3/ - y = 4<?'. (2) 

The characteristic polynomial for the homogeneous equation corresponding to Eq. (2) is 

? - 3r 2 + 3r — 1 = (r — l) 3 , 


EXAMPLE 

2 


Find a particular solution of the equation 

y (4) + 2 y" + y = 3 sin t — 5 cos t . (5) 

The general solution of the homogeneous equation was found in Example 3 of Section 4.2; 
it is 


y c (t) — Ci cos t + c 2 sin t + c 3 t cos t + c 4 t sin t. 


( 6 ) 
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corresponding to the roots r = i, i, —i, and — i of the characteristic equation. Our initial 
assumption for a particular solution is Y (t) = A sin t + B cos t , but we must multiply this choice 
by t 2 to make it different from all solutions of the homogeneous equation. Thus our final 
assumption is 

Y(t) = At 2 sin t + Bt 2 cos t. 

Next, we differentiate Y (f) four times, substitute into the differential equation (4), and collect 
terms, obtaining finally 

— 8A sin t - 85 cos t = 3 sin t — 5 cos t. 

Thus A = — | , B = | , and the particular solution of Eq. (4) is 

Y(t) = — |f 2 sin r+ |t 2 cost. (7) 

If g(t) is a sum of several terms, it may be easier in practice to compute separately 
the particular solution corresponding to each term in g(t). As for the second order 
equation, the particular solution of the complete problem is the sum of the particular 
solutions of the individual component problems. This is illustrated in the following 
example. 


EXAMPLE 

3 


Find a particular solution of 

y"' — 4/ = t + 3 cos f + e~ 2 '. (8) 

First we solve the homogeneous equation. The characteristic equation is r 3 — 4 r = 0, and 
the roots are r = 0, ±2; hence 

y c (t) = Cl + c 2 e 2 ' + C 3 e~ 2 '. 

We can write a particular solution of Eq. (8) as the sum of particular solutions of the differential 
equations 

y"'-4y' = t, y'" — 4y' = 3 cos t, y'" - 4/ = e~ 2t . 

Our initial choice for a particular solution Y\ (t) of the first equation is A 0 t + A \ , but a constant 
is a solution of the homogeneous equation, so we multiply by t. Thus 

Y\(t) = t(Aot + A\). 

For the second equation we choose 

Y 2 (t) = Boost + Csinr, 

and there is no need to modify this initial choice since sinf and cost are not solutions of 
the homogeneous equation. Finally, for the third equation, since e~ 2 ' is a solution of the 
homogeneous equation, we assume that 

Y 3 (0 = Ete~ 2 '. 

The constants are determined by substituting into the individual differential equations; they 
are A 0 = — f , Ai = 0, B = 0, C = — | , and E = f . Hence a particular solution of Eq. (8) is 

Y(t) = - If 2 - |sinf+ \te~ 2 ‘. (9) 


You should keep in mind that the amount of algebra required to calculate the 
coefficients may be quite substantial for higher order equations, especially if the 
nonhomogeneous term is even moderately complicated. A computer algebra system 
can be extremely helpful in executing these algebraic calculations. 


4.3 The Method of Undetermined Coefficients 
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The method of undetermined coefficients can be used whenever it is possible to 
guess the correct form for Y(t). However, this is usually impossible for differential 
equations not having constant coefficients, or for nonhomogeneous terms other than 
the type described previously. For more complicated problems we can use the method 
of variation of parameters, which is discussed in the next section. 


PROBLEMS 


In each of Problems 1 through 8 determine the general solution of the given differential 
equation. 

1. /" - y" - y' +y = 2e~' + 3 2. y (4) - y = 3t + cost 

3. y'" + y" + y' + y = e~‘ + 4 1 4. y"’ - y' = 2 sin t 

5. y (4) - 4 y" = t 2 + e' 6. y (4) + 2y" + y = 3 4- cos 2 1 


1. y (6) + /" = t 


. +/' = sin2f 


In each of Problems 9 through 12 find the solution of the given initial value problem. Then 
plot a graph of the solution. 

9. y'" + 4 y' = t; y(0) = y'(0) = 0, y"( 0) = 1 

10. y (4) + 2 y" + y = 3t + 4; y(0) = y'( 0) = 0, y"(0) = y"'(0) = 1 

11. y"' -3y" + 2/ = t + e<; y(0) = 1, y'( 0) = — |, y"(0) = -§ 

12. y (4) + 2 y'" + y" + 8y' - 12y = 12 sin t - e~‘; y(0) = 3, y'(0) = 0, 

/'( 0) = -1, y'"( 0) = 2 


In each of Problems 13 through 18 determine a suitable form for Y (f) if the method of unde- 
termined coefficients is to be used. Do not evaluate the constants. 

13. /" - 2 y" + y = t 3 + 2e‘ 14. /" - y' = te~‘ + 2 cos t 

15. y (4) - 2 y" + y = e‘ + sin t 16. y (4) + 4y" = sin 2 1 + te‘ + 4 

17. y (4) - /" - y" + y' = F + 4 + t sin f 18. y (4) + 2y"' + 2y" = 3e' + 2te“' + e"' sin f 

19. Consider the nonhomogeneous /zth order linear differential equation 

a 0 y (n) + niy ( ' ,_1) + • • • + a n y = g(f), (i) 


where ao,...,a„ are constants. Verify that if g(f) is of the form 

e°"(b 0 t m + • • • + b m ), 


then the substitution y = e a, u(t) reduces Eq. (i) to the form 

k 0 u w + kiu (n ~ * 1) 11 + • • • + k n u = b 0 t m + • • • + b m , (ii) 

where k Q , . . . ,k n are constants. Determine k 0 and k n in terms of the as and a. Thus 
the problem of determining a particular solution of the original equation is reduced to 
the simpler problem of determining a particular solution of an equation with constant 
coefficients and a polynomial for the nonhomogeneous term. 

Method of Annihilators. In Problems 20 through 22 we consider another way of arriving at the 
proper form of Y (t) for use in the method of undetermined coefficients. The procedure is based 
on the observation that exponential, polynomial, or sinusoidal terms (or sums and products of 
such terms) can be viewed as solutions of certain linear homogeneous differential equations 
with constant coefficients. It is convenient to use the symbol D for d/dt. Then, for example, e~‘ 
is a solution of ( D + l)y = 0; the differential operator D + 1 is said to annihilate, or to be an 
annihilator ot,e~‘ . Similarly, D 2 + 4 is an annihilator of sin 2t or cos 2t, (D — 3) 2 = D 2 — 6 D + 9 
is an annihilator of e 3 ' or te 3 ‘, and so forth. 
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20. Show that linear differential operators with constant coefficients obey the commutative 
law. That is, show that 


( D — a)(D — b)f = (D — b)(D — a)f 

for any twice differentiable function / and any constants a and b. The result extends at 
once to any finite number of factors. 

21. Consider the problem of finding the form of a particular solution Y (f) of 

(D -2f(D + 1)Y = 3e 2 ’ (i) 

where the left side of the equation is written in a form corresponding to the factorization 
of the characteristic polynomial. 

(a) Show that D — 2 and ( D + l) 2 , respectively, are annihilators of the terms on the right 
side of Eq. (i), and that the combined operator (D — 2) (D + l) 2 annihilates both terms on 
the right side of Eq. (i) simultaneously. 

(b) Apply the operator (D — 2 )(D + l) 2 to Eq. (i) and use the result of Problem 20 to 
obtain 

(D -2)\D + lfY = 0. (ii) 

Thus y is a solution of the homogeneous equation (ii). By solving Eq. (ii), show that 

Y(t) = cie 2 ' + Cite 2 ' + c^fe 2 ' + C4/ 3 e 2 ' + cse~' + cete~' + cjt 2 e~‘ , (iii) 

where ci, . . . , C7 are constants, as yet undetermined. 

(c) Observe that e 2 ', te 2 ', t 2 e 2 ‘, and e~‘ are solutions of the homogeneous equation cor- 
responding to Eq. (i); hence these terms are not useful in solving the nonhomogeneous 
equation. Therefore, choose ci, C2, C3, and C5 to be zero in Eq. (iii), so that 

Y(f) = c 4 t 3 e 2 ‘ + C(,te~‘ + cjt 2 e~‘ . (iv) 

This is the form of the particular solution Y of Eq. (i). The values of the coefficients c 4 ,c 6 , 
and c 7 can be found by substituting from Eq. (iv) in the differential equation (i). 

Summary. Suppose that 

L(D)y = g(t), (v) 

where L(D) is a linear differential operator with constant coefficients, and g(t) is a sum or 
product of exponential, polynomial, or sinusoidal terms. To find the form of a particular 
solution of Eq. (v), you can proceed as follows: 

(a) Find a differential operator H(D) with constant coefficients that annihilates g(t), that is, 
an operator such that H(D)g(t) = 0. 

(b) Apply H(D) to Eq. (v), obtaining 


H(D)L(D)y = 0, (vi) 

which is a homogeneous equation of higher order. 

(c) Solve Eq. (vi). 

(d) Eliminate from the solution found in step (c) the terms that also appear in the solution of 
L(D)y = 0. The remaining terms constitute the correct form of a particular solution of Eq. (v). 

22. Use the method of annihilators to find the form of a particular solution Y(t) for each of 

the equations in Problems 13 through 18. Do not evaluate the coefficients. 


4.4 The Method of Variation of Parameters 
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The method of variation of parameters for determining a particular solution of the 
nonhomogeneous nth order linear differential equation 

L[y] = y (n) + pi(t)y (n ~ 1) h f p n ~i(t)y' + p„(t)y = g(t ) (l) 

is a direct extension of the method for the second order differential equation (see 
Section 3.6). As before, to use the method of variation of parameters, it is first nec- 
essary to solve the corresponding homogeneous differential equation. In general, 
this may be difficult unless the coefficients are constants. However, the method of 
variation of parameters is still more general than the method of undetermined coef- 
ficients in that it leads to an expression for the particular solution for any continuous 
function g, whereas the method of undetermined coefficients is restricted in practice 
to a limited class of functions g. 

Suppose then that we know a fundamental set of solutions y\, y 2 , . . . ,y n of the 
homogeneous equation. Then the general solution of the homogeneous equation is 

y c (0 = ciyi(t) + c 2 y 2 (0 H f c„y n ( t ) . (2) 

The method of variation of parameters for determining a particular solution of Eq. ( 1 ) 
rests on the possibility of determining n functions uu u 2 , . . . , u n such that Y ( t ) is of 
the form 

Y ( t ) = u\ (t)y ] ( t ) + u 2 ( t)y 2 (t) H f u n (t)y n (, t ) . (3) 

Since we have n functions to determine, we will have to specify n conditions. One 
of these is clearly that Y satisfy Eq. (1). The other n — 1 conditions are chosen 
so as to make the calculations as simple as possible. Since we can hardly expect a 
simplification in determining Y if we must solve high order differential equations for 
mi, . . . , u n , it is natural to impose conditions to suppress the terms that lead to higher 
derivatives of m, . . . , u n . From Eq. (3) we obtain 

V = («i/i + uiy' 2 H 1- u n y '„ ) + (u[yi + u' 2 y 2 H b u' n y n ), (4) 

where we have omitted the independent variable t on which each function in Eq. (4) 
depends. Thus the first condition that we impose is that 


u\yi + u' 2 y 2 -) b u' n y n = 0. (5) 

Continuing this process in a similar manner through n — 1 derivatives of Y gives 

Y <m) = MiyJ m) + M 2 y 2 m) H b u n y^ n) , m = 0, 1, 2, . . . , n - 1, (6) 

and the following n — 1 conditions on the functions u n : 

«'iy ( , m_1) + u' 2 y { 2 ~ 1) H b u' n yn l ~ l) = m = 1, 2, 1. (7) 

The nth derivative of Y is 

yw = (Mr y™ + • • • + u n yM) + (u' iy [ n ~ l) + • • ■ + u' n y^). 


( 8 ) 
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Finally, we impose the condition that Y must be a solution of Eq. (1). On substi- 
tuting for the derivatives of Y from Eqs. (6) and (8), collecting terms, and making 
use of the fact that L[y,] = 0, i = 1, 2, . . . , n, we obtain 

«i y ( r l) + «2)4 n - 1) + • • • + “>^ 1, = g. (9) 

Equation (9), coupled with the n — 1 equations (7), gives n simultaneous linear non- 
homogeneous algebraic equations for u \ , u' 2 ,..., u' n : 


y\u[ +yiu' 2 h \-y n u' n = o, 

+ F2«2 + • • • + yX = 0, 

+ F 2 M 2 "1 + y'n u 'n = 0’ (10) 


(n— 1) / . 

Fl Ml + 


+ y ( n 1) u'n=8- 


The system (10) is a linear algebraic system for the unknown quantities u' n . 

By solving this system and then integrating the resulting expressions, you can obtain 
the coefficients u\,...,u n . A sufficient condition for the existence of a solution of the 
system of equations (10) is that the determinant of coefficients is nonzero for each 
value of t. However, the determinant of coefficients is precisely Wiy\,y 2 , ■ ■ ■ 
and it is nowhere zero since y\,...,y n is a fundamental set of solutions of the ho- 
mogeneous equation. Hence it is possible to determine u [, . . . , u' n . Using Cramer’s 3 
rule, we can write the solution of the system of equations (10) in the form 


<n (0 


g(t)W m (t) 

W{t) 


m = 1,2, .. . , n. 


( 11 ) 


Here W(t) = W(y],y 2 , . . . ,y n )(t), and W,„ is the determinant obtained from W by 
replacing the m\h column by the column (0, 0, . . . , 0, 1). With this notation a particular 
solution of Eq. (1) is given by 


n r 

Y(t) = Y.ymit) / 

m= 1 to 


1 g(s)W m (s) 
W(s) 


ds , 


( 12 ) 


where to is arbitrary. Although the procedure is straightforward, the algebraic com- 
putations involved in determining Y (t) from Eq. (12) become more and more com- 
plicated as n increases. In some cases the calculations may be simplified to some 
extent by using Abel’s identity (Problem 20 of Section 4.1) 


W(t) = W(yi, . . .,y n )it) = c exp 


J Piit) dt 


The constant c can be determined by evaluating W at some convenient point. 


3 Cramer's rule is credited to Gabriel Cramer (1704-1752), professor at the Academie de Calvin in Geneva, 
who published it in a general form (but without proof) in 1750. For small systems the result had been 
known earlier. 
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EXAMPLE 

1 


Given that y\(t) = e‘, y 2 {t ) = te‘ , and y 2 (t) = e ' are solutions of the homogeneous equation 
corresponding to 

y'"-y"-y' + y = g it), (13) 


determine a particular solution of Eq. (13) in terms of an integral. 
We use Eq. (12). First, we have 


W(t ) = W(e',te',e ‘){t) 


e' te' e~‘ 

e‘ ( t + l)e' — e~‘ 
e' (t + 2)e' e~' 


Factoring e‘ from each of the first two columns and e 1 from the third column, we obtain 


W(t) = e‘ 


1 

1 

1 


t 1 
t + 1 -1 

t + 2 1 


Then, by subtracting the first row from the second and third rows, we have 


W(t) = e‘ 


1 

0 

0 


t 1 

1 -2 

2 0 


Finally, evaluating the latter determinant by minors associated with the first column, we find 
that 

W(t) = 4e‘ . 


Next, 

0 te' e 

Wi(t) = 0 (r+l)e' — e 

1 (f + 2)e' e 

Using minors associated with the first column, we obtain 

te' e~‘ 

(t + l)e' — e~‘ 


Wi(0 = 


= —2t — 1. 


In a similar way 


and 


W 2 (t) = 


0 e~ 

0 -e~ 

1 e~ 


= 2 , 


e' te' 0 
W 3 (t) = e' (r+l)e ( 0 

e' (; t + 2)e‘ 1 

Substituting these results in Eq. (12), we have 


e te 
e' ( t + l)e' 


Y(t) = f> 


I 

Jtn 


' g(5 )(-l - 2 s) 
4e s 


ds + te‘ 


, f gm 2 ) 


/ 

J to 


4e s 


ds + e 


f 

J to 


g{s)e 2: 

4e s 


ds 


= \ f {e‘ s [-l + 2(t - j)] + e ( ' s) } g(s) ds. 

4 Jtn 
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PROBLEMS In each of Problems 1 through 6 use the method of variation of parameters to determine the 
general solution of the given differential equation. 

1. y'" + y' = tan t, 0 < t < n 2. y'" — y' = t 

3. /" - 2 y" - y' + 2y = e 4 ' 4. yf" + / = sec t, -n / 2 < t < n /2 

5. y'" — y" + y — y = e - ' sin f 6. y (4) + 2y" + y = sin t 

In each of Problems 7 and 8 find the general solution of the given differential equation. Leave 
your answer in terms of one or more integrals. 

7. y'" — y" + y' — y = sect, —n/2 < t < jt/2 

8. y'" — y' — esc t, 0 < t < n 

In each of Problems 9 through 12 find the solution of the given initial value problem. Then 
plot a graph of the solution. 

9. /" + y = sec f; y( 0) = 2, y'(0) = 1, y"( 0) = -2 

10. y (4) + 2y" + y = sin f; y(Q) = 2, / (0) = 0, y"( 0) = - 1 , y"'(0) = 1 

11. /"-/' + / -y = sect; y( 0) = 2, y'(0) = -1, y"(0) = 1 

12. /" -/ = esc f; y(tr/2) = 2, /(jt/2) = 1, y"(n/2) = -1 

13. Given that x,x 2 , and 1/x are solutions of the homogeneous equation corresponding to 

x 3 y'" + x 2 y" — 2xy' + 2 y = 2x 4 , x > 0, 
determine a particular solution. 

14. Find a formula involving integrals for a particular solution of the differential equation 

/" - / + / -y = g(t )• 

15. Find a formula involving integrals for a particular solution of the differential equation 

y^-y = g(t). 

Hint: The functions sin f , cos t, sinh f , and cosh t form a fundamental set of solutions of the 
homogeneous equation. 

16. Find a formula involving integrals for a particular solution of the differential equation 

/" — 3y" + 3y' — y = g{t). 

Ifg(t) = t~ 2 e‘, determine Y(t). 

17. Find a formula involving integrals for a particular solution of the differential equation 

x 3 y"' — 3 x 2 y" + 6 xy' — 6 y = g(x), x > 0. 

Hint: Verify that x,x 2 , and x 3 are solutions of the homogeneous equation. 


REFERENCES Coddington, E. A. ,An Introduction to Ordinary Differential Equations (Englewood Cliffs, NJ: Prentice- 
Hall, 1961; New York: Dover, 1989). 

Ince, E. L„ Ordinary Differential Equations (London: Longmans, Green, 1927; New York: Dover, 1953). 


CHAPTER 


Series Solutions 
of Second Order 
Linear Equations 


Finding the general solution of a linear differential equation depends on determining 
a fundamental set of solutions of the homogeneous equation. So far, we have given 
a systematic procedure for constructing fundamental solutions only if the equation 
has constant coefficients. To deal with the much larger class of equations that have 
variable coefficients, it is necessary to extend our search for solutions beyond the 
familiar elementary functions of calculus. The principal tool that we need is the 
representation of a given function by a power series. The basic idea is similar to 
that in the method of undetermined coefficients: we assume that the solutions of a 
given differential equation have power series expansions, and then we attempt to 
determine the coefficients so as to satisfy the differential equation. 


5.1 Review of Power Series 


In this chapter we discuss the use of power series to construct fundamental sets of 
solutions of second order linear differential equations whose coefficients are func- 
tions of the independent variable. We begin by summarizing very briefly the pertinent 
results about power series that we need. Readers who are familiar with power series 
may go on to Section 5.2. Those who need more details than are presented here 
should consult a book on calculus. 
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OO 

1. A power series a n( x — x o) n is said to converge at a point x if 

n = o 

m 

lim } a n (x — xo) n 

m— >oo 

n = 0 

exists for that x. The series certainly converges for x — x 0 ; it may converge for all x, or it 
may converge for some values of x and not for others. 

OO 

2. The series Y a n (x — xo) n is said to converge absolutely at a point x if the series 

n = 0 

OO OO 

^|a„(x-x 0 )''| = ^K||x-x 0 r 

n=0 n = 0 


converges. It can be shown that if the series converges absolutely, then the series also 
converges; however, the converse is not necessarily true. 

3. One of the most useful tests for the absolute convergence of a power series is the ratio 
test. If a n 7 ^ 0, and if, for a fixed value of x. 


a n+ 1 (x-x 0 )' ,+1 

lim 

n ~* °° a n (x-x 0 )" 


= \x — x 0 | lim 

n—*oo 


Cln+1 

CL n 


\x-x a \L, 


then the power series converges absolutely at that value of x if \x — x 0 \L < 1 and diverges 
if \x — x 0 \L > 1. If \x — x 0 | L = 1, the test is inconclusive. 


EXAMPLE 

1 


For which values of x does the power series 

OO 

J2 (— 1 ) n+1 n(x - 2) n 

n = 1 


converge? 

To test for convergence, we use the ratio test. We have 


lim 

n— >oo 


(— 1)” +2 (/7 + l)(x — 2)" +1 
(— l)' 1+1 «(x — 2) n 


|x — 2| lim 

n — ►oo 


n+ 1 
n 


|x-2|. 


According to statement 3, the series converges absolutely for |x — 2| < 1, or 1 < x < 3, and 
diverges for |x — 2| > 1. The values of x corresponding to |x — 2| = 1 arex = 1 andx = 3. The 
series diverges for each of these values of x since the nth term of the series does not approach 
zero as n — > oo. 


OO 

4. If the power series ^a n (x—xo) n converges at x = xi, it converges absolutely for 

n = 0 

|x — x 0 | < |xi — x 0 |; and if it diverges at x = x\, it diverges for |x — x 0 | > |xi — x 0 |. 

5. There is a nonnegative number p, called the radius of convergence, such that 

OO 

a n (x — x 0 ) n converges absolutely for \x — x 0 \ < p and diverges for |x — x 0 | > p. For 

n = 0 

a series that converges only at xo, we define p to be zero; for a series that converges for 
all x, we say that p is infinite. If p > 0, then the interval |x — x 0 | < p is called the interval 
of convergence; it is indicated by the hatched lines in Figure 5.1.1. The series may either 
converge or diverge when |x — xq| = p. 
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Series | 
diverges 

u v; 

x 0 -p 


Series 

converges 

absolutely 

x Q ' 


Series 

diverges 


x 0 + p 



FIGURE 5.1.1 


\ Series may / 
converge or diverge 

The interval of convergence of a power series. 


EXAMPLE 

2 


Determine the radius of convergence of the power series 


E 

n = 1 


(x + 1)' 1 
n2 n 


We apply the ratio test: 


lim 

n — »oo 


(x + 1)" +1 n2 n 
( n + 1)2" +1 (x + l) n 


|x + 1| .. n |x T 1| 
lim = 

2 n^oo n . + 1 2 


Thus the series converges absolutely for |x+l|<2, or — 3 < x < 1, and diverges for 
|x T 1 1 >2. The radius of convergence of the power series is p = 2. Finally, we check the 
endpoints of the interval of convergence. At x = 1 the series becomes the harmonic series 


OO . 

E ’ 


which diverges. At x = — 3 we have 


E 

n= 1 


(-3 + 1)” 
n 2 n 


E 

n = 1 


(-D n 

n 


which converges but does not converge absolutely. The series is said to converge conditionally 
at x = —3. To summarize, the given power series converges for — 3 < x < 1 and diverges 
otherwise. It converges absolutely for — 3 < x < 1 and has a radius of convergence 2. 


OO OO 

Suppose that a n( x — x o)" an d b n (x — xo)” converge to /(x) and g(x), respec- 

n=Q n=0 

tively, for |x — xo| < p, p > 0. 

6. The series can be added or subtracted termwise, and 

OO 

/(X) ± g(x) = ± b„)(x - Xo)": 

n—0 


the resulting series converges at least for |x — x 0 | < p. 

7. The series can be formally multiplied, and 

OO 

= £ c „(x-x 0 r, 

n = 0 


/(x)g(x) = 


^a„(x-x 0 )" 


_n=0 


J^b n (x~x 0 ) n 


_ _n = 0 


where c n = a 0 b n + a\b n _i + ■ ■ ■ + a n b 0 . The resulting series converges at least for 
|x Xq | < p. 
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Further, if g(xo ) / 0, the series can be formally divided, and 


fix) 

g(x) 


OO 

Y]d n (x -x 0 ) n . 

n = 0 


In most cases the coefficients cl n can be most easily obtained by equating coefficients in 
the equivalent relation 


OO 

Y. a ni X - X 0 ) n 
n = 0 


Y dn{X - X 0 ) n 

_n = 0 


Y, b niX ~ Xp)" 
_n = 0 


n = 0 \k = 0 


— Y, ( Y, dkbn-l 


(■ x-x 0 ) n . 


In the case of division the radius of convergence of the resulting power series may be less 
than p. 

8. The function / is continuous and has derivatives of all orders for \x — x Q \ < p. Further,/', 
/", . . . can be computed by differentiating the series termwise; that is, 

fix) = a\ + 2 a 2 (x — jc 0 ) H 1- na„(x - x 0 ) n_1 H 

OO 

= Y na '< (x - Xo) n ~\ 

n = 1 

fix) = 2a 2 + 6a 3 (x — Xq) + • • • + n(n — 1 )a n (x — xo) n ~ 2 + • • • 

OO 

= Y, n i ,Z — 1 ) a n(x ~ Xo)"~ 2 , 
n=2 


9. 


and so forth, and each of the series converges absolutely for \x — x 0 \ < p. 
The value of a n is given by 

f n) ix o) 


The series is called the Taylor 1 series for the function / about x = xq. 

OO OO 

10. If a nix — xo) n = fb n {x — xo) n for each x in some open interval with center xq, then 


n = o 


n= 0 


a n = b„ for n = 0, 1, 2, 3, In particular, if f a„{x — xo)" = 0 for each such x, then 

n = o 

a 0 = a.\ = ■ ■ ■ = a n — ■ ■ ■ — 0. 


A function / that has a Taylor series expansion about x — xq 

ft \ f tn) i x o) n 

ff ) = > ; — ix - X 0 ) , 

' n\ 

n = 0 


with a radius of convergence p > 0, is said to be analytic at x — x Q . All of the familiar 
functions of calculus are analytic except perhaps at certain easily recognized points. 


^rook Taylor (1685-1731) was the leading English mathematician in the generation following Newton. 
In 1715 he published a general statement of the expansion theorem that is named for him, a result that is 
fundamental in all branches of analysis. He was also one of the founders of the calculus of finite differences 
and was the first to recognize the existence of singular solutions of differential equations. 
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EXAMPLE 

3 


EXAMPLE 

4 


For example, sin x and e x are analytic everywhere, 1 /x is analytic except at x = 0, and 
tanx is analytic except at odd multiples of tt/ 2. According to statements 6 and 7, if 
/ and g are analytic at xq, then/ ± g,f ■ g, and f/g [provided that g(.vo) 9^ 0] are also 
analytic at x = x 0 . In many respects the natural context for the use of power series 
is the complex plane. The methods and results of this chapter nearly always can be 
directly extended to differential equations in which the independent and dependent 
variables are complex-valued. 


Shift of Index of Summation. The index of summation in an infinite series is a dummy 
parameter just as the integration variable in a definite integral is a dummy variable. 
Thus it is immaterial which letter is used for the index of summation. For example. 


00 ,m )n v n 



n = 0 


oo 


E 


2 ixi 

~T 


Just as we make changes of the variable of integration in a definite integral, we find 
it convenient to make changes of summation indices in calculating series solutions of 
differential equations. We illustrate by several examples how to shift the summation 
index. 


OO 

Write a nX n as a series whose first term corresponds to n = 0 rather than n = 2. 

n=2 

Let m = n — 2; then n — m + 2, and n = 2 corresponds to m — 0. Hence 

OO OO 

^OnX" = J2 a m+2X m+2 . (1) 

n=2 m = 0 

By writing out the first few terms of each of these series, you can verify that they contain 
precisely the same terms. Finally, in the series on the right side of Eq. (1 ), we can replace the 
dummy index m by n , obtaining 

OO OO 

^2 a n x n = ^2 a„+ 2 X n+2 ■ (2) 

n=2 n = 0 

In effect, we have shifted the index upward by 2 and have compensated by starting to count 
at a level 2 lower than originally. 


Write the series 

OO 

’^2(n + 2)(n + l)a„(x - x 0 ) n ~ 2 (3) 

n= 2 

as a series whose generic term involves (x — x 0 )" rather than ( x — x 0 ) n ~ 2 . 

Again, we shift the index by 2 so that n is replaced by n + 2 and start counting 2 lower. We 
obtain 

OO 

^2(n + 4)(n + 3)a n+2 (x - x 0 ) n . (4) 

n = 0 

You can readily verify that the terms in the series (3) and (4) are exactly the same. 
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EXAMPLE 

5 


EXAMPLE 

6 


Write the expression 


x 2 y ( r + n)a n x r+n 1 

n = 0 

as a series whose generic term involves x r+n . 

First, take the x 2 inside the summation, obtaining 


y (/' + ri)a n x r+n+1 . 


n = 0 

Next, shift the index down by 1 and start counting 1 higher. Thus 


OO 


OO 


(5) 


( 6 ) 


y (r + n)a n x r+n+1 = y (r + n — l)a„^ 1 x r+n . (7) 

n= 0 n= 1 

Again, you can easily verify that the two series in Eq. (7) are identical and that both are exactly 
the same as the expression (5). 


Assume that 

OO OO 

y na„x n ~ 1 — y a„x n (8) 

H = 1 H=0 

for all x, and determine what this implies about the coefficients a n . 

We want to use statement 10 to equate corresponding coefficients in the two series. In order 
to do this, we must first rewrite Eq. (8) so that the series display the same power of x in their 
generic terms. For instance, in the series on the left side of Eq. (8), we can replace n by n + 1 
and start counting 1 lower. Thus Eq. (8) becomes 

OO OO 

y (n + l)a n+1 x n = y a„x n . (9) 

n= 0 n=0 

According to statement 10, we conclude that 

(n + l)a„+i = a n , n = 0, 1, 2, 3, . . . 


or 

Cln+l — — “Tj 71 = 0,1,2,3,.... (10) 

n + 1 

Hence, choosing successive values of n in Eq. (10), we have 

U\ CIq CI2 CIq 

Cl l = CIq, Cl 1 — — — — , Cl 2 = — = — , 

2 2 3 3! 

and so forth. In general, 

a„=~. n = 1,2,3, (11) 

n\ 

Thus the relation (8) determines all the following coefficients in terms of ao. Finally, using the 
coefficients given by Eq. (11), we obtain 


y a n x n 

n= 0 


X 

a 0 / , T 

— ' n 

n = 0 


a 0 e x . 


where we have followed the usual convention that 0! = 1. 
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PROBLEMS 


In each of Problems 1 through 8 determine the radius of convergence of the given power series. 


1. E(*-3)" 

n = 0 
oo 

3- E — 

„=o «! 

5 ~ (2x + 1 )" 

' rS 1 n 2 

- (-l)»n 2 (x + 2 r 

' h 3" 


2 T —x n 
' ^ 
n = 0 z 

oo 

4. E2'V 

n=0 


6. E 

n=l 


(■* - *o) n 


“ «!x" 

n=l 


In each of Problems 9 through 16 determine the Taylor series about the point xo for the given 
function. Also determine the radius of convergence of the series. 


sinx, xo = 0 

10. 

e\ 

o 

II 

o 

* 

X, Xq = 1 

12. 

x 2 . 

x 0 = -1 

lnx, xo = 1 

14. 

1 

o 

II 

o 

* 

1 + x’ 


16. 

1 

xo = 2 

1 — X 

1 — x’ 

oo 

Given that y = E wx", compute y' and y" 

n= 0 

and write out the first : 


15. 


as well as the coefficient of x" in the general term. 

OO 

18. Given that y = E a nX n , compute / and y" and write out the first four terms of each 

n = 0 

series as well as the coefficient of x n in the general term. Show that if y" = y, then the 
coefficients a 0 and a\ are arbitrary, and determine a 2 and o 3 in terms of a 0 and a\. Show 
that a n+ 2 = a„/(n + 2)(n + l), n = 0, 1, 2, 3, 

In each of Problems 19 and 20 verify the given equation. 

OO OO 

19. E a n(x - 1)" +1 = E a n -\{x - 1)" 


20. E a k+ ix k + E a k x k+1 = a t + E («*+i + a k -\)x k 

k = 0 k=0 k = 1 


In each of Problems 21 through 27 rewrite the given expression as a sum whose generic term 
involves x n . 


21. E n ( n ~ l)n«x" 

n=2 


22. J^a n x n+2 


23. x E na„x n 1 + E a kX k 

n = 1 jt=0 


24. (1 — x 2 ) E n ( n ~ 1 )a„x n 

n=2 


25. E m(m — 1 )a m x m 2 + x E ka k x k 

m=2 k=l 


26. E na n x n 1 + x E a nX n 

n= 1 n= 0 


27. x E n ( n — 1 )u«x" 2 + E 

n=2 n=0 
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28. Determine the a„ so that the equation 

oo oo 

T; na n x n ~ l + 2 ^ a n x n = 0 

n = 1 n = 0 


oo 

is satisfied. Try to identify the function represented by the series ^ a n x n . 

n = 0 


5.2 Series Solutions Near an Ordinary Point, Part I 

In Chapter 3 we described methods of solving second order linear differential equa- 
tions with constant coefficients. We now consider methods of solving second order 
linear equations when the coefficients are functions of the independent variable. In 
this chapter we will denote the independent variable by x. It is sufficient to consider 
the homogeneous equation 

d 2 y dy 

P ^dx? + +R ^ y = °’ 0 ) 

since the procedure for the corresponding nonhomogeneous equation is similar. 

Many problems in mathematical physics lead to equations of the form (1) having 
polynomial coefficients; examples include the Bessel equation 

x 2 y" + xy' + (x 2 - v 2 )y = 0, 

where v is a constant, and the Legendre equation 

(1 — x 2 )y" — 2 xy' + a( a + 1 )y — 0, 

where a is a constant. For this reason, as well as to simplify the algebraic com- 
putations, we primarily consider the case in which the functions P, Q, and R are 
polynomials. However, as we will see, the method of solution is also applicable when 
P, Q, and R are general analytic functions. 

For the present, then, suppose that P, Q , and R are polynomials and that they have 
no common factors. Suppose also that we wish to solve Eq. (1) in the neighborhood 
of a point x 0 . The solution of Eq. (1) in an interval containing x 0 is closely associated 
with the behavior of P in that interval. 

A point jt'o such that P(x o) ^ 0 is called an ordinary point. Since P is continuous, it 
follows that there is an interval about xq in which P(x) is never zero. In that interval 
we can divide Eq. (1) by P{x) to obtain 

y" + p(x)/ + q(x)y = 0, (2) 

where p(x) = Q(x)/P(x) and q(x) = R(x)/P(x) are continuous functions. Hence, 
according to the existence and uniqueness Theorem 3.2.1, there exists in that inter- 
val a unique solution of Eq. (1) that also satisfies the initial conditions y(xo) = >’o, 
y'(xo) = >’() for arbitrary values of yo and y' 0 . In this and the following section we 
discuss the solution of Eq. (1) in the neighborhood of an ordinary point. 

On the other hand, if P(x o) = 0, then xq is called a singular point of Eq. (1). In 
this case at least one of Q(x o) and R(x o) is not zero. Consequently, at least one of 
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the coefficients p and q in Eq. (2) becomes unbounded asi^- xq, and therefore 
Theorem 3.2.1 does not apply in this case. Sections 5.4 through 5.7 deal with finding 
solutions of Eq. (1) in the neighborhood of a singular point. 

We now take up the problem of solving Eq. (1) in the neighborhood of an ordinary 
point xq. We look for solutions of the form 


OO 

y = a 0 + a\(x - x 0 ) H V a„(x - x 0 ) n H = ^ a n (x - x 0 ) n (3) 

n = 0 

and assume that the series converges in the interval \x — xol < p for some p > 0. 
While at first sight it may appear unattractive to seek a solution in the form of a 
power series, this is actually a convenient and useful form for a solution. Within their 
intervals of convergence, power series behave very much like polynomials and are 
easy to manipulate both analytically and numerically. Indeed, even if we can obtain 
a solution in terms of elementary functions, such as exponential or trigonometric 
functions, we are likely to need a power series or some equivalent expression if we 
want to evaluate them numerically or to plot their graphs. 

The most practical way to determine the coefficients a n is to substitute the series 
(3) and its derivatives fory,y', andy" in Eq. (1). The following examples illustrate this 
process. The operations, such as differentiation, that are involved in the procedure 
are justified so long as we stay within the interval of convergence. The differential 
equations in these examples are also of considerable importance in their own right. 


EXAMPLE 

1 


Find a series solution of the equation 


y" + y = 0, — oo < x < oo. (4) 

As we know, a fundamental set of solutions of this equation are sinx and cosx, so series 
methods are not needed to solve this equation. However, this example illustrates the use of 
power series in a relatively simple case. For Eq. (4), P(x) = 1, Q(x) = 0, and R(x) = 1; hence 
every point is an ordinary point. 

We look for a solution in the form of a power series about x Q = 0 


OO 

y — ciq U\X -\- CI 2 X 2 + • • • + ci n x n + • • • = ^ ' a n x n (5) 

n = 0 

and assume that the series converges in some interval |x| < p. Differentiating Eq. (5) term by 
term yields 


oo 

y' = a\ + 2 ci 2 X + • • • + na n x n ~ l + • • • = na n x n ~ l , (6) 

n=l 

oo 

y" = la 2 H + n(n — l)a n x n ~ 2 + • • • = ^ n{n — 1 )a n x n ~ 2 . (7) 

n=2 

Substituting the series (5) and (7) for y and y" in Eq. (4) gives 

OO OO 

y2 n ( n — i )a„x n ~ 2 + y2 a n x " = o. 

n= 2 n = 0 
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To combine the two series, we need to rewrite at least one of them so that both series display 
the same generic term. Thus, in the first sum, we shift the index of summation by replacing n 
by n + 2 and starting the sum at 0 rather than 2. We obtain 

OO OO 

Y ( n + 2 )(n + l)a n + 2 X n + ^ a n x n = 0 

n = 0 n=0 


or 

OO 

Y [( n + 2 )(n + 2 + a n \ x n = o. 

n = 0 


For this equation to be satisfied for all x, the coefficient of each power of x must be zero; hence 
we conclude that 

(n + 2)(n + l)a „+2 + a„ = 0, « = 0, 1,2,3, (8) 

Equation (8) is referred to as a recurrence relation. The successive coefficients can be 
evaluated one by one by writing the recurrence relation first for n = 0, then for n = 1, and 
so forth. In this example Eq. (8) relates each coefficient to the second one before it. Thus 
the even-numbered coefficients ( a 0 ,a 2 ,a 4 , . . .) and the odd-numbered ones (a 1; 03 , as, ■ ■ .) are 
determined separately. For the even-numbered coefficients we have 


CIq CIq CI2 CIq 

a2= ~ 2^1 —— 21 ’ a * = ~ 4 ^ 3 =+ 4 \' 


Cl6 — 


#4 

(T”5 


«o 

6 !’ 


These results suggest that in general, if n = 2k, then 


Clf 1 — ^2 k — 


(~i) fc 
(2 k)\ 


■flo, 


k = 1,2,3,.... 


( 9 ) 


We can prove Eq. (9) by mathematical induction. First, observe that it is true for k = 1. Next, 
assume that it is true for an arbitrary value of k and consider the case k + 1. We have 


aik 

a2k+2 ~ ~ (2k + 2)(2k + 1) 


(-D* 

{2k + 2)(2k + l)(2k)\ a ° 


(2k + 2)1°°' 


Hence Eq. (9) is also true for k + 1, and consequently it is true for all positive integers k. 
Similarly, for the odd-numbered coefficients 

Cl\ d\ (I3 d\ d5 d\ 

“ 3 = ~ 2^3 = _ 3 [’ ° 5 = ^ 5^4 = + 5 !"’ “ 7 = “ 7T = “ V ." ' " ’ 

and in general, if n — 2k + 1, then 2 


a„ = aik+i = 


(-1)" 

(2k + I)! 01 ’ 


k — 1,2,3,.... 


Substituting these coefficients into Eq. (5), we have 


( 10 ) 


2 The result given in Eq. (10) and other similar formulas in this chapter can be proved by an induction 
argument resembling the one just given for Eq. (9). We assume that the results are plausible and omit the 
inductive argument hereafter. 
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CIO l fl l 3 . a 0 4 , a l 5 

V = cin T «ix x~ .V H x + — .r 

* u 1 ?! ?! 4! S! 




= «0 


(2 n)! 


1_ 2! + 4! + ‘ 


( 2 « + 1 )! 




+ d\ 


X ~ 3! + 5! + ' 


(2m)! 


+ ^1^ 2 „ +1 + . 


(2m + 1)! 


^ (-D" 2« , V' (“I)" 2n+l 

= a ^2l^ny x ~ + a 'Z2~- ^ 


(2m + 1)! 


( 11 ) 


Now that we have formally obtained two series solutions of Eq. (4), we can test them for 
convergence. Using the ratio test, we can show that each of the series in Eq. (11) converges 
for all x, and this justifies retroactively all the steps used in obtaining the solutions. Indeed, 
we recognize that the first series in Eq. (11) is exactly the Taylor series for cosx about x = 0 
and that the second is the Taylor series for sinx about x = 0. Thus, as expected, we obtain the 
solution y — a 0 cosx + ci\ sinx. 

Notice that no conditions are imposed on a 0 and a\, hence they are arbitrary. From Eqs. (5) 
and (6) we see that y and / evaluated at x = 0 are ao and a\, respectively. Since the initial 
conditions y (0) and y'( 0) can be chosen arbitrarily, it follows that a 0 and ai should be arbitrary 
until specific initial conditions are stated. 

Figures 5.2.1 and 5.2.2 show how the partial sums of the series in Eq. (11) approximate cosx 
and sinx. As the number of terms increases, the interval over which the approximation is 
satisfactory becomes longer, and for each x in this interval the accuracy of the approximation 
improves. However, you should always remember that a truncated power series provides only 
a local approximation of the solution in a neighborhood of the initial point x = 0; it cannot 
adequately represent the solution for large |x|. 



FIGURE 5.2.1 Polynomial approximations to cosx. The value of n is the degree of the 
approximating polynomial. 
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y 

2 

n = 5 n = 9 n = 

13 71 = 

17 re = 21 

l 

II 

1 

1 1 


2 4 6/ 

8 

10 x 

-l 



y = sin x 

-2 

n = 3 n = l n = ll 

n = 15 

n = 19 


FIGURE 5.2.2 Polynomial approximations to sinx. The value of n is the degree of the 
approximating polynomial. 


In Example 1 we knew from the start that sinx and cos x form a fundamental set of 
solutions of Eq. (4). However, if we had not known this and had simply solved Eq. (4) 
using series methods, we would still have obtained the solution (11). In recognition 
of the fact that the differential equation (4) often occurs in applications, we might 
decide to give the two solutions of Eq. (11) special names, perhaps 

°° ( 1 \n °° ( 1 \n 

C (x) = - — — -x 2 ", s(x) = — - — — — x 2n+i . ( 12 ) 

^ (2 n)\ ^ (2n + 1 )! y > 

n= 0 n= 0 

Then we might ask what properties these functions have. For instance, can we be 
sure that C(x) and ,S'(x) form a fundamental set of solutions? It follows at once from 
the series expansions that C(0) = 1 and 5(0) = 0. By differentiating the series for 
C(x) and 5(x) term by term, we find that 

S'(x) = C(x), C'(x) = —S(x). (13) 


Thus, at x = 0 we have S'(0) = 1 and C'(0) = 0. Consequently, the Wronskian of C 
and S at x = 0 is 


w(c,sm 


1 0 
0 1 


= 1 , 


(14) 


so these functions do indeed form a fundamental set of solutions. By substituting — x 
for x in each of Eqs. (12), we obtain C(— x) = C(x), and S(— x) = — 5(x). Moreover, 
by calculating with the infinite series, 3 we can show that the functions C(x) and S(x) 
have all the usual analytical and algebraic properties of the cosine and sine functions, 
respectively. 


3 Such an analysis is given in Section 24 of K. Knopp, Theory and Applications of Infinite Series (New York: 
Hafner, 1951). 
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Although you probably first saw the sine and cosine functions defined in a more 
elementary manner in terms of right triangles, it is interesting that these functions 
can be defined as solutions of a certain simple second order linear differential equa- 
tion. To be precise, the function sinx can be defined as the unique solution of the 
initial value problem y" + y = 0, y(0) = 0, y'{ 0) = 1; similarly, cosx can be defined 
as the unique solution of the initial value problem y" + y = 0, y(0) = 1, / (0) = 0. 
Many other functions that are important in mathematical physics are also defined as 
solutions of certain initial value problems. For most of these functions there is no 
simpler or more elementary way to approach them. 


EXAMPLE 

2 


Find a series solution in powers of x of Airy’s 4 equation 

y" — xy = 0, —oo < x < oo. (15) 

For this equation P(x) = l,Q(x) = 0,andf?(x) = — x; hence every point is an ordinary point. 
We assume that 

OO 

y = y a n x n (16) 

n= 0 

and that the series converges in some interval \x\ < p. The series for y" is given by Eq. (7); as 
explained in the preceding example, we can rewrite it as 

OO 

y" = y (ft + 2) (« + l)a n+2 x n . (17) 

n = 0 

Substituting the series (16) and (17) for y and y" in Eq. (15), we obtain 

oo oo oo 

y (ft + 2) (ft + l)a„ +2 x” = x y a n x" — y fl„x" +1 . (18) 

71=0 71=0 71=0 


Next, we shift the index of summation in the series on the right side of Eq. (18) by replacing 
n by ft — 1 and starting the summation at 1 rather than zero. Thus we have 

OO OO 

2 • lfl 2 + + 2)(« + l)a n+2 x" = a n _ix n . 

71 = 1 71=1 

Again, for this equation to be satisfied for all x in some interval, the coefficients of like powers 
of x must be equal; hence a 2 = 0, and we obtain the recurrence relation 


(ft + 2)(ft + l)a„ +2 = a„_\ for ft = 1,2,3,.... 


(19) 


Since a n+2 is given in terms of «„_i, the as are determined in steps of three. Thus a 0 
determines a 2 , which in turn determines . . . ;ai determines a 4 , which in turn determines 
« 7 , . . . ; and a 2 determines a s , which in turn determines a s ,.... Since a 2 = 0, we immediately 
conclude that a s — a 8 = a n = ■ ■ ■ = 0. 


4 Sir George Biddell Airy (1801-1892), an English astronomer and mathematician, was director of the 
Greenwich Observatory from 1835 to 1881. One reason why Airy’s equation is of interest is that for 
x negative the solutions are oscillatory, similar to trigonometric functions, and for x positive they are 
monotonic, similar to hyperbolic functions. Can you explain why it is reasonable to expect such behavior? 
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For the sequence 00,03,05,09, ... we set « = 1,4,7, 10, . . . in the recurrence relation: 

CIq CI3 CIq CIq CIq 

@3 ^ ^ 5 @6 ^ ^ ^ ^ ^ ^ 9 g ^ ^ ^ ^ ^ g ^ 9 * • • • 

These results suggest the general formula 

a ° „ ^ A 
a3n ~ 2 • 3 ■ 5 ■ 6 • • ■ (3n — l)(3n) ’ " ~ ' 

For the sequence 01,04,03,010, . . . , we set n = 2,5,8,11, ... in the recurrence relation: 

^4 CL-j Cl\ 

° A — 3^4’ " 7 = (Y7 = 3 ■ 4 • 6 • 7 ’ 010 = 9 • 10 = 3 • 4 • 6 • 7 • 9 • 10’ " " 

In general, we have 

Ol 

ain+1 ~ 3 • 4 • 6 • 7 • • ■ (3n)(3« + 1) ’ " ~ ' 


Thus the general solution of Airy’s equation is 


y = a 0 


v 3n 


1 + 


2-3 2 • 3 • 5 • 6 


■ + 


+ a\ 


2 • 3 • • • (3n — l)(3n) 

3n+l 


4 7 

X X X 

X + “h ■ ■ ■ “h 

3-4 3 • 4 • 6 • 7 3-4---(3w)(3h + 1) 


( 20 ) 


Having obtained these two series solutions, we can now investigate their convergence. Be- 
cause of the rapid growth of the denominators of the terms in the series (20), we might expect 
these series to have a large radius of convergence. Indeed, it is easy to use the ratio test to 
show that both these series converge for all x ; see Problem 20. 

Assuming for the moment that the series do converge for all x, let yi and y 2 denote the 
functions defined by the expressions in the first and second sets of brackets, respectively, in 
Eq. (20). Then, by choosing first ao = 1, fli = 0 and then « 0 = 0, a\ = 1 , it follows that y\ 
and y 2 are individually solutions of Eq. (15). Notice that y\ satisfies the initial conditions 
jVi (0) = 1, y'jIO) = 0 and that y 2 satisfies the initial conditions >>2(0) = 0, y' 2 (0) = 1. Thus 
W(yi,y2)(0) = 1/0, and consequently y 1 and y 2 are a fundamental set of solutions. Hence 
the general solution of Airy’s equation is 

y — ooyi(x) + aiy 2 (x), —00 < x < 00 . 

In Figures 5.2.3 and 5.2.4, respectively, we show the graphs of the solutions yi and y 2 of 
Airy’s equation, as well as graphs of several partial sums of the two series in Eq. (20). Again, 
the partial sums provide local approximations to the solutions in a neighborhood of the origin. 
Although the quality of the approximation improves as the number of terms increases, no 
polynomial can adequately represent y 1 and y 2 for large |jc|. A practical way to estimate the 
interval in which a given partial sum is reasonably accurate is to compare the graphs of that 
partial sum and the next one, obtained by including one more term. As soon as the graphs 
begin to separate noticeably, we can be confident that the original partial sum is no longer 
accurate. For example, in Figure 5.2.3 the graphs for n = 24 and n = 27 begin to separate 
at about x = —9/2. Thus, beyond this point, the partial sum of degree 24 is worthless as an 
approximation to the solution. 

Observe that both y\ and y 2 are monotone for x > 0 and oscillatory for x < 0. You can 
also see from the figures that the oscillations are not uniform but, rather, decay in amplitude 
and increase in frequency as the distance from the origin increases. In contrast to Example 1, 
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EXAMPLE 

3 
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FIGURE 5.2.3 Polynomial approximations to the solution yi(x) of Airy’s equation. The 
value of n is the degree of the approximating polynomial. 



FIGURE 5.2.4 Polynomial approximations to the solution yoix) of Airy’s equation. The 
value of n is the degree of the approximating polynomial. 


the solutions y\ and y 2 of Airy’s equation are not elementary functions that you have already 
encountered in calculus. However, because of their importance in some physical applications, 
these functions have been extensively studied, and their properties are well known to applied 
mathematicians and scientists. 


Find a solution of Airy’s equation in powers of x — 1. 

The point x = 1 is an ordinary point of Eq. (15), and thus we look for a solution of the form 

OO 

y = Y2 a n(x - 1)", 

n = 0 
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where we assume that the series converges in some interval \x — 1| < p. Then 

OO OO 

/ = y, na„ (x - l)"” 1 = + l)a n+i (x - 1)", 

n= 1 n= 0 

and 

OO OO 

y" = y 'n(n - 1 )a n (x - 1)"~ 2 = ^(m + 2)(n + l)a„ +2 (x - 1)"- 

n=2 n= 0 

Substituting for y and y" in Eq. (12), we obtain 

OO OO 

+ 2)(« + l)a„ +2 (x - 1)” = x^Onix - 1)". (21) 

n=0 n= 0 

Now to equate the coefficients of like powers of (x — 1), we must express x, the coefficient of 
y in Eq. (15), in powers of x — 1; that is, we write x = 1 + (x — 1). Note that this is precisely 
the Taylor series for x about x = 1. Then Eq. (21) takes the form 


OO OO 

+ 2 )(n + l)fl„ +2 (x - 1)" = [1 + (x - 1)] ^ a„(x - l) n 

n= 0 n=0 

OO OO 

= y^a„(x - 1)" + ^2a„(x - 1)" +1 . 

n= 0 n=0 

Shifting the index of summation in the second series on the right gives 

OO OO OO 

+ 2)(n + l)a„ +2 (x - 1)" = - 1)" + ^a„^i(x - 1)". 

n=0 n= 0 n= 1 

Equating coefficients of like powers of x — 1, we obtain 


2a 2 = r/f> , 

(3 • 2)a 3 = T 
(4 • 3)a 4 = a 2 + fli, 
(5 • 4)a 5 = + a 2 . 


The general recurrence relation is 

(n + 2)(n + l)a„ +2 = a„ + for n > 1. (22) 


Solving for the first few coefficients in terms of a 0 and a 2 , we find that 


#0 (7;[ dQ 

a 2 = -, a 3 = j + j, 
Hence 


^2 #1 (7o #1 

(7a == -|- = -|- , 

12 12 24 12 


(73 (?2 (7o (7i 

as= 20 + 20 = 30 + 120' 


y = a 0 


(x - l) 2 (x - l) 3 {x - l) 4 (x - l) 5 

1+ — + - — + ' + — - + ■ 


24 


30 


+ «i 


' (*-l) 3 (x-1) 4 (x-1) 5 

(x - 1) H 1 73; 1 7377; h ■ 


12 


120 


(23) 


In general, when the recurrence relation has more than two terms, as in Eq. (22), the 
determination of a formula for a„ in terms a Q and cii will be fairly complicated, if not impossible. 
In this example such a formula is not readily apparent. Lacking such a formula, we cannot test 
the two series in Eq. (23) for convergence by direct methods such as the ratio test. However, 
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we shall see in Section 5.3 that even without knowing the formula for a n , it is possible to 
establish that the two series in Eq. (23) converge for all x. Further, they define functions y 3 
and y 4 that are a fundamental set of solutions of the Airy equation (15). Thus 

y = a 0 yi(x) + a 1 y 4 (x) 

is the general solution of Airy’s equation for —00 < x < 00 . 


It is worth emphasizing, as we saw in Example 3, that if we look for a solution 

OO 

of Eq. (1) of the form y = a n (x — x {) ) n , then the coefficients P(x), Q(x), and R(x) 

n=0 

in Eq. (1) must also be expressed in powers of x — xq. Alternatively, we can make 
the change of variable x — x 0 = t, obtaining a new differential equation for y as a 

OO 

function of t, and then look for solutions of this new equation of the form a n t" . 

n = 0 

When we have finished the calculations, we replace t by x — x 0 (see Problem 19). 

In Examples 2 and 3 we have found two sets of solutions of Airy’s equation. The 
functions y 1 and yj defined by the series in Eq. (20) are a fundamental set of solutions 
of Eq. (15) for all x , and this is also true for the functions y 4 and y 4 defined by the series 
in Eq. (23). According to the general theory of second order linear equations, each 
of the first two functions can be expressed as a linear combination of the latter two 
functions and vice versa — a result that is certainly not obvious from an examination 
of the series alone. 

Finally, we emphasize that it is not particularly important if, as in Example 3, we 
are unable to determine the general coefficient a n in terms of ao and a\. What is 
essential is that we can determine as many coefficients as we want. Thus we can find 
as many terms in the two series solutions as we want, even if we cannot determine 
the general term. While the task of calculating several coefficients in a power series 
solution is not difficult, it can be tedious. A symbolic manipulation package can be 
very helpful here; some are able to find a specified number of terms in a power series 
solution in response to a single command. With a suitable graphics package we can 
also produce plots such as those shown in the figures in this section. 


PROBLEMS In each of Problems 1 through 14: 

(a) Seek power series solutions of the given differential equation about the given point xo; 
find the recurrence relation. 

(b) Find the first four terms in each of two solutions y 4 and y 2 (unless the series terminates 
sooner). 

(c) By evaluating the Wronskian W{yi,y 2 )(x Q ), show that y 4 and y 2 form a fundamental set 
of solutions. 

(d) If possible, find the general term in each solution. 


1. y" — y = 0 , Xo = 0 

3. y" — xy' — y = 0, x 0 = 1 
5. (1 - x)y" + y = 0, x 0 = 0 

7. y" + xy' + 2y = 0, x 0 = 0 

9. (1 + x 2 )y" - 4 xy' + 6y = 0, x 0 = 0 


2 . y" - xy’ - y = 0, x 0 = 0 
4. y" + k 1 2 x 2 y = 0, xo = 0, k a constant 
6 . (2 + x 2 )y" - xy' + 4y = 0, x 0 = 0 
8 . xy" + y + xy = 0 , x 0 = 1 
10. (4 - x 2 )y" + 2y = 0, x 0 = 0 
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1 1 . (3 - x 2 )y" - 3xy' -y = 0, x 0 = 0 12. (1 - x)y" + xy' - y = 0, x 0 = 0 

13. 2 y" + xy' + 3y = 0, x 0 = 0 14. 2 y" + (x + 1 )/ + 3y = 0, x 0 = 2 

In each of Problems 15 through 18: 

(a) Find the first five nonzero terms in the solution of the given initial value problem. 

(b) Plot the four-term and the five-term approximations to the solution on the same axes. 

(c) From the plot in part (b) estimate the interval in which the four-term approximation is 
reasonably accurate. 

jfl, 15. y" — xy' — y = 0, y(0) = 2, y'(0) = 1; see Problem 2 

(~jl- 16. (2 + x 2 )y" — xy' + 4y = 0, y(0) = -1, /(0) = 3; see Problem 6 

17. y" + xy' + 2y = 0, y(0) = 4, y'(0) = — 1; see Problem 7 

18. (1 — x)y" + xy' — y = 0, y(0) = —3, /(0) = 2; see Problem 12 

19. (a) By making the change of variable x — 1 = t and assuming that y has a Taylor series in 
powers of f, find two series solutions of 

y" + (x- 1 ) 2 / + (x 2 - l)y = 0 

in powers of x — 1. 

(b) Show that you obtain the same result by assuming that y has a Taylor series in powers 
of x — 1 and also expressing the coefficient x 2 — 1 in powers of.r — 1. 

20. Show directly, using the ratio test, that the two series solutions of Airy’s equation about 
x = 0 converge for all x; see Eq. (20) of the text. 

21. The Hermite Equation. The equation 

y" — 2 xy' + Xy = 0, — oo < x < oo, 

where X is a constant, is known as the Hermite 5 equation. It is an important equation in 
mathematical physics. 

(a) Find the first four terms in each of two solutions about x = 0 and show that they form 
a fundamental set of solutions. 

(b) Observe that if X is a nonnegative even integer, then one or the other of the series 
solutions terminates and becomes a polynomial. Find the polynomial solutions for X = 0, 
2, 4, 6, 8, and 10. Note that each polynomial is determined only up to a multiplicative 
constant. 

(c) The Hermite polynomial H n (x) is defined as the polynomial solution of the Hermite 
equation with X — 2 n for which the coefficient of x n is 2". Find H 0 (x ), . . . , H$(x). 

22. Consider the initial value problem y' = 1 — y 2 ,y( 0) = 0. 

(a) Show that y = sinx is the solution of this initial value problem. 

(b) Look for a solution of the initial value problem in the form of a power series about 
x = 0. Find the coefficients up to the term in x 3 in this series. 

In each of Problems 23 through 28 plot several partial sums in a series solution of the given 
initial value problem about x = 0, thereby obtaining graphs analogous to those in Figures 5.2.1 
through 5.2.4. 


5 Charles Hermite (1822-1901) was an influential French analyst and algebraist. He introduced the Her- 
mite functions in 1864 and showed in 1873 that e is a transcendental number (that is, e is not a root of any 
polynomial equation with rational coefficients). His name is also associated with Hermitian matrices (see 
Section 7.3), some of whose properties he discovered. 
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23. y" — xy' — y = 0, y(0) = 1, y'(0) = 0; see Problem 2 

24. (2 + x 2 )y" — xy' + 4y = 0, y(0) = 1, y'(0) = 0; see Problem 6 

25. y" + xy' + 2y = 0, y(0) = 0, y’ (0) = 1; see Problem 7 

26. (4 — x 2 )y" + 2y = 0, y( 0) = 0, y'( 0) = 1; see Problem 10 

27. y" + x 2 y = 0, y(0) = 1, y' (0) = 0; see Problem 4 

28. (1 — x)y" 4- xy' — 2y = 0, y(0) = 0, y'(0) = 1 
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In the preceding section we considered the problem of finding solutions of 

P(x)y" + Q(x)y' + R(x)y = 0, (1) 

where P , Q , and R are polynomials, in the neighborhood of an ordinary point xq. 
Assuming that Eq. (1) does have a solution y = <p(x) and that </> has a Taylor series 

OO 

y = </>(*) = a n {x - x 0 ) n , (2) 

n = 0 

which converges for |jc — jco| < p, where p > 0, we found that the a n can be deter- 
mined by directly substituting the series (2) for y in Eq. (1). 

Let us now consider how we might justify the statement that if xq is an ordinary 
point of Eq. (1), then there exist solutions of the form (2). We also consider the 
question of the radius of convergence of such a series. In doing this, we are led to a 
generalization of the definition of an ordinary point. 

Suppose, then, that there is a solution of Eq. (1) of the form (2). By differentiating 
Eq. (2) m times and setting x equal to xq, we obtain 

m\a m = 4> {m) (x 0 ). 

Hence, to compute a n in the series (2), we must show that we can determine cp {n \x o) 
for n = 0, 1,2, . . . from the differential equation (1). 

Suppose that y = cp(x) is a solution of Eq. (1) satisfying the initial conditions 
y(xg) = yo, y'(xo) = y o- Then a o = yo and a\ = y' Q . If we are solely interested in 
finding a solution of Eq. (1) without specifying any initial conditions, then a 0 and a\ 
remain arbitrary. To determine (p (n \x o) and the corresponding a n for n = 2, 3, ... , 
we turn to Eq. (1). Since </> is a solution of Eq. (1), we have 

P(x)<p"(x) + Q(x)<f>'(x) + R(x)(p(x) — 0. 

For the interval about x 0 for which P is nonvanishing, we can write this equation in 
the form 

<p"(x) = -p(x)(t)'(x) - q(x)4>(x), (3) 

where p(x) = Q(x)/ P(x) and q(x) = R(x)/ P(x). Setting x equal to xq in Eq. (3) gives 
0"(x o ) = -p(xo)<p'(x 0 ) - q(x 0 )<p(x o). 
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Theorem 5.3.1 


Hence a 2 is given by 


2\a 2 = <p"{x 0 ) = -p(x 0 )ai - q(x 0 )a 0 . 


(4) 


To determine a 2 , we differentiate Eq. (3) and then set x equal to xq, obtaining 


3!n 3 = (p"'(x 0 ) = 


- [p(P " + (p + <?)</>' + q'<t>] 


X=XQ 


= - 2 \p(x 0 )a 2 - 1 p\xq ) + q(x 0 )]ai - q\x 0 )a 0 . 


(5) 


Substituting for a 2 from Eq. (4) gives a 2 in terms of a\ and no- Since P, Q, and R are 
polynomials and P(x 0 ) ^ 0, all the derivatives of p and q exist at xq. Hence, we can 
continue to differentiate Eq. (3) indefinitely, determining after each differentiation 
the successive coefficients a 4 , a 5 , . . . by setting x equal to x 0 . 

Notice that the important property that we used in determining the a n was that we 
could compute infinitely many derivatives of the functions p and q. It might seem 
reasonable to relax our assumption that the functions p and q are ratios of polyno- 
mials and simply require that they be infinitely differentiable in the neighborhood 
of x 0 . Unfortunately, this condition is too weak to ensure that we can prove the con- 
vergence of the resulting series expansion for y = <p(x). What is needed is to assume 
that the functions p and q are analytic at xo; that is, they have Taylor series expansions 
that converge to them in some interval about the point xq: 

OO 

p(x) — Po + Pi{x — X 0 ) -) b Pn(x ~ X 0 )" H = ^ P»(X ~ X 0 ) n , (6) 

n = 0 


OO 

q( X) = q 0 + qi(x — x 0 ) H b q n (x - x 0 ) n H = ^ q„(x - x 0 ) n . (7) 

n = 0 

With this idea in mind, we can generalize the definitions of an ordinary point and 
a singular point of Eq. (1) as follows: if the functions p = Q/P and q = R/P are 
analytic at x 0 , then the point x 0 is said to be an ordinary point of the differential 
equation (1); otherwise, it is a singular point. 

Now let us turn to the question of the interval of convergence of the series solution. 
One possibility is actually to compute the series solution for each problem and then 
to apply one of the tests for convergence of an infinite series to determine its radius 
of convergence. Unfortunately, these tests require us to obtain an expression for the 
general coefficient a n as a function of n, and this task is often quite difficult, if not 
impossible; recall Example 3 in Section 5.2. However, the question can be answered 
at once for a wide class of problems by the following theorem. 


If xq is an ordinary point of the differential equation (1) 

P(x)y" + Q(x)y' + R(x)y = 0, 

that is, if p = Q/P and q = R/P are analytic at xq, then the general solution of 
Eq. (1) is 

OO 

y — ^2 a,,(x ~ x °> n = fl oyiU) + aiy 2 (x), 

n = 0 


( 8 ) 
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EXAMPLE 

1 


where «o and a\ are arbitrary, and y \ and y 2 are two power series solutions that are 
analytic at xq. The solutions y i and y 2 form a fundamental set of solutions. Further, 
the radius of convergence for each of the series solutions y\ and y 2 is at least as 
large as the minimum of the radii of convergence of the series for p and q. 


To see that y\ and y 2 are a fundamental set of solutions, note that they have the form 
yi(x) — 1 + b 2 (x — x 0 ) 2 + • • ■ and y 2 (x) = (x — x 0 ) + c 2 (x — x 0 ) 2 + ■ ■ -, where b 2 + 
c 2 = a 2 . Hence y\ satisfies the initial conditions yi (xq) = l,y[(xo) — 0. and y 2 satisfies 
the initial conditions y 2 (*o) = 0,y' 2 (xo) = 1. Thus W (yi,y 2 )(xa) — 1. 

Also note that although calculating the coefficients by successively differentiating 
the differential equation is excellent in theory, it is usually not a practical computa- 
tional procedure. Rather, one should substitute the series (2) for y in the differential 
equation (1) and determine the coefficients so that the differential equation is satis- 
fied, as in the examples in the preceding section. 

We will not prove this theorem, which in a slightly more general form was estab- 
lished by Fuchs. 6 What is important for our purposes is that there is a series solution 
of the form (2) and that the radius of convergence of the series solution cannot be 
less than the smaller of the radii of convergence of the series for p and q: hence we 
need only determine these. 

This can be done in either of two ways. Again, one possibility is simply to compute 
the power series for p and q and then to determine the radii of convergence by 
using one of the convergence tests for infinite series. However, there is an easier 
way when P , Q. and R are polynomials. It is shown in the theory of functions of a 
complex variable that the ratio of two polynomials, say, Q/P, has a convergent power 
series expansion about a point x = xq if P(x o) ^ 0. Further, if we assume that any 
factors common to Q and P have been canceled, then the radius of convergence of 
the power series for Q/P about the point xq is precisely the distance from xq to the 
nearest zero of P. In determining this distance, we must remember that P(x) = 0 
may have complex roots, and these must also be considered. 


What is the radius of convergence of the Taylor series for (1 + x 2 ) 1 about x — 0? 

One way to proceed is to find the Taylor series in question, namely, 

= l-x 2 +x 4 -x 6 + --- + (-l) n x 2n + ---. 

1 +x 2 

Then it can be verified by the ratio test that p = 1. Another approach is to note that the zeros 
of 1 + x 2 are x — ±i. Since the distance in the complex plane from 0 to i or to —i is 1, the radius 
of convergence of the power series about x = 0 is 1. 


6 Immanuel Lazarus Fuchs (1833-1902) was a student and later a professor at the University of Berlin. 
He proved the result of Theorem 5.3.1 in 1866. His most important research was on singular points of 
linear differential equations. He recognized the significance of regular singular points (Section 5.4), and 
equations whose only singularities, including the point at infinity, are regular singular points are known as 
Fuchsian equations. 
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What is the radius of convergence of the Taylor series for (x 2 — 2x + 2) 1 about x = 0? about 
x = 1? 

First notice that 

x 2 — 2x + 2 = 0 

has solutions x = 1 ± i. The distance in the complex plane from x = 0 to either x = 1 + / or 

OO 

x = 1 — i is a/ 2; hence the radius of convergence of the Taylor series expansion Y a n x " about 

n = o 

x = 0 is ■s/2. 

The distance in the complex plane from x = 1 to either x = 1 + i or x = 1 — i is 1; hence the 

OO 

radius of convergence of the Taylor series expansion Y b n {x — 1)" about x = 1 is 1. 

n = o 

According to Theorem 5.3.1, the series solutions of the Airy equation in Examples 
2 and 3 of the preceding section converge for all values of x and x — 1, respectively, 
since in each problem P(x) = 1 and hence is never zero. 

A series solution may converge for a wider range of x than indicated by Theorem 
5.3.1, so the theorem actually gives only a lower bound on the radius of convergence 
of the series solution. This is illustrated by the Legendre polynomial solution of the 
Legendre equation given in the next example. 

Determine a lower bound for the radius of convergence of series solutions about x = 0 for the 
Legendre equation 

(1 — x 2 )y" — 2 xy' + a(a + 1 )y = 0, 

where a is a constant. 

Note that P(x) = 1 — x 2 , Q(x ) — — 2x, and R(x) — a (a + 1) are polynomials, and that the 
zeros of P, namely, x = ±1, are a distance 1 from x = 0. Hence a series solution of the form 

OO 

Y a n x n converges at least for |x| < 1, and possibly for larger values of x. Indeed, it can be 

n= 0 

shown that if a is a positive integer, one of the series solutions terminates after a finite number 
of terms and hence converges not just for \x\ < 1 but for all x. For example, if a = 1, the 
polynomial solution is y = x. See Problems 22 through 29 at the end of this section for a more 
comprehensive discussion of the Legendre equation. 


Determine a lower bound for the radius of convergence of series solutions of the differential 
equation 

(1 + x 2 )y" + 2xy' + 4x 2 y = 0 (9) 

about the point x = 0; about the point x = — \ . 

Again P, Q. and R are polynomials, and P has zeros at x = ±i. The distance in the complex 
plane from 0 to ±i is 1, and from —1 to ±i is ^1 + 1 = V5/2. Hence in the first case the 

series Y a n x " converges at least for |x| < 1, and in the second case the series Y b n (x + 1) 

n = 0 h=0 

converges at least for |x + 1| < a/5/2. 

An interesting observation that we can make about Eq. (9) follows from Theorems 3.2.1 
and 5.3.1. Suppose that initial conditions y(0) = y 0 and /((]) = y' 0 are given. Since 1 + x 2 / 0 
for all x, we know from Theorem 3.2.1 that there exists a unique solution of the initial value 
problem on —oo < x < oo. On the other hand, Theorem 5. 3.1 only guarantees a series solution 

OO 

of the form Y (with a 0 = yo^i = /) f° r — 1 < x < 1. The unique solution on the interval 

n=0 

—oo < x < oo may not have a power series about x = 0 that converges for all x. 
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Can we determine a series solution about x = 0 for the differential equation 

y" + (sinx)/ + (1 + x 2 )y = 0, 
and if so, what is the radius of convergence? 

For this differential equation,/?(x) = sinx and q{x) = 1 + x 2 . Recall from calculus that sinx 
has a Taylor series expansion about x = 0 that converges for all x. Further, q also has a Taylor 
series expansion about x = 0, namely, q(x ) = 1 + x 2 , that converges for all x. Thus there is a 

OO 

series solution of the form y = «„x" with a o and a\ arbitrary, and the series converges for 

n = 0 

all x. 


PROBLEMS In each of Problems 1 through 4 determine <j>"{x o), <p"'(x o), and 0 (4) (x o) for the given point xo 
if y = <f>(x) is a solution of the given initial value problem. 

1 . y" + xy' + y = 0; y(0) = 1, /(()) = 0 

2. y" + (sinx)/ + (cosx)y = 0; y(0) = 0, /( 0) = 1 

3. x 2 y" + (1 + x)y' + 3(lnx)y = 0; y( 1) = 2, /( 1) = 0 

4. /' + x 2 / + (sinx)v = 0; y(0) = a 0 , /( 0) = m 

In each of Problems 5 through 8 determine a lower bound for the radius of convergence of 
series solutions about each given point x 0 for the given differential equation. 

5. y" + 4/ + 6xy = 0; x 0 = 0, x 0 = 4 

6. (x 2 — 2x — 3)y" + xy' + 4y = 0; xo = 4, xo = —4, xo = 0 

7. (1 + x 3 )y" + 4x/ + y = 0; xo = 0, xo = 2 

8. xy" + y = 0; x 0 = 1 

9. Determine a lower bound for the radius of convergence of series solutions about the given 
xo for each of the differential equations in Problems 1 through 14 of Section 5.2. 

10. The Chebyshev Equation. The Chebyshev 7 differential equation is 

(1 - x 2 )y" - xy' + a 2 y = 0, 


where a is a constant. 

(a) Determine two solutions in powers of x for |x| < 1 and show that they form a funda- 
mental set of solutions. 

(b) Show that if cc is a nonnegative integer n , then there is a polynomial solution of 
degree n. These polynomials, when properly normalized, are called the Chebyshev 
polynomials. They are very useful in problems that require a polynomial approximation 
to a function defined on —1 < x < 1. 

(c) Find a polynomial solution for each of the cases a = n = 0, 1, 2, 3. 


7 Pafnuty L. Chebyshev (1821-1894), professor at Petersburg University for 35 years and the most influen- 
tial nineteenth-century Russian mathematician, founded the so-called Petersburg school, which produced 
a long line of distinguished mathematicians. His study of Chebyshev polynomials began about 1854 as 
part of an investigation of the approximation of functions by polynomials. Chebyshev is also known for 
his work in number theory and probability. 
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For each of the differential equations in Problems 11 through 14 find the first four nonzero 
terms in each of two power series solutions about the origin. Show that they form a funda- 
mental set of solutions. What do you expect the radius of convergence to be for each solution? 

11. y" + (sinx)v = 0 12. e x y" + xy = 0 

13. (cosjt)y" + xy' — 2y = 0 14. e~ x y" + ln(l + x)y' — xy = 0 

15. Letx and* 2 be solutions of a differential equation P(x)y" + Q(x)y' + R(x)y = 0. Can you 
say whether the point x = 0 is an ordinary point or a singular point? Prove your answer. 

First Order Equations. The series methods discussed in this section are directly applicable 
to the first order linear differential equation P{x)y' + Q(x)y = 0 at a point x 0 , if the function 
p = Q/P has a Taylor series expansion about that point. Such a point is called an ordinary 

OO 

point, and further, the radius of convergence of the series y — £ a„(x — xo) n is at least as 

n = 0 

large as the radius of convergence of the series for Q/P. In each of Problems 16 through 21 
solve the given differential equation by a series in powers of x and verify that a o is arbitrary 
in each case. Problems 20 and 21 involve nonhomogeneous differential equations to which 
series methods can be easily extended. Where possible, compare the series solution with the 
solution obtained by using the methods of Chapter 2. 

16. y - y = 0 17. y -xy = 0 

2 

18. y — e x y, three terms only 19. (1 — x)y' = y 

20 .y'-y = x 2 21.y'+xy = l+x 

The Legendre Equation. Problems 22 through 29 deal with the Legendre 8 equation 

(1 — x 2 )y" — 2 xy’ + a (a + 1 )y = 0. 


As indicated in Example 3, the point* = 0 is an ordinary point of this equation, and the distance 
from the origin to the nearest zero of P(x) = 1 — x 2 is 1. Hence the radius of convergence of 
series solutions about x = 0 is at least 1. Also notice that we need to consider only a > — 1 
because if a < —1, then the substitution a = — (1 + y), where y > 0, leads to the Legendre 
equation (1 — x 2 )y" — 2 xy' + y(y + 1 )y = 0. 

22. Show that two solutions of the Legendre equation for |*| < 1 are 


yi(x) = 1 - 


+ 1) 2 a(a - 2)(a + l)(a + 3) , 

xr -\ x 

2! 4! 


, V\ ir “ ' ' ' (g ~ 2m + 2)(t * + 1] • ' • (a + 2m ~ 2m 

(2m)! 


(ot — l)(ff + 2) 3 (a - l)(a - 3) (a + 2) (or + 4) 5 
y 2 (x) = x — x — x~ 


3! 


5! 


+ D- 1 )’ 


, (a - 1) ■ ■ ■ (a - 2m + l)(a + 2) • ■ ■ (a + 2m) 2m+1 
(2m + 1)! 


s Adrien-Marie Legendre (1752-1833) held various positions in the French Academie des Sciences from 
1783 onward. His primary work was in the fields of elliptic functions and number theory. The Legendre 
functions, solutions of Legendre’s equation, first appeared in 1784 in his study of the attraction of spheroids. 
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23. Show that, if a is zero or a positive even integer 2 n, the series solution y\ reduces to a 
polynomial of degree 2 n containing only even powers of x. Find the polynomials corre- 
sponding to a = 0, 2, and 4. Show that, if a is a positive odd integer 2n + 1, the series 
solution y 2 reduces to a polynomial of degree 2 n + 1 containing only odd powers of x. 
Find the polynomials corresponding to a = 1,3, and 5. 

24. The Legendre polynomial P n (.r) is defined as the polynomial solution of the Legendre 
equation with a = n that also satisfies the condition P„( 1) = 1. 

(a) Using the results of Problem 23, find the Legendre polynomials Pq{x), . . . , Psix). 

(b) Plot the graphs of P 0 (x), . . . ,P 5 (x) for — 1 < x < 1. 

(c) Find the zeros of Pq(x), . . . , Psix). 

25. It can be shown that the general formula for P„ix) is 


L"/2J 


P n ix) — 9 „ X 


(— l)*(2r» — 2k)\ 
k\in — k)\in — 2 k)\ X 


where \n/2\ denotes the greatest integer less than or equal to n/2. By observing the form 
of P„ix) for n even and n odd, show that P„i~ 1) = (— 1)". 

26. The Legendre polynomials play an important role in mathematical physics. For example, in 
solving Laplace’s equation (the potential equation) in spherical coordinates, we encounter 
the equation 

d 2 Fiw) dFi<p) 

— — hcot <p — — - + nin + l)Fi<p) = 0, 0 < cp < n, 

d(p z dip 


where n is a positive integer. Show that the change of variable x = cos ip leads to the 
Legendre equation with a = n for y = fix) — F(arccosx). 

27. Show that for n = 0,1, 2, 3, the corresponding Legendre polynomial is given by 


1 d n 

P„ix) = — ix 2 — l) n . 

2 n n\ dx " 


This formula, known as Rodrigues’ formula, 9 is true for all positive integers n. 

28. Show that the Legendre equation can also be written as 

[(1 - x 2 )y']' = -a(a + 1 )y. 


Then it follows that 


[(1 - x 2 )P' n ix)f = -nin + 1 )P n ix) and [(1 - x 2 )P' m (x)f - -mini + 1 )P m (x). 


By multiplying the first equation by P m ix) and the second equation by P n ix), integrating 
by parts, and then subtracting one equation from the other, show that 


/; 


P n ix)P m ix) dx = 0 


if n m. 


This property of the Legendre polynomials is known as the orthogonality property. If 
m = n, it can be shown that the value of the preceding integral is 2/(2n + 1). 


9 Ofinde Rodrigues (1794-1851) published this result as part of his doctoral thesis from the Ecole Normale 
in Paris in 1816. He later became a banker and social reformer. 
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29. Given a polynomial / of degree n , it is possible to express / as a linear combination of 
P 0 .P 1 .P 2 , ■ ■ ■ ,Pn '■ 

n 

fix) = a k p k (x ). 

k=0 

Using the result of Problem 28, show that 

2k + 1 f 1 

a k = — - — J f(x)P k (x)d. x. 


5.4 Euler Equations; Regular Singular Points 

In this section we will begin to consider how to solve equations of the form 

P(x)y" + Q(x)y' + R(x)y = 0 (1) 

in the neighborhood of a singular point xq. Recall that if the functions P, Q, and R 
are polynomials having no common factors, then the singular points of Eq. (1) are 
the points for which P(x) = 0. 

Euler Equations. A relatively simple differential equation that has a singular point is 

the Euler equation 10 

L[y] = x 2 y" + axy' + fiy = 0, (2) 

where a and p are real constants. In this case Fix) — x 2 , so x = 0 is the only singular 
point for Eq. (2); all other points are ordinary points. For convenience we first 
consider the interval x > 0; later we extend our results to the interval* < 0. 

Observe that (*'')' = rx '~ 1 and (*')" = r(r — l)x r ~ 2 . Hence, if we assume that 
Eq. (2) has a solution of the form 

y = *, ( 3 ) 

then we obtain 

L[x r ] = x 2 (x r )" + ax(x r )' + Px r 

= x r [r(r — 1) + ar + p], (4) 

If r is a root of the quadratic equation 

F(r) = r(r - 1) + ar + p = 0, (5) 

then L[x r ] is zero, and y = x r is a solution of Eq. (2). The roots of Eq. (5) are 

-(a - 1) ± V(« - l) 2 - 4/ 

n,r 2 = 2 ’ ( 6 ) 

and F(r) = (r — r\)(r — r 2 ). As for second order linear equations with constant coef- 
ficients, we consider separately the cases in which the roots are real and different, real 


10 This equation is sometimes called the Cauchy-Euler equation or the equidimensional equation. It was 
studied by Euler about 1740, but its solution was known to Johann Bernoulli before 1700. 
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EXAMPLE 

1 


but equal, and complex conjugates. Indeed, the entire discussion of Euler equations 
is similar to the treatment of second order linear equations with constant coefficients 
in Chapter 3, with e rx replaced by x r . 

Real, Distinct Roots. If F(r) — 0 has real roots r\ and r 2 , with r\ / r 2 , then yi(x) = x n 
andy 2 (^) = x ri are solutions of Eq. (2). Since 

W(x n ,x r2 ) = (r 2 - r\)x n+r2 ~ l 

is nonvanishing for r 2 / r 2 and a- > 0, it follows that the general solution of Eq. (2) is 

y = ciA ri + c 2 x ri , x > 0. (7) 

Note that if r is not a rational number, then x r is defined by x r = e r,nx . 


Solve 

2 x 2 y" + 3 xy' — y = 0, x > 0. (8) 

Substituting y = x r in Eq. (8) gives 

x r [2 r(r - 1) + 3r - 1] = x r (2 r 2 + r - 1) = x r (2 r - l)(r + 1) = 0. 

Hence r\ = 1 and r 2 = —1, so the general solution of Eq. (8) is 

y — cix 1/2 + C 2 X -1 , x > 0. (9) 


Equal Roots. If the roots r\ and r 2 are equal, then we obtain only one solution 
yi (a) = x n of the assumed form. A second solution can be obtained by the method 
of reduction of order, but for the purpose of our future discussion we consider an 
alternative method. Since r\ = r 2 , it follows that F(r) = (r — r 2 ) 2 . Thus in this case 
not only does F(ri) = 0 but also F'Oi) = 0. This suggests differentiating Eq. (4) with 
respect to r and then setting r equal to ri. Differentiating Eq. (4) with respect to r 
gives 

L[x r ] = i ] -[x r F{r )]. 
dr dr 

Substituting for F(r), interchanging differentiation with respect to x and with respect 
to r, and noting that d{x r )/dr = x r lnx, we obtain 

L[x r lnx] = (r — r\) 2 x r \nx + 2(r — r\)x r . (10) 

The right side of Eq. (10) is zero for r = r\\ consequently, 

y 2 {x) — x n In a, a > 0 (11) 

is a second solution of Eq. (2). By evaluating the Wronskian, we find that 

W(x n , x ri In a) = A 2ri_1 . 

Hence x n and x n In a are a fundamental set of solutions for a > 0, and the general 
solution of Eq. (2) is 

y = (ci + c 2 lnx)A n , a > 0. (12) 
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Solve 

x 2 y" + 5xy' + 4y = 0, x > 0. (13) 

Substituting y = x r in Eq. (13) gives 

x r [r(r - 1) + 5r + 4] = x r (r 2 + 4 r + 4) = 0. 

Hence r\ = r% = —2, and 

y = x -2 (ci + C 2 lnx), x > 0 (14) 

is the general solution of Eq. (13). 

Complex Roots. Finally, suppose that the roots r\ and i '2 of Eq. (5) are complex con- 
jugates, say, r\ = a + ip and r 2 = k — ip, with p ^ 0. We must now explain what is 

meant by x r when r is complex. Remembering that 

x r = e r,nx (15) 

when x > 0 and r is real, we can use this equation to define x r when r is complex. 
Then, using Euler’s formula for e l,i hlx , we obtain 

A.+//Z _(A.+f/x ) \nx Xlnx i/ilnx A. i/ilnx 

A C C C A L- 

= x^cosf/r lnx) + isin(p In a)], x > 0. (16) 

With this definition of x r for complex values of r, it can be verified that the usual 
laws of algebra and the differential calculus hold, and hence x ri and x n are indeed 
solutions of Eq. (2). The general solution of Eq. (2) is 

y = c l x l+i ^ + c 2 x x - il1 . (17) 

The disadvantage of this expression is that the functions x x+ ' 11 and are complex- 

valued. Recall that we had a similar situation for the second order differential equa- 
tion with constant coefficients when the roots of the characteristic equation were 
complex. In the same way as we did then, we observe that the real and imaginary 
parts of x' +ll ‘ , namely, 

x x cos(/rlnA) and x x sin(p lnx) (18) 

are also solutions of Eq. (2). A straightforward calculation shows that 

W[x x cos (p \nx),x x sin(/r lnx)] = px 2x ~ l . 

Hence these solutions form a fundamental set of solutions for x > 0, and the general 
solution of Eq. (2) is 

y = C\X X cos (p lnx) + C 2 X* sin (p lnx), x > 0. (19) 

EXAMPLE 

3 

Solve 

x 2 /' + xy’ + y = 0. (20) 

Substituting y = x r in Eq. (20) gives 

x r [r(r - 1) + r + 1] = x r {r 2 + 1) = 0. 

Hence r = ±i, and the general solution is 

y = Ci cos(lnx) + c 2 sin(lnx), x > 0. (21) 

The factor x x does not appear explicitly in Eq. (21) because in this example k = 0 and ,v A = 1. 
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Now let us consider the qualitative behavior of the solutions of Eq. (2) near the 
singular point x = 0. This depends entirely on the values of the exponents r\ and at. 
First, if r is real and positive, then x r — > 0 as x tends to zero through positive values. 
On the other hand, if r is real and negative, then x r becomes unbounded. Finally, if 
r = 0, then x r = 1. Figure 5.4.1 shows these possibilities for various values of r. If r is 
complex, then a typical solution is x x cos (/x In x). This function becomes unbounded 
or approaches zero if X is negative or positive, respectively, and also oscillates more 
and more rapidly as x —*■ 0. This behavior is shown in Figures 5.4.2 and 5.4.3 for 
selected values of X and /i. If X = 0. the oscillation is of constant amplitude. Finally, 
if there are repeated roots, then one solution is of the form v'Tnx, which tends to 
zero if r > 0 and becomes unbounded if r < 0. An example of each case is shown in 
Figure 5.4.4. 

The extension of the solutions of Eq. (2) into the interval x < 0 can be carried out 
in a relatively straightforward manner. The difficulty lies in understanding what is 
meant by x r when x is negative and r is not an integer; similarly, In x has not been 
defined for x < 0. The solutions of the Euler equation that we have given for x > 0 
can be shown to be valid for x < 0, but in general they are complex-valued. Thus in 
Example 1 the solution x 1/2 is imaginary for x < 0. 

It is always possible to obtain real-valued solutions of the Euler equation (2) in 
the interval x < 0 by making the following change of variable. Let x = — £, where 



FIGURE 5.4.1 Solutions of an Euler equation; real roots. 
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FIGURE 5.4.2 Solution of an Euler equa- FIGURE 5.4.3 Solution of an Euler equa- 
tion; complex roots with negative real part. tion; complex roots with positive real part. 
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FIGURE 5.4.4 
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Typical second solutions of an Euler equation with equal roots. 


% > 0, and let y = u(%). Then we have 

dy du dl; du d 2 y d / du\ d% d 2 u 

dx dtj dx d% ’ dx 2 dq l dt; ) dx d % 2 

Thus, for x < 0, Eq. (2) takes the form 
, d 2 u du 

$ 2 ^+a!;—+P i i = 0, £ > 0. (23) 

B ut, except for names of the variables, this is exactly the same as Eq. (2) ; from Eqs. (7) , 
(12), and (19) we have 


«(!) = 


cir 1 + c 2 r 

(Cl +c 2 ln£)£ ri 

cif 1 cos(/r ln£) + c 2 % x sin(p ln£), 


(24) 


depending on whether the zeros of F(r) — r(r — 1 ) + ar + /3 are real and different, 
real and equal, or complex conjugates. To obtain u in terms of x, we replace f by — x 
in Eqs. (24). 

We can combine the results for x > 0 and x < 0 by recalling that \x\ = xwhenx > 0 
and that |x| = — xwhenx < 0. Thus we need only replace* by |x|inEqs. (7), (12), and 
(19) to obtain real-valued solutions valid in any interval not containing the origin. 

Hence the general solution of the Euler equation (2) 

x 2 y" + axy' + fiy = 0 


in any interval not containing the origin is determined by the roots r\ and r 2 of the 
equation 

F{r) = r(r — 1) + ar + P — 0 
as follows. If the roots are real and different, then 

y — ci\x\ n + c 2 |x|'" 2 . (25) 

If the roots are real and equal, then 

y= (ci + c 2 ln|x|)|*r i . (26) 

If the roots are complex conjugates, then 

y = |x| A [ci cos(p In |x|) + c 2 sin(p In |x|)] , (27) 


where ri, r 2 = k ± ip. 
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The solutions of an Euler equation of the form 

(x - xo) 2 y" + a(x - xo)y' + f)y = 0 (28) 

are similar. If we look for solutions of the form y = {x — xo) r , then the general solution 
is given by Eq. (25), Eq. (26), or Eq. (27) with x replaced by x — xq. Alternatively, 
we can reduce Eq. (28) to the form of Eq. (2) by making the change of independent 
variable t = x — xq. 

Regular Singular Points. We now return to a consideration of the general equation (1) 

P(x)y" + Q(x)y' + R{x)y = 0 

where x 0 is a singular point. This means that P(x 0 ) = 0 and at least one of Q and R 
is not zero at xq. 

Unfortunately, if we attempt to use the methods of the preceding two sections to 
solve Eq. (1) in the neighborhood of a singular point xo, we find that these methods 
fail. This is because the solution of Eq. (1) is often not analytic at xo and consequently 
cannot be represented by a Taylor series in powers of x — x 0 . Examples 1, 2, and 3 
above illustrate this fact; in each of these examples the solution fails to have a power 
series expansion about the singular point x = 0. Therefore, to have any chance of 
solving Eq. (1) in the neighborhood of a singular point, we must use a more general 
type of series expansion. 

Since the singular points of a differential equation are usually few in number, we 
might ask whether we can simply ignore them, especially since we already know how 
to construct solutions about ordinary points. However, this is not feasible because 
the singular points determine the principal features of the solution to a much larger 
extent than you might at first suspect. In the neighborhood of a singular point the 
solution often becomes large in magnitude or experiences rapid changes in magni- 
tude. For example, the solutions found in Examples 1, 2, and 3 are illustrations of this 
fact. Thus the behavior of a physical system modeled by a differential equation fre- 
quently is most interesting in the neighborhood of a singular point. Often geometric 
singularities in a physical problem, such as corners or sharp edges, lead to singular 
points in the corresponding differential equation. Thus, although at first we might 
want to avoid the few points where a differential equation is singular, it is precisely 
at these points that it is necessary to study the solution most carefully. 

As an alternative to analytical methods, we can consider the use of numerical 
methods, which are discussed in Chapter 8. However, these methods are ill suited 
for the study of solutions near a singular point. Thus, even if we adopt a numerical 
approach, it is advantageous to combine it with the analytical methods of this chapter 
in order to examine the behavior of solutions near singular points. 

Without any additional information about the behavior of Q/P and R/P in the 
neighborhood of the singular point, it is impossible to describe the behavior of the 
solutions of Eq. (1) near x = xq. It may be that there are two distinct solutions of 
Eq. (1) that remain bounded as x — >• xo (as in Example 3); or there may be only one, 
with the other becoming unbounded as x — »■ x 0 (as in Example 1); or they may both 
become unbounded as x — »■ xo (as in Example 2). IfEq. (1) has solutions that become 
unbounded as x — >■ xo, it is often important to determine how these solutions behave 
as x — »■ xo. For example, does y -> oo in the same way as (x — xo) -1 or |x — xo |~ 1 ,/2 , 
or in some other manner? 
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Our goal is to extend the method already developed for solving Eq. (1) near an 
ordinary point so that it also applies to the neighborhood of a singular point jco- To 
do this in a reasonably simple manner, it is necessary to restrict ourselves to cases in 
which the singularities in the functions Q/P and R/P at x = x 0 are not too severe — 
that is, to what we might call “weak singularities.” At this stage it is not clear exactly 
what is an acceptable singularity. However, as we develop the method of solution, 
you will see that the appropriate conditions (see also Section 5.6, Problem 21) to 
distinguish “weak singularities” are 



(29) 


and 



(30) 


This means that the singularity in Q/P can be no worse than ( x — xo) _1 and the singu- 
larity in R/P can be no worse than (x — xo)~ 2 . Such a point is called a regular singular 
point of Eq. (1). For equations with more general coefficients than polynomials, x 0 
is a regular singular point of Eq. (1) if it is a singular point and if both 11 



(31) 


have convergent Taylor series about x 0 — thatis,ifthe functions inEq. (31) are analytic 
at x — xq. Equations (29) and (30) imply that this will be the case when P, Q, and R 
are polynomials. Any singular point of Eq. (1) that is not a regular singular point is 
called an irregular singular point of Eq. (1). 

Observe that the conditions in Eqs. (29) and (30) are satisfied by the Euler equation 
(28). Thus the singularity in an Euler equation is a regular singular point. Indeed, 
we will see that all equations of the form (1) behave very much like Euler equations 
near a regular singular point. That is, solutions near a regular singular point may 
include powers of x with negative or nonintegral exponents, logarithms, or sines or 
cosines of logarithmic arguments. 

In the following sections we discuss how to solve Eq. (1) in the neighborhood of a 
regular singular point. A discussion of the solutions of differential equations in the 
neighborhood of irregular singular points is more complicated and may be found in 
more advanced books. 


Determine the singular points of the Legendre equation 


EXAMPLE 


4 


(1 — x 2 )y" — 2 xy' + a (a + 1 )y = 0 


(32) 


and determine whether they are regular or irregular. 


n The functions given in Eq. (31) may not be defined at xq, in which case their values at xq are to be 
assigned as their limits as x — > xq. 
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EXAMPLE 

5 


EXAMPLE 

6 


In this case P(x ) = 1 — x 2 , so the singular points are x = 1 and x = —1. Observe that when 
we divide Eq. (32) by 1 — x 2 , the coefficients of / and y are — 2x/(l — x 2 ) anda(a + 1)/(1 — x 2 ), 
respectively. We consider the point x = 1 first. Thus, from Eqs. (29) and (30), we calculate 


lim(x — 1) 


—2x 


= lim 


(x — 1)(— 2x) 


*->l (1 — x)(l + X) 


— lim - 


2x 


»i 1 T x 


= 1 


and 


lim(x — l) 2 

X — > 1 


a (a + 1 ) 
1 — x 2 


(x — \ya(oi + 1) 

lim 

i->i (1 — x)(l+x) 


(x - l)(-ot)(a + 1) 

= lim = 0. 

*->1 1 + x 


Since these limits are finite, the point x = 1 is a regular singular point. It can be shown in a 
similar manner that x = — 1 is also a regular singular point. 


Determine the singular points of the differential equation 

2x(x — 2 ) 2 y" + 3xy' + (x — 2 )y = 0 

and classify them as regular or irregular. 

Dividing the differential equation by 2x(x — 2) 2 , we have 


y + 2(x - 2) 2 7 + 2x(x - 2) } " °’ 

so p(x) = Q(x)/P(x) — 3/2(x — 2) 2 and q(x) — R(x)/P(x) = l/2x(x — 2). The singular points 
are x = 0 and x = 2. Consider x = 0. We have 

3 

limxp(x) = limx = 0, 

x ^o x^o 2(x - 2) 2 


limx 2 £7(x) = limx 2 = 0. 

*->■0 x 2x(x — 2) 

Since these limits are finite, x = 0 is a regular singular point. For x = 2 we have 


lim(x - 

x — >2 


2 )p(x) = lim(x - 

x —>2 


■ 2 ); 


= lim - 


2(x — 2)2 xXi 2(x — 2) ’ 

so the limit does not exist; hence x = 2 is an irregular singular point. 


Determine the singular points of 

^x — y ) y" + (cosx)y' + (sinx)y = 0 


and classify them as regular or irregular. 

The only singular point is x = ?r/2. To study it, we consider the functions 


{x-\)p(x) = {x- |) 


Q(x) 

P{x) 


cosx 
X — TC / 2 


and 
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l n \ 2 , ( n\ 2 R(x) 

V 2 / ' V 2/ P(x) 

Starting from the Taylor series for cos x about x = n/2, we find that 

cosx (x — n/2) 2 (x — n / 2) 4 

x- ;r/2 = ~ + 3! 5! + '"’ 

which converges for all x. Similarly, sin* is analytic at x = jt/2. Therefore we conclude that 
?r/2 is a regular singular point for this equation. 


PROBLEMS 


In each of Problems 1 through 12 determine the general solution of the given differential 
equation that is valid in any interval not including the singular point. 

1. x 2 y" + 4 xy' + 2y = 0 2. (x + l) 2 y"+ 3(x + 1 )y'+ 0.75y = 0 

3. x 2 y" - 3 xy' + 4y = 0 4. x 2 y" + 3 xy' + 5y = 0 

5. x 2 y" - xy' + y = 0 6. (x - l) 2 y" + SCv - 1 )y' + 12y = 0 

7. x 2 y" + 6xy' - y = 0 8. 2x 2 y" - 4x / + 6y = 0 

9. x 2 y" - 5xy' + 9y = 0 10. (x - 2) 2 y" + 5(x - 2)/ + 8y = 0 

11. x 2 y" + 2xy' + 4y = 0 12. x 2 y" - 4 xy' + 4y = 0 


In each of Problems 13 through 16 find the solution of the given initial value problem. Plot 
the graph of the solution and describe how the solution behaves as x — >■ 0. 

13. 2x 2 y" + xy' - 3y = 0, y( 1) = 1, /( 1) = 4 
4 ffl 14. 4 x 2 y" + 8 xy' + 11 y = 0, v(l) = 2, /( 1) = -3 
4*1 15. x 2 y" - 3 xy' + 4y = 0, y(-l) = 2, y'(-l) = 3 

jfl 16. x 2 y" + 3 xy' + 5y = 0, y(l) = 1, y'(l) = -1 


In each of Problems 17 through 34 find all singular points of the given equation and determine 
whether each one is regular or irregular. 

17. xy" + (1 - x)y' + xy = 0 18. x 2 (l - x) 2 y" + 2xy' + 4y = 0 

19. x 2 (l - x)y" + (x - 2)y' - 3xy = 0 20. x 2 (l - x 2 )y" + (2 /x)y' + 4y = 0 

21. (1 - x 2 ) 2 y" + x(l - x)y' + (1 + x)y = 0 

22. x 2 y" + xy' + (x 2 — v 2 )y = 0, Bessel equation 

23. (x + 3)y" - 2xy' + (1 - x 2 )y = 0 

24. x(l - x 2 ) 3 y" + (1 - x 2 ) 2 y' + 2(1 + x)y = 0 

25. (x + 2 ) 2 (x — l)y" + 3(x — l)y' — 2(x + 2)y = 0 

26. x(3 - x)y" + (x + l)y' - 2y = 0 


27. (x 2 + x - 2)y" + (x + l)y' + 2y = 0 
29. y" + (In |x|)y' + 3xy = 0 
31. x 2 y" - 3(sinx)y' + (1 + x 2 )y = 0 

33. (sinx)y" + xy' + 4y = 0 


28. xy" + e*y' + (3 cosx)y = 0 
30. x 2 y" + 2(e x - 1 )y' + (e~ x cosx)y = 0 
32. xy" + y' + (cotx)y = 0 

34. (x sinx)y" + 3y' + xy = 0 


35. Find all values of a for which all solutions of x 2 y" + axy' + (5/2 )y = 0 approach zero as 
x — > 0. 
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36. Find all values of fi for which all solutions of x 2 y" + f)y = 0 approach zero as x — > 0. 

37. Find y so that the solution of the initial value problem x 2 y" — 2 y = 0, y(l) = l,y'(l) = y 
is bounded as x — > 0. 

38. Find all values of a for which all solutions of x 2 y" + orxy' + (5/2 )y — 0 approach zero as 
x — > oo. 

39. Consider the Euler equation x 2 y" + axy' + fiy = 0. Find conditions on a and fi so that: 

(a) All solutions approach zero as x -> 0. 

(b) All solutions are bounded as x -> 0. 

(c) All solutions approach zero as x — > oo. 

(d) All solutions are bounded as x -> oo. 

(e) All solutions are bounded both as x -> 0 and as x — > oo. 

40. Using the method of reduction of order, show that if r\ is a repeated root of 

r(r — 1) + ar + f) = 0, 

then x n and x n In x are solutions of x 2 y" + axy' + fiy = 0 for x > 0. 


In each of Problems 41 and 42 show that the point x = 0 is a regular singular point. In each 

OO 

problem try to find solutions of the form a n x n . Show that (except for constant multiples) 

n = 0 

there is only one nonzero solution of this form in Problem 41 and that there are no nonzero 

solutions of this form in Problem 42. Thus in neither case can the general solution be found 

in this manner. This is typical of equations with singular points. 

41. 2xy" + 3/ + xy = 0 

42. 2 x 2 y" + 3 xy' - (1 + x)y = 0 

43. Singularities at Infinity. The definitions of an ordinary point and a regular singular point 
given in the preceding sections apply only if the point x 0 is finite. In more advanced work 
in differential equations it is often necessary to consider the point at infinity. This is done 
by making the change of variable £ = 1/x and studying the resulting equation at § = 0. 
Show that, for the differential equation 

P(x)y" + (Q(x)y' + R(x)y = 0, 


the point at infinity is an ordinary point if 


1 


pam 


2Pam 


Q( 1 / 5 )] 

? 2 . 


and 


mm 

ppam 


have Taylor series expansions about § = 0. Show also that the point at infinity is a regular 
singular point if at least one of the above functions does not have a Taylor series expansion. 


but both 


pam 


2Pam 


Qam 

? 2 


and 


Ram 

ppam 


do have such expansions. 


In each of Problems 44 through 49 use the results of Problem 43 to determine whether the 
point at infinity is an ordinary point, a regular singular point, or an irregular singular point of 
the given differential equation. 

44. /' + y = 0 

45. x 2 y" + xy' - 4y = 0 
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46. (1 — x 2 )y" — 2xy' + a(a + \)y = 0, Legendre equation 

47. x 2 y" + xy' + (x 2 — v 2 )y = 0, Bessel equation 

48. y" — 2 xy' + Xy = 0, Hermite equation 

49. y" — xy = 0, Airy equation 


5.5 Series Solutions Near a Regular Singular Point, Part I 


We now consider the question of solving the general second order linear equation 

P(x)y" + Q{x)y' + R(x)y = 0 (1) 

in the neighborhood of a regular singular point x = x 0 . For convenience we assume 
that xq = 0. If a'o / 0, we can transform the equation into one for which the regular 
singular point is at the origin by letting x — xq equal I. 

The fact that x = 0 is a regular singular point of Eq. (1) means that xQ(x)/P(x) = 
xp(x) and x 2 R(x) / P(x) = x 2 q(x) have finite limits as x — > 0 and are analytic at x = 0. 
Thus they have convergent power series expansions of the form 

OO OO 

xp(x) = y^p n x n , x 2 q(x) — (2) 

n = 0 n = 0 

on some interval |x| < p about the origin, where p > 0. To make the quantities xp(x) 
and x 2 q(x) appear in Eq. (1), it is convenient to divide Eq. (1) by P(x ) and then to 
multiply by x 2 , obtaining 

x 2 y" + x[xp(x)]y' + [x 2 q(x)]y = 0, (3) 


or 


x 2 y" + x(p 0 + pix H h p n x n -\ )/ 

+ (q 0 + qix-\ b q n x" H )y = 0. (4) 


If all of the coefficients p„ and q n are zero, except possibly 


,. xQ(x) 

P o = lim — and q 0 
x ^0 P(x) 


lim 


x 2 R(x) 


x ^0 P(x) 


then Eq. (4) reduces to the Euler equation 

x 2 y" + poxy 1 + q 0 y = 0, 


( 5 ) 

( 6 ) 


which was discussed in the preceding section. In general, of course, some of the 
p n and q n , n > 1, are not zero. However, the essential character of solutions of 
Eq. (4) is identical to that of solutions of the Euler equation (6). The presence of 

the terms p\ x + b p n x n H and qix + b q n x n + ■ ■ ■ merely complicates the 

calculations. 

We restrict our discussion primarily to the interval x > 0. The interval x < 0 can 
be treated, just as for the Euler equation, by making the change of variable x — — f 
and then solving the resulting equation for § > 0. 
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Since the coefficients in Eq. (4) are “Euler coefficients” times power series, it is 
natural to seek solutions in the form of “Euler solutions" times power series. Thus 
we assume that 

OO OO 

y = x r (a 0 + a\x H f a n x n H ) = x r Y a n x n = Y a n x r+ ' 1 , (7) 

n = 0 n = 0 

where ao / 0. In other words, r is the exponent of the first term in the series, and ao 
is its coefficient. As part of the solution we have to determine: 

1. The values of r for which Eq. (1) has a solution of the form (7). 

2. The recurrence relation for the coefficients a n . 

OO 

3. The radius of convergence of the series £ a nX n ■ 

n = 0 

The general theory was constructed by Frobenius 12 and is fairly complicated. 
Rather than trying to present this theory, we simply assume in this and the next 
two sections that there does exist a solution of the stated form. In particular, we 
assume that any power series in an expression for a solution has a nonzero radius of 
convergence and concentrate on showing how to determine the coefficients in such 
a series. To illustrate the method of Frobenius, we first consider an example. 


EXAMPLE 

1 


Solve the differential equation 

2 x 2 y" — xy’ + (1 + x)y = 0. (8) 

It is easy to show that x = 0 is a regular singular point of Eq. (8). Further, xp(x) — —1/2 
&r\dx 2 q(x) = (1 +x)/2. Thus p 0 = —l/2,q 0 = 1/2 ,qi = 1/2, and all other p’s and q's are zero. 
Then, from Eq. (6), the Euler equation corresponding to Eq. (8) is 

2 x 2 y" -xy’ + y = 0. (9) 

To solve Eq. (8), we assume that there is a solution of the form (7). Then / and y" are 
given by 

OO 

y = a n (r + n)x r+n ~ 1 (10) 

H= 0 

and 

OO 

y" = y ^ a n (r + n)(r + n - l)x r+n ~ 2 . (11) 

n= o 

By substituting the expressions for y, y’ , and y" in Eq. (8), we obtain 

OO 

2 x 2 y" — xy' + (1 + x)y = ^ 2a n (r + n)(r + n — l)x r+n 

n = 0 


OO OO OO 

— Y. a n {r + ri)x r+n + a n x r+n + a n x r+n+1 ■ (12) 

n = 0 n = 0 n = 0 


12 Ferdinand Georg Frobenius (1849-1917) was (like Fuchs) a student and eventually a professor at the 
University of Berlin. Fie showed how to construct series solutions about regular singular points in 1874. 
His most distinguished work, however, was in algebra, where he was one of the foremost early developers 
of group theory. 
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The last term in Eq. (12) can be written as a n -\X r+n , so by combining the terms in Eq. (12), 

n = 1 

we obtain 

2x 2 y" — xy' + (1 + x)y — a 0 [2r(r — 1) — r + l]x r 

OO 

+ ^ {[2 (r + ri)(r + n - 1) - (/■ + n) + 1] + «„_!} x r+ '' = 0. (13) 

n = 1 

If Eq. (13) is to be satisfied for all x, the coefficient of each power of x in Eq. (13) must be 
zero. From the coefficient of x r we obtain, since «o 7 ^ 0, 

2 r(r - 1) - r + 1 = 2r 2 - 3r + 1 = (r - l)(2r - 1) = 0. (14) 

Equation (14) is called the indicial equation for Eq. (8). Note that it is exactly the polynomial 
equation we would obtain for the Euler equation (9) associated with Eq. (8). The roots of the 
indicial equation are 

n = 1, r 2 — 1/2. (15) 

These values of r are called the exponents at the singularity for the regular singular point 
x = 0. They determine the qualitative behavior of the solution (7) in the neighborhood of the 
singular point. 

Now we return to Eq. (13) and set the coefficient of x r+n equal to zero. This gives the relation 
[2(r + n)(r + n - 1) - (r + n) + l]a„ + a n -i = 0, n > 1, (16) 


or 


Myi — 1 

2 (r + n) 2 — 3 (r + ri) + 1 


1 


[(r + n) - l][2(r + ri) - 1] 


w > 1. 


(17) 


For each root and r 2 of the indicial equation, we use the recurrence relation (17) to determine 
a set of coefficients a\,a 2 , .... For r = r\ = 1, Eq. (17) becomes 


&n—l 


(2 n + 1 )n ’ 


« > 1. 


Thus 


= — 


d2 = ~ 


a 0 

3/1 ’ 

«1 

5-2 ” (3 • 5) ( 1 • 2) ’ 


«o 


and 


In general, we have 


ClT, = — 


Cl2 


a o 


a„ = 


7-3 (3 ■ 5 ■ 7)(1 • 2 ■ 3) ’ 

(- 1 )" 


[3 ■ 5 • 7 • ■ ■ (2n + l)]n! 


flo. 


n > 4. 


(18) 


Multiplying the numerator and denominator of the right side of Eq. (18) by 2 • 4 • 6 • • • 2n = 
2 n n\, we can rewrite a n as 


(-1)"2" 

Cl n = — — an. 

(2n + 1)! 


n > 1. 
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Hence, if we omit the constant multiplier no- one solution of Eq. (8) is 


yi(x) = x 


i + E 


( — 1)"2" „ 
(2n + l)\ X 


x > 0. 


(19) 


To determine the radius of convergence of the series in Eq. (19), we use the ratio test: 


lim 

n—> oo 


Mn+lX 


n + 1 


a n x n 


= lim 


2\x\ 


n—*oo (2 n + 2) (2 n + 3) 


= 0 


for all x. Thus the series converges for all x. 

Corresponding to the second root r = r 2 = 1, we proceed similarly. From Eq. (17) we have 


Cln—1 


Cln—1 


2n (n — 1) n(2n — 1) ’ 


n > 1. 


Hence 


Cl\ = — 


a 2 = - 


a 3 = - 


a o 

O’ 
a \ 

2-3 ~ (1 • 2) (1 • 3) ’ 

a 2 Oq 


a o 


and, in general, 


3-5 (1 • 2 • 3)(1 • 3 • 5) ’ 

(-D" 


n![l ■ 3 ■ 5 ■ ■ ■ (2n — 1)] 


ao, n > 4. 


( 20 ) 


Just as in the case of the first root n , we multiply the numerator and denominator by 
2 ■ 4 ■ 6 ■ ■ ■ 2 n = 2 "n\. Then we have 


(-l)"2 n 

(2/i)! 


oo, n > 1. 


Again omitting the constant multiplier a 0 , we obtain the second solution 


y 2 (x) = x 


_ v l/2 


i + fc^V 


jc > 0. 


( 21 ) 


As before, we can show that the series in Eq. (21) converges for all x. Since yi and y 2 behave 
like x and x 1/2 , respectively, near x = 0, they form a fundamental set of solutions. Hence the 
general solution of Eq. (8) is 


y — c iyi(x) + c 2 y 2 (x), x > o. 


The preceding example illustrates that if x = 0 is a regular singular point, then 
sometimes there are two solutions of the form (7) in the neighborhood of this point. 
Similarly, if there is a regular singular point at x = xq, then there may be two solutions 
of the form 

OO 

y = (x- xqY a n (* - x o) n ( 22 ) 

n = 0 

that are valid near x = xo. However, just as an Euler equation may not have two 
solutions of the form y = x r , so a more general equation with a regular singular point 
may not have two solutions of the form (7) or (22). In particular, we show in the next 
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section that if the roots r\ and r 2 of the indicial equation are equal, or differ by an 
integer, then the second solution normally has a more complicated structure. In all 
cases, though, it is possible to find at least one solution of the form (7) or (22); if r\ 
and r 2 differ by an integer, this solution corresponds to the larger value of r. If there 
is only one such solution, then the second solution involves a logarithmic term, just as 
for the Euler equation when the roots of the characteristic equation are equal. The 
method of reduction of order or some other procedure can be invoked to determine 
the second solution in such cases. This is discussed in Sections 5.6 and 5.7. 

If the roots of the indicial equation are complex, then they cannot be equal or 
differ by an integer, so there are always two solutions of the form (7) or (22). Of 
course, these solutions are complex-valued functions of x. However, as for the Euler 
equation, it is possible to obtain real-valued solutions by taking the real and imaginary 
parts of the complex solutions. 

Finally, we mention a practical point. If P, Q, and R are polynomials, it is often 
much better to work directly with Eq. (1) than with Eq. (3). This avoids the necessity 
of expressing xQ(x)/P(x ) and x 2 R(x) / P(x) as power series. For example, it is more 
convenient to consider the equation 

x(\ + x)y" + 2/ +xy — 0 


than to write it in the form 


x 2 y" ■ 


2x . x 2 

-y + = o. 


1 + X 1 + x 

which would entail expanding 2x/(l + x) and x 2 /(l + x) in power series. 


PROBLEMS 


In each of Problems 1 through 10: 

(a) Show that the given differential equation has a regular singular point at x = 0. 

(b) Determine the indicial equation, the recurrence relation, and the roots of the indicial 
equation. 

(c) Find the series solution (x > 0) corresponding to the larger root. 

(d) If the roots are unequal and do not differ by an integer, find the series solution corre- 
sponding to the smaller root also. 


1. 2xy" + / + xy = 0 
3. xy" + y = 0 
5. 3x : l y" + 2xy' + x 2 y = 0 
7. xy" + (1 - x)y' -y = 0 
9. x 2 y" - x(x + 3)/ + (x + 3)y = 0 
11. The Legendre equation of order a is 


2. x 2 y" + xy' + (x 2 - | ) y = 0 
4. xy" +y -y = 0 
6. x 2 y" + xy' + (x - 2)y = 0 
8. 2 x 2 y" + 3 xy' + (2x 2 - l)y = 0 
10. x 2 y" +{x 2 +\)y = 0 


(1 — x 2 )y" — 2 xy' + a(a + 1 )y = 0. 


The solution of this equation near the ordinary point x = 0 was discussed in Problems 22 
and 23 of Section 5.3. In Example 4 of Section 5.4 it was shown that x = ±1 are regular 
singular points. 

(a) Determine the indicial equation and its roots for the point x = 1. 
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(b) Find a series solution in powers of x — 1 for x — 1 > 0. 

Hint: Write 1 + x = 2 + (x — 1) and x = 1 + (x — 1). Alternatively, make the change of 
variable x — 1 = t and determine a series solution in powers of t. 

12. The Chebyshev equation is 

(1 - x 2 )y" - xy' + cry = 0, 
where a is a constant; see Problem 10 of Section 5.3. 

(a) Show that x = 1 and x = — 1 are regular singular points, and find the exponents at 
each of these singularities. 

(b) Find two solutions about x = 1. 

13. The Laguerre 13 differential equation is 

xy" + (1 - x)y' + Ay = 0. 

(a) Show that x = 0 is a regular singular point. 

(b) Determine the indicial equation, its roots, and the recurrence relation. 

(c) Find one solution (x > 0). Show that if X = m, a positive integer, this solution reduces 
to a polynomial. When properly normalized, this polynomial is known as the Laguerre 
polynomial, L m (x). 

14. The Bessel equation of order zero is 

x 2 y" + xy' + x 2 y - 0. 

(a) Show that x = 0 is a regular singular point. 

(b) Show that the roots of the indicial equation are r\ = r 2 = 0. 

(c) Show that one solution for x > 0 is 


OO 

/ow = i + 

n = 1 


(-l^X 2 " 

2 2n (n\) 2 


(d) Show that the series for Jq(x) converges for all x. The function J 0 is known as the 
Bessel function of the first kind of order zero. 

15. Referring to Problem 14, use the method of reduction of order to show that the second 
solution of the Bessel equation of order zero contains a logarithmic term. 

Hint: Ify 2 (x) = / 0 (x)i>(x), then 

f dx 

y 2 (x) =/ 0 (x) / r ■ 

J x[J 0 (x)Y 

Find the first term in the series expansion of l/x[/o(x)] 2 . 

16. The Bessel equation of order one is 

x 2 y" + xy' + (x 2 - l)y = 0. 

(a) Show that x = 0 is a regular singular point. 

(b) Show that the roots of the indicial equation are r\ = 1 and r 2 = —1. 


13 Edmond Nicolas Laguerre ( 1834-1886), a French geometer and analyst, studied the polynomials named 
for him about 1879. 
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(c) Show that one solution for x > 0 is 

/ lW = *E ( - 1)n *V 

2 " (/; + l)\n\2 2n 

(d) Show that the series for Jflx ) converges for all x. The function / i is known as the 
Bessel function of the first kind of order one. 

(e) Show that it is impossible to determine a second solution of the form 

OO 

x~ l b n x n , x > 0. 

n = 0 


5.6 Series Solutions Near a Regular Singular Point, Part II 

Now let us consider the general problem of determining a solution of the equation 
L[y ] = x 2 y" + x[xp{x)]y' + [x 2 q(x)]y = 0, (1) 

where 

OO OO 

xp(x) = y^p n x n , x 2 q(x) = E q„x n , (2) 

n= 0 n= 0 

and both series converge in an interval \x\ < p for some p > 0. The point x — 0 is a 
regular singular point, and the corresponding Euler equation is 

x~y" + poxy' + q 0 y = 0. (3) 

We seek a solution of Eq. (1) for x > 0 and assume that it has the form 

OO OO 

y — <P (r , x) = x r E a nX n = E a nX r+n , (4) 

n= 0 n=0 

where ao ^ 0, and we have written y = 0 (r, x) to emphasize that cp depends on r as 
well as x. It follows that 

OO OO 

y' = E 0 + n)a n x' +n -\ y" = ^(r + n)(r + n- 1 )a n x r+n ~ 2 . 

n= 0 n=0 

Then, substituting from Eqs. (2), (4), and (5) in Eq. (1) gives 

aor(r — l)x r + a\(r + 1 )rx r+1 H + a n (r + n)(r + n — \ )x r+n + ■■■ 

+ (po +PlX H h PnX 11 H ) 

x [a 0 rx r + a\{r + l)x r+1 H h a n (r + n)x r+n H ] 

+ (<7o + 4l x + • • • + l JnX n + • • •) 

x {pox' + a\x r+l + • • ■ + a n x r+n + ■••) = 0. 


( 5 ) 
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Multiplying the infinite series together and then collecting terms, we obtain 

a 0 F(r)x r + [aiF(r + 1) + a 0 (pir + q\)]x r+l 

+ {a 2 F(r + 2) + a 0 (p 2 r + q 2 ) + a 2 [p\(r + 1) + qi\) x r+2 

H b {a„F(r + n ) + a 0 (p n r + q n ) + (r + 1) + q n -i\ 

+ b a n -i[pi(r + n- 1) + qi\) x r+n H = 0, 

or, in a more compact form, 

L[cj)}(r,x) = a 0 F(r)x r 

oo T n — 1 | 

+ X! P + n)Cln + ak ^ r + + qn-k\ 1 X r+ " = 0, (6) 

n=l [ k= 0 J 

where 

F(r) = r(r-l)+p 0 r + q 0 . (7) 

For Eq. (6) to be satisfied for all x > 0, the coefficient of each power of x must be 
zero. 

Since a o ^ 0, the term involving x r yields the equation F{r ) = 0. This equation is 
called the indicial equation; note that it is exactly the equation we would obtain in 
looking for solutions y — x r of the Euler equation (3). Let us denote the roots of 
the indicial equation by r\ and r 2 with r\ > r 2 if the roots are real. If the roots are 
complex, the designation of the roots is immaterial. Only for these values of r can we 
expect to find solutions of Eq. (1) of the form (4). The roots /q and r 2 are called the 
exponents at the singularity; they determine the qualitative nature of the solution in 
the neighborhood of the singular point. 

Setting the coefficient of x r+n in Eq. (6) equal to zero gives the recurrence relation 


n — 1 

F(r + n)a„ + ^ a k [{r + k)p n - k + q n - k \ = 0, n > 1. (8) 

k=0 


Equation (8) shows that, in general, a n depends on the value of r and all the pre- 
ceding coefficients aa,a\,... ,a n -\. It also shows that we can successively compute 
cii,a 2 , . . . , a n , ... in terms of ao and the coefficients in the series for xp(x) and x 2 q(x), 
provided that F(r + 1 ),F(r + 2), . . . ,F(r + «),... are not zero. The only values of 
r for which F(r) = 0 are r = /q and r = r 2 , since r\ > r 2 , it follows that /q + n is not 
equal to r\ or r 2 for n > 1. Consequently, F{r\ + n) / 0 for n > 1. Hence we can 
always determine one solution of Eq. (1) in the form (4), namely, 


yi(x)=x ri 


OO 

1 + ^2,a n (ji)x n 
n = 1 


x > 0. 


( 9 ) 


Here we have introduced the notation a n (r\) to indicate that a n has been determined 
from Eq. (8) with r = r\. To specify the arbitrary constant in the solution, we have 
taken ao to be 1. 
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If r 2 is not equal to r\, and r\ — r 2 is not a positive integer, then r 2 + n is not equal 
to r\ for any value of n > 1; hence F(r 2 + n) / 0, and we can also obtain a second 
solution 


yi(x) = x r > 


OO 

1 + ^2 a nYl)x" 
n= 1 


x > 0. 


( 10 ) 


Just as for the series solutions about ordinary points discussed in Section 5.3, the 
series in Eqs. (9) and (10) converge at least in the interval |x| < p where the series for 
both xp(x ) and x 1 q(x) converge. Within their radii of convergence, the power series 

OO OO 

1 + a n {r\)x n and 1 + YZ a n (r 2 )x n define functions that are analytic atx = 0. Thus 

n= 1 n = 1 

the singular behavior, if there is any, of the solutions jq and y 2 is due to the factors 
x n and x n that multiply these two analytic functions. Next, to obtain real-valued 
solutions for x < 0, we can make the substitution x = — § with § > 0. As we might 
expect from our discussion of the Euler equation, it turns out that we need only 
replace x n in Eq. (9) and x 1 ' 2 in Eq. (10) by \x\ n and \x\ ri , respectively. Finally, note 
that if ri and r 2 are complex numbers, then they are necessarily complex conjugates 
and r 2 ^ r\+ N for any positive integer N. Thus, in this case we can always find two 
series solutions of the form (4); however, they are complex-valued functions of x. 
Real-valued solutions can be obtained by taking the real and imaginary parts of the 
complex-valued solutions. The exceptional cases in which r\ = r 2 or r\ — r 2 = N, 
where A is a positive integer, require more discussion and will be considered later in 
this section. 

It is important to realize that r\ and r 2 , the exponents at the singular point, are easy 
to find and that they determine the qualitative behavior of the solutions. To calculate 
ri and r 2 , it is only necessary to solve the quadratic indicial equation 


r{r - 1) +p 0 r + q 0 = 0, (11) 

whose coefficients are given by 

po = lim A'p(x), q 0 — lim x 2 q(x). (12) 

jt ^-0 x ^-0 

Note that these are exactly the limits that must be evaluated in order to classify the 
singularity as a regular singular point; thus they have usually been determined at an 
earlier stage of the investigation. 

Further, if x = 0 is a regular singular point of the equation 

P(x)y" + Q(x)y' + R(x)y = 0, (13) 


where the functions P, Q, and R are polynomials, then xp(x) = xQ(x)/ P(x) and 
x 2 q(x) = x 2 R(x)/P(x). Thus 


Po 


= limx 


Q(x) 
P(x) ’ 


<7o 


= lim jr 


R(x) 

P(xY 


(14) 


Finally, the radii of convergence for the series in Eqs. (9) and (10) are at least equal 
to the distance from the origin to the nearest zero of P other than .r = 0 itself. 
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EXAMPLE 

1 


Discuss the nature of the solutions of the equation 

2x(l + x)y" + (3 + x)y' — xy = 0 

near the singular points. 

This equation is of the form (13) with P(x) = 2x(l + x), Q(x) = 3 + x, and R(x) = — x. The 
points x = 0 and x = — 1 are the only singular points. The point x = 0 is a regular singular 
point, since 


,. QW ,. 3 + x 3 

hmx = hmx = 

x->o P(x) -v — >-0 2x(l + x) 2 


lim x 2 


R(x) 

P(x) 


= limx 2 

x^O 


—X 

2x(l + x) 


= 0. 


Further, from Eq. (14), p 0 = \ and <70 = 0. Thus the indicial equation is r(r — 1) + | r = 0, and 
the roots are r\ = 0, r 2 = — \ . Since these roots are not equal and do not differ by an integer, 
there are two solutions of the form 


OO 

yi(x) = 1 + ^2,a n { 0)x” 
# 1=1 


and 


y 2 (x) = \x\ 1/2 


1 + ^2a n (-f)x' 

71 = 1 


for 0 < |x| < p. A lower bound for the radius of convergence of each series is 1, the distance 
from x = 0 to x = —1, the other zero of P(x). Note that the solution yi is bounded as x -»• 0, 
indeed is analytic there, and that the second solution y 2 is unbounded as x -> 0. 

The point x = — 1 is also a regular singular point, since 


lim (x + 1) 

X->— 1 


Q(x) 

P(x) 


lim 

X^—l 


(x+ l)(3 + x) 
2x(l + x) 


= -l, 


lim (x + l) 2 

Jt — > — 1 


R(x) 

P(x) 


lim 

*-f-i 


(x + 1) 2 ( x) 
2x(l + x) 


= 0. 


In this case p 0 = — 1, q 0 = 0, so the indicial equation is r(r — 1) — r = 0. The roots of the indicial 
equation are r 2 = 2 and r 2 = 0. Corresponding to the larger root there is a solution of the form 


yi( x ) = (x + l ) 2 


OO 

1 + y ] a n (2)(x + V) n 


n= 1 

The series converges at least for \x + 1| < 1, and y\ is an analytic function there. Since the two 
roots differ by a positive integer, there may or may not be a second solution of the form 


y 2 {x) = 1 + y^a„(0)(x + 1)". 

71 = 1 

We cannot say more without further analysis. 

Observe that no complicated calculations were required to discover the information about 
the solutions presented in this example. All that was needed was to evaluate a few limits and 
solve two quadratic equations. 


We now consider the cases in which the roots of the indicial equation are equal, or 
differ by a positive integer, r\ — r 2 = N. As we have shown earlier, there is always one 
solution of the form (9) corresponding to the larger root r\ of the indicial equation. 
By analogy with the Euler equation, we might expect that if r\ — r 2 , then the second 
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solution contains a logarithmic term. This may also be true if the roots differ by an 
integer. 


Equal Roots. The method of finding the second solution is essentially the same as the 
one we used in finding the second solution of the Euler equation (see Section 5.4) 
when the roots of the indicial equation were equal. We consider r to be a continuous 
variable and determine a n as a function of r by solving the recurrence relation (8). 
For this choice of a n (r ) for n > 1, Eq. (6) reduces to 

L[<p](r,x) = a 0 F(r)x r = a 0 (r - r,)V, (15) 


since r\ is a repeated root of F(r). Setting r — r\ in Eq. (15), we find that 
L[<p]{r\,x) = 0; hence, as we already know, yi ( jc) given by Eq. (9) is one solution 
of Eq. (1). But more important, it also follows from Eq. (15), just as for the Euler 
equation, that 


L 


d(p 

dr 


(r i,x) = a a — [x r {r - ri) 2 ~\ 
dr 


= ao[(r — ri) 2 x r ln.v + 2 (r — r\)x r ] 


= 0 . 

r=r l 


(16) 


Hence, a second solution of Eq. (1) is 


yi(x) = 


d(f>(r.x) 


dr 


— — \ x 

dr I 


(. x n ln.r) 


a 0 + y ^ a n (r)x" 

n = l 

OO 

n 0 + ^a n (ri)x ' 1 


n = 1 




n= 1 


= yi(x) \nx + x n ^ a' n (ri)x n , x > 0, 


(17) 


n= 1 


where a' n {r\) denotes da n /dr evaluated at r — r\. 

It may turn out that it is difficult to determine a n (r) as a function of r from the 
recurrence relation (8) and then to differentiate the resulting expression with respect 
to r. An alternative is simply to assume that y has the form of Eq. (17). That is, assume 
that 

OO 

y = y\ (x) In .r + x n b n x n , x > 0, (18) 

n = 1 


where y\{x) has already been found. The coefficients b n are calculated, as usual, by 
substituting into the differential equation, collecting terms, and setting the coefficient 
of each power of x equal to zero. A third possibility is to use the method of reduction 
of order to find yz(x) once y\{x) is known. 


Roots rj and r 2 Differing by an Integer N. For this case the derivation of the second so- 
lution is considerably more complicated and will not be given here. The form of 
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Theorem 5.6.1 


this solution is stated in Eq. (24) in the following theorem. The coefficients c n (r 2 ) in 
Eq. (24) are given by 


Cnfa) = — [(r- r 2 )a„(r)] 
dr 


n= 1 , 2 ,..., 


(19) 


where a n (r) is determined from the recurrence relation (8) with at, = 1. Further, the 
coefficient a in Eq. (24) is 

a — lim(r — r 2 )a^(r). (20) 

r—^ri 


If « v(> 2 ) is finite, then a — 0 and there is no logarithmic term in V 2 - A full derivation 
of formulas (19) and (20) may be found in Coddington (Chapter 4). 

In practice, the best way to determine whether a is zero in the second solution is 
simply to try to compute the a n corresponding to the root r 2 and to see whether it is 
possible to determine a^(r 2 ). If so, there is no further problem. If not, we must use 
the form (24) with a ^ 0. 

When n-r 2 = N, there are again three ways to find a second solution. First, we 
can calculate a and c„(r 2 ) directly by substituting the expression (24) for y in Eq. (1). 
Second, we can calculate c n {r 2 ) and a of Eq. (24) using the formulas (19) and (20). 
If this is the planned procedure, then in calculating the solution corresponding to 
r = ri, be sure to obtain the general formula for a n (r) rather than just a„(r , ). The 
third alternative is to use the method of reduction of order. 


Consider the differential equation (1) 

x 2 y" + x[xp(x)]y’ + [x 2 q(x)]y — 0, 

where x = 0 is a regular singular point. Then xp(x) and x 2 q(x) are analytic at x = 0 
with convergent power series expansions 

OO OO 

xp{x) = ^2p n x n , x 2 q(x) = ^2 dnX n 

n = 0 n—0 

for |x| < p, where p > 0 is the minimum of the radii of convergence of the power 
series for xp(x) and x 2 q(x). Let r, and r 2 be the roots of the indicial equation 


F(r) = r(r - 1) + p 0 r + q 0 = 0, 


with r\ > r 2 if r, and r 2 are real. Then in either the interval —p<x< 0 or the 
interval 0 < x < p, there exists a solution of the form 


y lW = \x\ n 


OO 

1 + '22a n (r,)x n , 

n = 1 


( 21 ) 


where the a n (r 2 ) are given by the recurrence relation (8) with no = 1 and r = r\. 

If r, — r 2 is not zero or a positive integer, then in either the interval — p < x < 0 
or the interval 0 < x < p, there exists a second solution of the form 

OO 

1 + ^2a n (r 2 )x n 

n=l 


yi(x) = |xp 


( 22 ) 
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The a„ (>' 2 ) are also determined by the recurrence relation (8) with ciq = 1 and r = r 2 . 
The power series in Eqs. (21) and (22) converge at least for |x| < p. 

If ri — r 2 , then the second solution is 

OO 

yi(x) = y\ (x) In |x| + \x\ n (23) 

n = 1 


If r\ — rj = N, a positive integer, then 


yi(x) = ay 1 (x) In \x\ + \x\ ri 


OO 

1 + ^c„(7- 2 )x" 
n = 1 


(24) 


The coefficients a n {r\), b n (r \ ), c n (r 2 ), and the constant a can be determined by 
substituting the form of the series solutions for y in Eq. (1). The constant a may 
turn out to be zero, in which case there is no logarithmic term in the solution (24). 
Each of the series in Eqs. (23) and (24) converges at least for |x| < p and defines a 
function that is analytic in some neighborhood of x = 0. 

In all three cases the two solutions yi(x) and y 2 (x) form a fundamental set of 
solutions of the given differential equation. 


PROBLEMS 


In each of Problems 1 through 12: 

(a) Find all the regular singular points of the given differential equation. 

(b) Determine the indicial equation and the exponents at the singularity for each regular 


singular point. 

1. xy" + 2 xy' + 6e*y = 0 
3. x(x - 1 )y" + 6 x 2 y' + 3y = 0 
5. x 2 y" + 3(sinx)/ — 2y = 0 
7. x 2 y" + I(x + sin.r)/ + y = 0 
9. x 2 (l - x)y" - (1 + x)y' + 2 xy = 0 
11. (4 - x 2 )y" + Ixy' + 3y = 0 


2. x 2 y" - x(2 + x)y' + (2 + x 2 )y = 0 
4. y" + 4 xy' + 6y = 0 
6. 2x(x + 2)y" + y' - xy = 0 
8. (x + l) 2 y" + 3(x 2 - 1 )/ + 3y = 0 
10. (x — 2) 2 (x + 2 )y" + 2 xy' + 3(x — 2 )y = 0 
12. x(x + 3) 2 y" - 2(x + 3)/ - xy = 0 


In each of Problems 13 through 17: 

(a) Show that x — 0 is a regular singular point of the given differential equation. 

(b) Find the exponents at the singular point x — 0. 

(c) Find the first three nonzero terms in each of two solutions (not multiples of each other) 
about x = 0. 


13. xy" + y' — y = 0 

14. xy" + 2 xy' + 6e x y = 0; see Problem 1 

15. x(x - 1 )y" + 6 x 2 y' + 3y = 0; see Problem 3 

16. xy" + y = 0 

17. x 2 y" + (sinx)y' — (cosx)y = 0 

18. (a) Show that 

(lnx)y" + l/ + y = 0 


has a regular singular point at x = 1. 

(b) Determine the roots of the indicial equation at x = 1. 
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oo 

(c) Determine the first three nonzero terms in the series ^ ci n (x — l) r+n corresponding 

n=0 


to the larger root. Take x — 1 > 0. 

(d) What would you expect the radius of convergence of the series to be? 


19. In several problems in mathematical physics it is necessary to study the differential equa- 
tion 

x(l - x)y" + [y - (1 + or + P)x\y' - afiy = 0, (i) 


where a, fi, and y are constants. This equation is known as the hypergeometric equation. 

(a) Show that x = 0 is a regular singular point and that the roots of the indicial equation 
are 0 and 1 — y. 

(b) Show that x = 1 is a regular singular point and that the roots of the indicial equation 
are 0 and y — a — fl. 

(c) Assuming that 1 — y is not a positive integer, show that, in the neighborhood of x = 0, 
one solution of Eq. (i) is 


yi(x) = 1 + 


af) 

T 7 !! 


x + 


a(a + 1)jS(/3 + 1) 2 , 

x + 

y(y + 1)2! 


What would you expect the radius of convergence of this series to be? 

(d) Assuming that 1 — y is not an integer or zero, show that a second solution for 0 < x < 1 
is 


y 2 (x) = x 1 ^ 


1 + 


(a-y + 1 -Kg - y + 1) 

(2 - y)l! 


(a - y + l)(o» - y + 2) (ft - y + 1)Q6 - y+ 2) 2 
(2 — y)(3 — y)2! 


(e) Show that the point at infinity is a regular singular point and that the roots of the 
indicial equation are a and fi. See Problem 43 of Section 5.4. 

20. Consider the differential equation 

x 3 y" + axy' + 0y = 0, 

where a and /) are real constants and a ^ 0. 

(a) Show that x = 0 is an irregular singular point. 

OO 

(b) By attempting to determine a solution of the form a nX r+n , show that the indicial 

n = 0 

equation for r is linear and that, consequently, there is only one formal solution of the 
assumed form. 

(c) Show that if p/a = —1,0, 1,2, ... .then the formal series solution terminates and there- 
fore is an actual solution. For other values of fl /a, show that the formal series solution has 
a zero radius of convergence and so does not represent an actual solution in any interval. 

21. Consider the differential equation 



where a^O and f} ^ 0 are real numbers, and s and t are positive integers that for the 
moment are arbitrary. 

(a) Show that if s > 1 or t > 2, then the point x = 0 is an irregular singular point. 
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(b) Try to find a solution of Eq. (i) of the form 

OO 

y = y ^ a n x r+n , x > 0. (ii) 

n=0 

Show that if s = 2 and t = 2, then there is only one possible value of r for which there is a 
formal solution of Eq. (i) of the form (ii). 

(c) Show that if s = 1 and t = 3, then there are no solutions of Eq. (i) of the form (ii). 

(d) Show that the maximum values of s and t for which the indicial equation is quadratic in 
r [and hence we can hope to find two solutions of the form (ii)] are s = 1 and t = 2. These 
are precisely the conditions that distinguish a “weak singularity,” or a regular singular 
point, from an irregular singular point, as we defined them in Section 5.4. 

As a note of caution, we point out that although it is sometimes possible to obtain a formal 
series solution of the form (ii) at an irregular singular point, the series may not have a 
positive radius of convergence. See Problem 20 for an example. 


5.7 Bessel’s Equation 


In this section we illustrate the discussion in Section 5.6 by considering three special 
cases of Bessel’s 14 equation, 

x 2 y" + xy’ + (x 2 - v 2 )y = 0, (1) 


where v is a constant. It is easy to show that x = 0 is a regular singular point of 
Eq. (1). We have 


r r 1 i 

p o = hmx— — = lirnx- = 1, 

P(x) x->-0 X 


.. .• 2 
<7o = limx — — = lim.r 
x^O P(x) x — ^0 


Thus the indicial equation is 

F(r ) = r(r - 1) + p 0 r + q 0 = r(r - 1) + r - v 2 = r 2 - v 1 — 0, 

with the roots r = ±v. We will consider the three cases v = 0, v — j, and v = 1 for 
the interval x > 0. 


Bessel Equation of Order Zero. In this case v = 0, so Eq. (1) reduces to 

L[y] = x 2 y" + xy' + x 2 y = 0, 


( 2 ) 


14 Friedrich Wilhelm Bessel (1784-1846) embarked on a career in business as a youth but soon became 
interested in astronomy and mathematics. He was appointed director of the observatory at Konigsberg in 
1810 and held this position until his death. His study of planetary perturbations led him in 1824 to make 
the first systematic analysis of the solutions, known as Bessel functions, of Eq. (1). He is also famous for 
making, in 1838, the first accurate determination of the distance from the earth to a star. 
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and the roots of the indicial equation are equal: r\ = t- 2 = 0. Substituting 

OO 

y = <j)(r,x) = a 0 x r + ^ a n x r+n , (3) 

n = 1 

in Eq. (2), we obtain 

OO OO 

L[cf>](r,x) — ^2 a nl(r + n){r + n - 1) + (r + n)]x r+n + ^ a n x r+ ' l+ 2 

n = 0 n = 0 


= a {) [r(r - 1) + r]x r + m[(r + l)r + (r + l)]x r+1 


OO 

+ ^2 { a n[( r + n)(r + n - 1) + (r + n)] + a„_ 2 } x r+n = 0. (4) 

n= 2 


As we have already noted, the roots of the indicial equation F(r) = r(r — 1) + r = 0 
are r\ — 0 and r 2 = 0. The recurrence relation is 

. , fin-2 W a n - 2 (r) 

a n (r ) = = r-, n > 2. (5) 

(r + ri)(r + n — 1) + (r + n) (r + n) 2 

To determine yi(x), we set r equal to 0. Then, from Eq. (4), it follows that for 
the coefficient of x r+l to be zero we must choose a 1 = 0. Hence, from Eq. (5), 
n 3 = as = a-] = • • ■ = 0. Further, 


fin(0) = -a„_ 2 (0)/7r 2 , 77 = 2,4,6,8,..., 

or, letting n = 2m, we obtain 

«2m(0) = -a 2m -2(0)/(2m) 2 , m = 1,2,3, .. . . 

Thus 


«2(0) = -g, «4(0) = ^, 


A6(0) = 


flo 


and, in general, 


Hence 


fi 2 m (0) — 


y tW = ao 


2 2m (m\) 2 ’ 


2 6 (3 ■ 2) 2 ’ 
771 = 1,2, 3, ... . 


00 / 1 \m v 2m 

1 + y ( ~ 1} x 

2 lm (ni\) 2 
m= 1 v y . 


x > 0. 


( 6 ) 

( 7 ) 


The function in brackets is known as the Bessel function of the first kind of order 
zero and is denoted by Jo(x). It follows from Theorem 5.6.1 that the series converges 
for all x and that Jq is analytic at x — 0. Some of the important properties of Jq are 
discussed in the problems. Figure 5.7.1 shows the graphs of y — Jq (x) and some of 
the partial sums of the series (7). 

To determine y 2 W, we will calculate «',(()). 15 First we note from the coeffi- 
cient of x r+l in Eq. (4) that (r + l) 2 ni(r) = 0. Thus a\{r) = 0 for all r near r — 0. 


^Problem 10 outlines an alternative procedure, in which we simply substitute the form (23) of Section 5.6 
in Eq. (2) and then determine the b n . 
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FIGURE 5.7.1 Polynomial approximations to J 0 (x). The value of n is the degree of the 
approximating polynomial. 

So not only does ni(0) = 0 but also a'-^O) = 0. From the recurrence relation (5) it 
follows that a' 3 ( 0) = «j(0) = • • • = a 2 /3+ i (0) = • • • = 0; hence we need only compute 
a' lm { 0), m = 1, 2, 3, From Eq. (5) we have 

« 2 mO) = -a 2m - 2 {r)/(r + 2m) 1 , m = 1, 2, 3 

By solving this recurrence relation, we obtain 


a 2 (r) = 


a 0 


a 4 (r) = 


a 0 


and, in general. 


a 2 m (r ) = 


(r + 2) 1 ’ 4W (r + 2) 2 (r + 4) 2 ’ 
(— l) m ao 


m > 3. 


( 8 ) 


(r + 2) 2 •••(/' + 2m) 2 ’ 

The computation of a' 2m (r) can be carried out most conveniently by noting that if 
f(x) = (x - a tf' (x - a^Hx - a 3 ) fe ■ ■ ■ (x - 
and if x is not equal to ai,a 2 , . . . ,a n , then 

f(x) Pi | h | ! Pn 

fix) x — a\ x — a 2 x — a n 

Applying this result to a 2m (r) from Eq. (8), we find that 


a’jjr) 


= -2 


1 


1 


1 


a 2m (r) V r + 2 r + 4 
and setting r equal to 0, we obtain 

a 2 »/ 0 ) = ~ 2 2 + 4 H 

Substituting for a 2m (0) from Eq. (6), and letting 

1 1 

= 1 + ^ ^ + ‘ 


1 

2m 


r + 2m 


^2 m(0)- 


1 

m 


( 9 ) 
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we obtain, finally, 


, (-l) m n 0 

a 2m (0) = -Hm 22m(ml)2 , m = 1,2,3, ... . 

The second solution of the Bessel equation of order zero is found by setting no = 1 
and substituting for y\(x) and a' 2m (0) = b 2m (0) in Eq. (23) of Section 5.6. We obtain 

°° / 1 \m+l TT 

y 2 (x) = hix) lnx + Y, X > °' (10) 

m = 1 V 

Instead of y 2 ,the second solution is usually taken to be a certain linear combination 
of Jo and y 2 . It is known as the Bessel function of the second kind of order zero 
and is denoted by Yq. Following Copson (Chapter 12), we define 16 

Y 0 (x) = ~[y 2 (x) + (y - ln2)/ 0 (x)]. (11) 

71 

Here y is a constant known as the Euler-Mascheroni 17 constant; it is defined by the 
equation 

y = lim (H n - Inn) = 0.5772. (12) 


Substituting for y 2 (x) in Eq. (11), we obtain 


Yq(x) = - 

TT 


oo 

(y + In Jo(x) + Y 


(-1 ) m+1 H, 


m= 1 


2 2m (mly 


m 2m 


X > 0 . 


(13) 


The general solution of the Bessel equation of order zero for x > 0 is 


y = ClJ 0 (x) + C 2 Y 0 (x). 

Note that Jo(x) 1 as .r 0 and that Y {] (x) has a logarithmic singularity at .r = 0; 
that is, YoCx) behaves as (2/jt) Inx when x —> 0 through positive values. Thus, if we 
are interested in solutions of Bessel’s equation of order zero that are finite at the 
origin, which is often the case, we must discard Yq. The graphs of the functions Jo 
and Yq are shown in Figure 5.7.2. 

It is interesting to note from Figure 5.7.2 that for x large, both Jq(x) and Y 0 (x) are 
oscillatory. Such a behavior might be anticipated from the original equation; indeed 
it is true for the solutions of the Bessel equation of order v. If we divide Eq. (1) by 
jr, we obtain 


16 Other authors use other definitions for Yo- The present choice for Yo is also known as the Weber 
function, after Heinrich Weber (1842-1913), who taught at several German universities. 

17 Lorenzo Mascheroni (1750-1800) was an Italian priest and professor at the University of Pavia. He 
correctly calculated the first 19 decimal places of y in 1790. 
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FIGURE 5.7.2 The Bessel functions J 0 and Yi,. 


For x very large it is reasonable to conjecture that the terms (1 /x)y' and (v 2 /x 2 )y are 
small and hence can be neglected. If this is true, then the Bessel equation of order v 
can be approximated by 


y" + y = 0. 


The solutions of this equation are sin x and cos x; thus we might anticipate that the 
solutions of Bessel’s equation for large x are similar to linear combinations of sin x 
and cos x. This is correct insofar as the Bessel functions are oscillatory; however, it is 
only partly correct. For x large the functions Jo and Yq also decay as x increases; thus 
the equation y" + y = 0 does not provide an adequate approximation to the Bessel 
equation for large x, and a more delicate analysis is required. In fact, it is possible to 
show that 


and that 



(14) 


(15) 


These asymptotic approximations, as x — oo, are actually very good. For example, 
Figure 5.7.3 shows that the asymptotic approximation (14) to Jq (x) is reasonably 
accurate for all x > 1. Thus to approximate /o(x) over the entire range from zero to 
infinity, you can use two or three terms of the series (7) for x < 1 and the asymptotic 
approximation (14) forx > 1. 


Bessel Equation of Order One-Half. This case illustrates the situation in which the roots 
of the indicial equation differ by a positive integer but there is no logarithmic term 
in the second solution. Setting v = | in Eq. (1) gives 

L[y] = x 2 y" + xy' + (x 2 -l)y = 0. 


(16) 
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FIGURE 5.7.3 Asymptotic approximation to Jo(x). 


When we substitute the series (3) for y — cp(r,x), we obtain 

OO OO 

L[4>](r,x) = \(r + ri)(r + n — 1) + (r + n) - i] a n x r+n + ^ a n x r+n+2 

72=0 72=0 

= (r 2 - i) a 0 x r + [(r + l) 2 - i] aix r+l 

OO 

+ H { [ (r + n ) 2 ~ ?] x " + " = °- (17) 


72=2 


The roots of the indicial equation are ri = r 2 = — hence the roots differ by an 
integer. The recurrence relation is 


[(r + n f - |] a„ = -a„- 2 , n > 2. 


(18) 


Corresponding to the larger root r\ = we find from the coefficient of x r+l inEq. (17) 
that a\ = 0. Hence, from Eq. (18), a 2 = as — ■ ■ ■ — a 2n+ \ = • • • = 0. Further, for 


2 ’ 


a n = 


or, letting n = 2m, we obtain 


A 2 m — 


a >i 2 

n(n + 1) ’ 


U 2 m—2 


n — 2, 4, 6 , 


2m(2m + 1) ’ 

By solving this recurrence relation, we find that 


m — 1, 2, 3, ... . 


a 0 a 0 

a 2 = , t/4 = — , . . . 


and, in general, 


®2 m — 


(~l) m flo 
(2m + 1)! ’ 


m = 1,2, 3, ... . 
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Hence, taking ao = 1, we obtain 


yi(x)=x 1/2 


i + E 

m= 1 


(_l)/n x 2m 

(2m + 1)! 


OO 

=* _1/2 E 

m = 0 


(~l) m x 2m+1 
(2m + 1)! 


x > 0. (19) 


The second power series in Eq. (19) is precisely the Taylor series for sin x; hence one 
solution of the Bessel equation of order one-half is x _1/2 sinx. The Bessel function 
of the first kind of order one-half, / 1 / 2 , is defined as (2/7r) 1/2 yi. Thus 

/ 2 \ 1/2 

J 1/2 (x) = I — J sinx, x > 0. (20) 


Corresponding to the root r 2 — — \, it is possible that we may have difficulty in 
computing ci\ since N = r± — r 2 = l. However, from Eq. (17) for r = — the coef- 
ficients of x r and x r+1 are both zero regardless of the choice of ao and a\. Hence ao 
and a\ can be chosen arbitrarily. From the recurrence relation (18) we obtain a set 
of even-numbered coefficients corresponding to ao and a set of odd-numbered coef- 
ficients corresponding to a\ . Thus no logarithmic term is needed to obtain a second 
solution in this case. It is left as an exercise to show that, for r = — i. 


Cl2n — 


(2n)\ ' 


fl2«+l = 


(2n + l)!’ 


n — 1,2,. 


Hence 


y 2 (x) = x 


- V-V 2 


«=0 


njln 


(-l)"x 

(2/7)! 


« i I] 


72=0 


(— 1)"x 2m+1 
(2/i + 1)! 


cosx 


smx 


a 0 - 


,1/2 


■ - 


' 1/2 ’ 


x > 0. 


( 21 ) 


The constant «| simply introduces a multiple of yi(x). The second solution of the 
Bessel equation of order one-half is usually taken to be the solution for which 
a 0 = ( 2/jt ) 1/2 and a\ = 0. It is denoted by J-y%. Then 


/ 2 \ 1/2 

/_i/ 2 (x) = I — j cosx, x > 0. (22) 


The general solution of Eq. (16) is y = c\Ji/ 2 (x ) + c 2 J-\/ 2 (x). 

By comparing Eqs. (20) and (22) with Eqs. (14) and (15), we see that, except for a 
phase shift of jt/ 4, the functions J-\/ 2 and J\/ 2 resemble Jq and To, respectively, for 
large x. The graphs of J\/ 2 and J-\/ 2 are shown in Figure 5.7.4. 

Bessel Equation of Order One. This case illustrates the situation in which the roots of 
the indicial equation differ by a positive integer and the second solution involves a 
logarithmic term. Setting v = 1 in Eq. (1) gives 


L[y] = x 2 y" + xy' + (x 2 - 1 )y = 0. 


(23) 
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FIGURE 5.7.4 The Bessel functions J 1/2 and J- 1 / 2 . 


If we substitute the series (3) for y = <p(r,x) and collect terms as in the preceding 
cases, we obtain 

L[<p](r,x) = a 0 (r 2 - l)x r + ai[(r + l) 2 - l]x r+1 

OO 

+ { [ (r + n) 2 -!]«» + a n- 2 } X r+n = 0. (24) 

n= 2 

The roots of the indicial equation are r\ — 1 and r 2 = —1. The recurrence relation is 
[(r + nf - 1 ]a n (r) = -a n - 2 (r), n > 2. (25) 


Corresponding to the larger root r = 1, the recurrence relation becomes 


(hi 2 

( n + 2 )n ’ 


n = 2,3,4, ... . 


We also find from the coefficient of x r+1 in Eq. (24) that ci\ = 0; hence from the 
recurrence relation, aj, = a 5 = • • • = 0. For even values of n, let n = 2 m\ then 


^2 m — 


Cl2m—2 


tt2m-2 


(2m + 2) (2 m) 2 2 (m + 1 )m ’ 

By solving this recurrence relation, we obtain 

(— l) m 0o 


m = 1,2,3, ... . 


@2m — 


2 2m (rn + l)!m! ’ 


m — 1,2,3, ... . 


(26) 


The Bessel function of the first kind of order one, denoted by / 1 , is obtained by 
choosing a 0 = 1/2. Hence 


mJ2m 


1 , , x (— l)"‘x 

AW = 


2 ' 2 2,n (m + 1 )\m\ 

m = 0 v ’ 


(27) 


The series converges absolutely for all x, so the function ,I\ is analytic everywhere. 

In determining a second solution of Bessel’s equation of order one, we illustrate 
the method of direct substitution. The calculation of the general term in Eq. (28) 
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below is rather complicated, but the first few coefficients can be found fairly easily. 
According to Theorem 5.6.1, we assume that 


y 2 (x) = a.l i (x) In x + x 


1 + Y C„x n 
n = 1 


x > 0. 


(28) 


Computing y' 2 (x),y , 2 (x), substituting in Eq. (23), and making use of the fact that 1\ is 
a solution of Eq. (23), we obtain 


OO OO 

2 axJ[(x) + Y, [( n ~ 1)(« — 2 )c„ + (n — l)c„ — c„] x" _1 + ^ c„x' I+1 = 0, (29) 

n= 0 n= 0 

where co = 1. Substituting for J\ (x) from Eq. (27), shifting the indices of summation 
in the two series, and carrying out several steps of algebra, we arrive at 


OO 

-Ci + [0 • C 2 + C 0 ]X + Y t(" 2 * ~ l)Cn+l + C n -l]X n 

n—2 


= —a 


E 

m = 1 


(— l) m (2m + l)x 2m+1 
2 lm (m + 1)! nz! 


■ (30) 


From Eq. (30) we observe first that ci = 0, and a = —cq — —1. Further, since there 
are only odd powers of x on the right, the coefficient of each even power of x on the 
left must be zero. Thus, since Ci = 0, we have C3 = C5 = • • • = 0. Corresponding to 
the odd powers of x, we obtain the recurrence relation [let n = 2m + 1 in the series 
on the left side of Eq. (30)] 

r/o ,^2 n , (—l) m (2m + 1) 

[(2m + 1) - l]c 2m+2 + C 2m - 22m{m + |)!/;; , , m = 1,2,3,.... (31) 

When we set m — 1 in Eq. (31), we obtain 

(3 2 — l)c 4 + c 2 — (—1)3/ (2 2 • 2!). 


Notice that c 2 can be selected arbitrarily , and then this equation determines c 4 . Also 
notice that in the equation for the coefficient of x , c 2 appeared multiplied by 0, and 
that equation was used to determine a. That c 2 is arbitrary is not surprising, since c 2 


is the coefficient of x in the expression x 1 


OO 


1 + E c nX H 

n = 1 


Consequently, c 2 simply 


generates a multiple of Ji, and y 2 is determined only up to an additive multiple of J\ . 
In accordance with the usual practice, we choose C2 = 1/2 2 . Then we obtain 


c 4 


-1 
2 4 ■ 2 



-1 

242 ! 



2 4 - 2 ! " 


It is possible to show that the solution of the recurrence relation (31) is 

(-l) m+ i (H m + H m _ 0 


^2 m — 


2 2m m\(m — 1)! 


m — 1 , 2 ,... 
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PROBLEMS 


with the understanding that Ho = 0. Thus 


yi(x) = -/i(*)ln* + 


1 

x 


i-E 

m = 1 


(— 1 )"\H m + H m _ i) ^ 2m 
2 2m m\(m — 1)! 


x > 0. (32) 


The calculation of y 2 (x) using the alternative procedure [see Eqs. (19) and (20) 
of Section 5.6] in which we determine the c n (r 2 ) is slightly easier. In particular, the 
latter procedure yields the general formula for c 2m without the necessity of solving a 
recurrence relation of the form (31) (see Problem 11). In this regard, you may also 
wish to compare the calculations of the second solution of Bessel’s equation of order 
zero in the text and in Problem 10. 

The second solution of Eq. (23), the Bessel function of the second kind of order 
one, Yj, is usually taken to be a certain linear combination of J\ and y 2 . Following 
Copson (Chapter 12), Y\ is defined as 

Y x (x) = 2 [~y 2 (x) + (y - ln2)/|(x)], (33) 

TV 

where y is defined in Eq. (12). The general solution of Eq. (23) for x > 0 is 


y = cH\{x) + c 2 Yi(x). 


Notice that although J\ is analytic at x = 0, the second solution Y\ becomes un- 
bounded in the same manner as 1/x as x — ► 0. The graphs of /j and Y\ are shown in 
Figure 5.7.5. 


y 

l 





0.5 

y = Jj(x) 

y = Y 1 (x) 

\ 1 / 

\ 

1 



2 4 , 6 8 

10\ 

12 

14 x 

-0.5 






FIGURE 5.7.5 The Bessel functions J\ and Y\. 


In each of Problems 1 through 4 show that the given differential equation has a regular singular 
point at x = 0, and determine two solutions for x > 0. 

1. x 2 y" + 2 xy' + xy = 0 2. x 2 y" + 3 xy' + (1 + x)y = 0 

3. x 2 y" + xy' + 2 xy = 0 4. x 2 y" + 4 xy' + (2 + x)y = 0 

5. Find two solutions (not multiples of each other) of the Bessel equation of order | 

x 2 y"+xy' + (x 2 -l)y = 0, 


x > 0. 
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6. Show that the Bessel equation of order one -half 

x 2 y" + xy' + (x 2 - i) y = 0, x > 0 
can be reduced to the equation 

v" + v = 0 

by the change of dependent variable y = x~ 1/2 v(x). From this conclude that 
yi(x ) = x _1/2 cosx and y 2 (x) = x _1/2 sinx are solutions of the Bessel equation of order 
one-half. 

7. Show directly that the series for /o(x), Eq. (7), converges absolutely for all x. 

8. Show directly that the series for J\ (x) , Eq. (27), converges absolutely for all x and that 
JoQc) = J i (x) . 

9. Consider the Bessel equation of order v 

x 2 y" + xy' + (x 2 — v 2 )y = 0, x > 0, 


where v is real and positive. 

(a) Show that x = 0 is a regular singular point and that the roots of the indicial equation 
are v and — v. 

(b) Corresponding to the larger root v, show that one solution is 


y\ (x) = x v 


1 /X\ 2 1 / X\ 4 

1!(1 + v) V2/ + 2!(1 + v)(2 + v) V2/ 


+ 


OO 


E 


(-i r 

m\(\ + v) ■■■(»! + v) 



(c) If 2v is not an integer, show that a second solution is 


y 2 (x) = x ” 


1! (1 - v) 



1 

2!(1 - v)(2 - v) 



+ 


OO 


E 


m\(l — v) ■ ■ ■ (m — v) 



Note that yi(x) — >- 0 as x — >■ 0, and that y 2 (x) is unbounded as x — >• 0. 

(d) Verify by direct methods that the power series in the expressions for yi(x) and y 2 (x) 
converge absolutely for all x. Also verify that y 2 is a solution provided only that v is not 
an integer. 

10. In this section we showed that one solution of Bessel’s equation of order zero 


L[y\ = x 2 y" + xy' + x 2 y = 0 


is J 0 , where Jo(x) is given by Eq. (7) with a 0 = 1. According to Theorem 5.6.1, a second 
solution has the form (x > 0) 


(a) Show that 


OO 

y 2 (x) = J 0 (x) In* + ^ b n x n . 

n = 1 


OO OO OO 

L[y 2 \{x) = y, n(n — Db n x n + y nb„x n + y b„x n+ 2 + 2 xj' 0 (x). (i) 

n = 2 n = 1 n=l 
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(b) Substituting the series representation for/oM in Eq. (i), show that 



(ii) 


(c) Note that only even powers of x appear on the right side of Eq. (ii). Show that 
bi = b 2 = bs = • • • = 0, £>2 = 1/2 2 (1!) 2 , and that 


(2 n) 2 b 2 „ + b 2 „- 2 = — 2(— 1)" (2»)/2 2 "(n!) 2 , n = 2,3,4 


Deduce that 




and 


The general solution of the recurrence relation is b 2n — (— l) n+1 H n /2 2n (n\) 2 . Substituting 
for b n in the expression for y 2 (x), we obtain the solution given in Eq. ( 10). 

11. Find a second solution of Bessel’s equation of order one by computing the c n (r 2 ) and a 
of Eq. (24) of Section 5.6 according to the formulas (19) and (20) of that section. Some 
guidelines along the way of this calculation are the following. First, use Eq. (24) of this 
section to show that «i(— 1) and flj(— 1) are 0. Then show that Ci ( — 1) = 0 and, from the 
recurrence relation, that c„(— 1) = 0 for n = 3,5, Finally, use Eq. (25) to show that 



a 4 (r) = 


(r+l)(r + 3)(r + 3)(r + 5)’ 


and that 


m > 3. 


a 2 m(r) = 


(r + 1) ■■■(/• + 2m — 1 )(r + 3) ■ ■ • (r + 2m + 1) ’ 


Then show that 

cim(-l) = (-1 ) m+ \H m + H m -\) /2 lm m\(m - 1)!, m > 1. 

12. By a suitable change of variables it is sometimes possible to transform another differential 
equation into a Bessel equation. For example, show that a solution of 


x 2 y" + (a 2 p 2 x 2 P + l - v 2 p 2 ) y = 0, x > 0 


is given by y = x 1/2 f(ax^), where /(£) is a solution of the Bessel equation of order v. 

13. Using the result of Problem 12, show that the general solution of the Airy equation 


y" — xy = 0, x > 0 


is y = x 1/2 [ci/i(|« 3/2 ) + C 2 / 2 (|ix 3/2 )], where /i(§) and f 2 (^) are a fundamental set of solu- 
tions of the Bessel equation of order one-third. 

14. It can be shown that J 0 has infinitely many zeros for x > 0. In particular, the first three 
zeros are approximately 2.405, 5.520, and 8.653 (see Figure 5.7.1). Let Xj,j = 1, 2, 3, ... , 
denote the zeros of / 0 ; it follows that 



Verify that y = Jo(X/x) satisfies the differential equation 



x > 0. 
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CHAPTER 


The Laplace 
Transform 


6 


Many practical engineering problems involve mechanical or electrical systems acted 
on by discontinuous or impulsive forcing terms. For such problems the methods 
described in Chapter 3 are often rather awkward to use. Another method that is es- 
pecially well suited to these problems, although useful much more generally, is based 
on the Laplace transform. In this chapter we describe how this important method 
works, emphasizing problems typical of those that arise in engineering applications. 


6.1 Definition of the Laplace Transform 


Improper Integrals. Since the Laplace transform involves an integral from zero to in- 
finity, a knowledge of improper integrals of this type is necessary to appreciate the 
subsequent development of the properties of the transform. We provide a brief 
review of such improper integrals here. If you are already familiar with improper 
integrals, you may wish to skip over this review. On the other hand, if improper 
integrals are new to you, then you should probably consult a calculus book, where 
many more details and examples will be found. 

An improper integral over an unbounded interval is defined as a limit of integrals 
over finite intervals; thus 



( 1 ) 


where A is a positive real number. If the integral from a to A exists for each A > a. 
and if the limit as A — »■ oo exists, then the improper integral is said to converge to 
that limiting value. Otherwise the integral is said to diverge, or to fail to exist. The 
following examples illustrate both possibilities. 
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EXAMPLE 

1 


EXAMPLE 

2 


EXAMPLE 

3 


Let /(f) = e ct , t > 0, where c is a real nonzero constant. Then 


/•OO p/ 

/ e ct dt — lim 

Jo A ^°° Jo 


e' 1 dt = lim / e c ‘ dt — lim — 

A — »-oo /n A —> oo C 


= lim -(e M -l). 

.A — >oc C 


It follows that the improper integral converges to the value — 1/cif c < 0 and diverges if c > 0. 
If c = 0, the integrand/(t) is the constant function with value 1, and the integral again diverges. 


Let /(f) = 1 /t, t > 1. Then 



lim f 

A->ooJ i 


dt 

t 


lim In A. 

A — ►oo 


Since lim In A — oo, the improper integral diverges. 

A-»- oo 


Let fit) = t p , t > 1, where p is a real constant and p ^ 1; the case p = 1 was considered in 
Example 2. Then 



t~ p dt 


lim f 

A^ooJ l 


t p dt 


lim 

^4->oo 1 — p 


(A l ~ p 


- 1 ). 


/ OO 

dt converges to the 

value 1 /{p — 1) for p > 1 but (incorporating the result of Example 2) diverges for p < 1. 

OO 

These results are analogous to those for the infinite series n ~ p ■ 

n = 1 


Before discussing the possible existence of / /(f) dt, it is helpful to define certain 

J a 

terms. A function / is said to be piecewise continuous on an interval a < t < jJ if the 
interval can be partitioned by a finite number of points a = to < h < • ■ ■ < t n = fi so 
that 

1. / is continuous on each open subinterval f ,-_ \ < t < t t . 

2. f approaches a finite limit as the endpoints of each subinterval are approached from within 
the subinterval. 

In other words,/ is piecewise continuous on a < t < / if it is continuous there except 
for a finite number of jump discontinuities. If / is piecewise continuous on a < t < ft 
for every / > a, then / is said to be piecewise continuous on t > a. An example of a 
piecewise continuous function is shown in Figure 6.1.1. 

The integral of a piecewise continuous function on a finite interval is just the sum 
of the integrals on the subintervals created by the partition points. For instance, for 
the function /(f) shown in Figure 6.1.1, we have 

f f(t) dt = f fit) dt + f f(t)dt+ f f(t) dt. 

J a J a J t\ J *2 


( 2 ) 
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FIGURE 6.1.1 A piecewise continuous function y — fit). 


For the function shown in Figure 6.1.1 we have assigned values to the function at 
the endpoints a and ft and at the partition points t\ and t 2 - However, as far as the 
integrals in Eq. (2) are concerned, it does not matter whether /(f) is defined at these 
points, or what values may be assigned to /(f) at them. The values of the integrals in 
Eq. (2) remain the same regardless. 

r A 

Thus, if / is piecewise continuous on the interval a < t < A, then / /(f) dt exists. 

J a 

r A 

Hence, if / is piecewise continuous for f > a , then / /(f) dt exists for each A > a. 

J a 

However, piecewise continuity is not enough to ensure convergence of the improper 

POO 

integral / /(f) dt, as the preceding examples show. 

J a 

If / cannot be integrated easily in terms of elementary functions, the definition of 

POO 

convergence of / /(f) dt may be difficult to apply. Frequently, the most convenient 

J a 

way to test the convergence or divergence of an improper integral is by the following 
comparison theorem, which is analogous to a similar theorem for infinite series. 


If / is piecewise continuous for f > a, if |/(f)| < g(t) when t > M for some positive 

POO POO 

constant M, and if / g(t) dt converges, then / f(t) dt also converges. On the 
JM Ja 

POO POO 

other hand, if /(f) > g(t) > 0 for f > M, and if / g(t) dt diverges, then / /(f) 

JM Ja 


dt 


also diverges. 


The proof of this result from calculus will not be given here. It is made plausible, 

POO POO 

however, by comparing the areas represented by / g(t) dt and / |/(r)| dt. The 

Jm Jm 

functions most useful for comparison purposes are e ct and t~ p , which we considered 
in Examples 1, 2, and 3. 


The Laplace Transform. Among the tools that are very useful for solving linear differ- 
ential equations are integral transforms. An integral transform is a relation of the 
form 


F(s) = 



K(s,t)f(t) dt. 


( 3 ) 
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where K(s,t ) is a given function, called the kernel of the transformation, and the 
limits of integration a and / are also given. It is possible that a = — oo or (3 = oo, 
or both. The relation (3) transforms the function / into another function F, which is 
called the transform of /. 

There are several integral transforms that are useful in applied mathematics, but 
in this chapter we consider only the Laplace 1 2 transform. This transform is defined 
in the following way. Let /(f) be given for t > 0, and suppose that / satisfies certain 
conditions to be stated a little later. Then the Laplace transform off, which we will 
denote by £{/(/)} or by F(s), is defined by the equation 

POO 

C{f(t)} = F(s)= e~ st f (t)dt, (4) 

Jo 

whenever this improper integral converges. The Laplace transform makes use of the 
kernel K(s, t) — e~ st . Since the solutions of linear differential equations with constant 
coefficients are based on the exponential function, the Laplace transform is partic- 
ularly useful for such equations. The general idea in using the Laplace transform to 
solve a differential equation is as follows: 

1. Use the relation (4) to transform an initial value problem for an unknown function / in 
the f-domain into a simpler problem (indeed, an algebraic problem) for F in the j-domain. 

2. Solve this algebraic problem to find F. 

3. Recover the desired function / from its transform F. This last step is known as “inverting 
the transform.” 

In general, the parameter s may be complex, and the full power of the Laplace 
transform becomes available only when we regard F(s) as a function of a complex 
variable. However, for the problems discussed here, it is sufficient to consider only 
real values of s. The Laplace transform F of a function / exists if / satisfies certain 
conditions, such as those stated in the following theorem. 


Theorem 6.1.2 Suppose that 

1. / is piecewise continuous on the interval 0 < t < A for any positive A. 

2. |/(t)| < Ke at when t > M. In this inequality, K, a , and M are real constants, K and M 
necessarily positive. 

Then the Laplace transform C{f(t)} = F(s), defined by Eq. (4), exists for s > a. 


To establish this theorem, we must show that the integral in Eq. (4) converges for 
s > a. Splitting the improper integral into two parts, we have 


poo pM 

/ e~ st f (t) dt = / , 

Jo Jo 


<l f (0 dt - 


f 


n f(t) dt. 


( 5 ) 


lr The Laplace transform is named for the eminent French mathematician P. S. Laplace, who studied the 
relation (3) in 1782. However, the techniques described in this chapter were not developed until a century 
or more later. We owe them mainly to Oliver Heaviside (1850-1925), an innovative but unconventional 
English electrical engineer, who made significant contributions to the development and application of 
electromagnetic theory. 
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The first integral on the right side of Eq. (5) exists by hypothesis (1) of the theorem; 
hence the existence of F(s ) depends on the convergence of the second integral. By 
hypothesis (2) we have, for t > M, 

\e~ s ‘f(t)\ < Ke~ st e at = Ke (a ~ s)t , 

POO 

and thus, by Theorem 6.1.1, F(s) exists provided that / e (a ~ s)t dt converges. Refer- 

Jm 

ring to Example 1 with c replaced by a — s, we see that this latter integral converges 
when a — s < 0, which establishes Theorem 6.1.2. 

In this chapter (except in Section 6.5) we deal almost exclusively with functions that 
satisfy the conditions of Theorem 6.1.2. Such functions are described as piecewise 
continuous and of exponential order as t — > oo. Note that there are functions that 
are not of exponential order as t — »■ oo. One such function is /(f) = e‘ . As t — > oo, 
this function increases faster than Ke al regardless of how large the constants K and 
a may be. 

The Laplace transforms of some important elementary functions are given in the 
following examples. 


1 


Let /(f) = 1, f > 0. Then, as in Example 1, 


EXAMPLE 

4 


£{!) = 




e dt = — lim — 

A— >oo S 


s > 0. 


EXAMPLE 

5 


Let /(f) = e at , f > 0. Then, again referring to Example 1, 

poo poo 

£{e at }= / e~ st e at dt= / e~ {s ~ a)t dt 
Jo Jo 


1 

, s > a. 

s — a 


EXAMPLE 

6 


Let 


m = 


i, 

k, 

0, 


0 < f < 1, 
t = 1, 
f > 1, 


where A: is a constant. In engineering contexts /(f) often represents a unit pulse, perhaps of 
force or voltage. 

Note that / is a piecewise continuous function. Then 


poo pi 

C{f{t)}= / e~ s ‘f (t) dt = / « 

Jo Jo 


( dt — — - 


1 — e 


> 0. 


Observe that £{/(f)} does not depend on k , the function value at the point of discontinuity. 
Even if /(f) is not defined at this point, the Laplace transform of / remains the same. Thus 
there are many functions, differing only in their value at a single point, that have the same 
Laplace transform. 
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EXAMPLE 

7 


EXAMPLE 

8 


Let /(f) = sin at, t > 0. Then 

/>oo 

£{sina?| = F(s) = / e~ st sin at dt, s > 0. 
Jo 


Since 


F(s ) = lim / e st sin at dt , 
A ^°°Jo 


upon integrating by parts, we obtain 


F(s) = lim 

A— >oo 


e " cos at 

to 

1 

— 

1 e cosat dt 

a 

a Jo 

L 

0 


_ 1 s 
a a Jo 


e st cos at dt. 


A second integration by parts then yields 


1 s 2 

F(s) — -I e~ sl sin at dt 


r 


1 s 2 

= 2 F ^- 

n n- 


Hence, solving for F(s), we have 


F(s) = 


s 2 + a 2 


s > 0. 


Now let us suppose that/i and /2 are two functions whose Laplace transforms exist 
for s > a 1 and s > a 2 , respectively. Then, for 5 greater than the maximum of a\ and a 2 , 

POO 

£{cifi(t) + £2/2(0) = / e _i '[ci/i (0 + £2/2(0] dt 
Jo 

POO POO 

= ci e~ st fi(t)dt + c 2 I e~ sl f 2 (t)dt; 

Jo Jo 

hence 

+ c 2 f 2 (t)} = c\C{f\ (0 ) + c 2 £{/ 2 (0}- (6) 


Equation (6) states that the Laplace transform is a linear operator, and we make 
frequent use of this property later. The sum in Eq. (6) can be readily extended to an 
arbitrary number of terms. 


Find the Laplace transform of/(t) = 5e 2 ‘ — 3 sin4f, t > 0. 
Using Eq. (6), we write 

£{/( t)} = 5 C{e~ 2 '} — 3£{sin4f}. 


Then, from Examples 5 and 6, we obtain 


£{/«} = 


12 


j + 2 i 2 + 16 ’ 


s > 0. 
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PROBLEMS In each of Problems 1 through 4 sketch the graph of the given function. In each case determine 

whether / is continuous, piecewise continuous, or neither on the interval 0 < f < 3. 



\t\ 

0 < r < 1 

\ 1 2 > 

0 < f < 1 

1. 

fit) = \ 2 + t, 

1 < t < 2 

2. fit) = { (f - 1)- 

- 1 , 1 < f < 2 


[ 6 — f, 

2 < t < 3 

u 

2 < f < 3 


\t\ 

0 < t < 1 

\t. 

0 < f < 1 

3. 

fit ) = 1, 

1 < t < 2 

4. fit) = \ 3 - f, 

1 < f <2 


1.3 — f’ 

2 < t < 3 

u 

2 < f < 3 


5. Find the Laplace transform of each of the following functions: 

(a) fit ) = t 

(b) f(t ) = t 2 

(c) f{t) = t n , where n is a positive integer 

6. Find the Laplace transform of /(f) = cos at, where a is a real constant. 

Recall that cosh bt = (e bt + e~ bl )/2 and sinh bt = (e bt — e~ b ‘)/2. In each of Problems 7 through 
10 find the Laplace transform of the given function; a and b are real constants. 

7. fit ) = cosh bt 8. /(f) = sinh bt 

9. fit ) = e a ' cosh bt 10. /(f) = e al sinh bt 

In each of Problems 11 through 14 recall that cos bt = (e ,bI + e~ lb ‘)/2 and 

sin bt = ( e lbt — e~ lbt ) /2 i. Assuming that the necessary elementary integration formulas extend 

to this case, find the Laplace transform of the given function; a and b are real constants. 


11. fit) — sin hf 

12. /(f) = cos hf 

13. fit) = e" sinhf 

14. fit ) = e af cos 7>f 

In each of Problems 15 through 20 use integration by parts to find the Laplace transform of 
the given function; n is a positive integer and a is a real constant. 

15. fit) = te a ‘ 

16. fit ) = f sin of 

17. fit) = f coshaf 

18. fit) = f"e" 

19. fit) = f 2 sinuf 

20. fit) = f 2 sinh of 

In each of Problems 21 through 24 determine whether the given integral converges or diverges. 

POO 

21. / (f 2 + l)" 1 df 

Jo 

CXJ 

22. / df 

Jo 

POO 

23. J t~ 2 e‘ dt 

POO 

24. / cos t dt 

Jo 


25. Suppose that / and /' are continuous for f > 0 and of exponential order as f — >■ oo. Use 
integration by parts to show that if F{s) = £(/(f)},then lim F(s) = 0. The result is actually 

s-> OO 

true under less restrictive conditions, such as those of Theorem 6.1.2. 

26. The Gamma Function. The gamma function is denoted by T ip) and is defined by the 
integral 

POO 

r (p + 1) = / e~ x x p dx. (i) 

Jo 
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The integral converges as x — > oo for all p. For p < 0 it is also improper because the 
integrand becomes unbounded as x — > 0. However, the integral can be shown to converge 
atx = 0 for p > —1. 

(a) Show that, for p > 0, 

T(p + 1) = P r(p). 

(b) Show that T(l) = 1. 

(c) If p is a positive integer n, show that 

r(n+ 1 ) = n\. 


Since r (p) is also defined when p is not an integer, this function provides an extension of 
the factorial function to nonintegral values of the independent variable. Note that it is 
also consistent to define 0! = 1. 

(d) Show that, for p > 0, 

P(P + 1 )ip + 2) • • ■ (p + n - 1) = T(p + n)/r(p). 


Thus F {p) can be determined for all positive values of p if T (p) is known in a single interval 
of unit length, say, 0 < p < 1. It is possible to show that T (I) = ypn. Find F (|)andr (y). 

27. Consider the Laplace transform of t p , where p > — 1. 

(a) Referring to Problem 26, show that 

poo 1 poo 

C{t p ] = / e~ s, t p dt = / e~ x x p dx 

Jo s p+1 Jo 

= T(p + V)/s p+1 , s > 0. 


(b) Let p be a positive integer n in (a); show that 

C{f) = n\/s' l+l , s > 0. 

(c) Show that 

nt~V 2 \ — 

It is possible to show that 


9 r°° 

dr 1 ' 2 } — — / e ~ x 2 dx, s > 0. 

nA? Jo 


f 


-X 2 J 

e dx = ; 

2 


hence 

(d) Show that 


C{t 1/2 } = y/n/s, s > 0. 
T{fV2j = ^/(2s 3/2 ), s> 0. 


6.2 Solution of Initial Value Problems 


In this section we show how the Laplace transform can be used to solve initial value 
problems for linear differential equations with constant coefficients. The useful- 
ness of the Laplace transform in this connection rests primarily on the fact that the 
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transform of /' is related in a simple way to the transform of /. The relationship is 
expressed in the following theorem. 


Suppose that / is continuous and f is piecewise continuous on any interval 
0< t < A. Suppose further that there exist constants K, a, and M such that 
1/(01 < Ke at for t > M. Then C{f'(t)} exists for s > a, and moreover 

£{f\t)}=s£{f(.t)}-f(.0). (1) 


To prove this theorem, we consider the integral 


f 


e st f(t)dt. 


If /' has points of discontinuity in the interval 0 < t < A, let them be denoted by 
t\ , f 2 , . . . , f«. Then we can write this integral as 

[ e~ st f(t)dt= f e~ st f(t)dt+ ( e~ st f (t) dt H + f e~ st f(t)dt. 

Jo Jo J t\ J t n 

Integrating each term on the right by parts yields 

f A e~ sl f(t ) dt = e~ st f (0 ?1 + e~ st f (t) ' 2 + • • • + e~ st f(t) 

Jo 0 h 

V r‘i 


+ 5 f e st f(t ) dt + f e s, f(t ) dt H + f e st f(t) dt 

_ Jo J 1 1 J tn 


Since / is continuous, the contributions of the integrated terms at ti, t 2 , . . . , t„ cancel. 
Combining the integrals gives 

f e~ st f (0 dt = e~ sA f (A) - /( 0 ) + s f e~ st f{t) dt. 

Jo Jo 

For A > M, we have \f(A)\ < Ke“ A ; consequently, \e~ sA f(A)\ < Ke~^ s ~ a)A . Hence 
e~ sA f(A) — 0 as A -> oo whenever s > a. Therefore, for s > a, 

£{f'(t)} = s£{f(t)} — / (0) , 


which establishes the theorem. 

If/' and/" satisfy the same conditions that are imposed on / and/', respectively, 
in Theorem 6.2.1, then it follows that the Laplace transform of/" also exists for s > a 
and is given by 

£{/"«} = s 2 C{f(t)} - sf (0) f' (O'). (2) 


Indeed, provided the function / and its derivatives satisfy suitable conditions, an 
expression for the transform of the nth derivative f (n> can be derived by successive 
applications of this theorem. The result is given in the following corollary. 
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Corollary 6.2.2 Suppose that the functions/,/', . . . ,/(" -1 ) are continuous and that/ (,!) is piecewise 
continuous on any interval 0 < t < A. Suppose further that there exist constants 
K,a , and M such that |/(t)| < Ke at , |/'(f)l < Ke at , . . . , |/ ( ” -1) (0l < Ke“ l for t > M. 
Then £{/ (n) (t)} exists for s > a and is given by 

C{f (n \t)} = s n C{f{t ) } - s n] f ( 0 ) sf n ~ 2 \ 0 ) ( 3 ) 


We now show how the Laplace transform can be used to solve initial value prob- 
lems. It is most useful for problems involving nonhomogeneous differential equa- 
tions, as we will demonstrate in later sections of this chapter. However, we begin by 
looking at some homogeneous equations, which are a bit simpler. 


EXAMPLE 

1 


Consider the differential equation 

y" — / — 2y = 0 (4) 

and the initial conditions 

y(0) = l, /([)) = o. (5) 

This problem is easily solved by the methods of Section 3.1. The characteristic equation is 

r 2 - r-2 = (r- 2)(r + 1) = 0, (6) 

and consequently the general solution of Eq. (4) is 

y = cie~‘ + c 2 e 2 '. (7) 

To satisfy the initial conditions (5), we must have c\ + c 2 = 1 and — C\ + 2c 2 = 0; hence c\ = | 
and c 2 = I, so that the solution of the initial value problem (4) and (5) is 

y = 4>(t) = \e~‘ + \e 2 ‘ . (8) 

Now let us solve the same problem by using the Laplace transform. To do this, we must 
assume that the problem has a solution y = <p (f), which with its first two derivatives satisfies the 
conditions of Corollary 6.2.2. Then, taking the Laplace transform of the differential equation 
(4), we obtain 

C{y"} - C{y'} - 2 C{y} = 0, (9) 

where we have used the linearity of the transform to write the transform of a sum as the sum 
of the separate transforms. Upon using the corollary to express £{/'} and £.{y'} in terms of 
£{y), we find that Eq. (9) becomes 

s 2 C{y) - yy(0) - y'(0) - L£{y) - y(0)] - 2 C{y} = 0, 


or 

(s 2 -s- 2)Y(s) + (1 - s)y(0) - y'(0) = 0, (10) 


where Y (s) = £{yj. Substituting for y(0) and y'(0) in Eq. (10) from the initial conditions (5), 
and then solving for Y (s), we obtain 


Y(s) = 


s — 1 
s 2 — s — 2 


s — 1 

(s-2)(s + l)‘ 


( 11 ) 
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We have thus obtained an expression for the Laplace transform Y (s) of the solution y = <f>(t) 
of the given initial value problem. To determine the function 0, we must find the function 
whose Laplace transform is Y (s ), as given by Eq. (11). 

This can be done most easily by expanding the right side of Eq. (11) in partial fractions. 
Thus we write 


Y(s) = 


s — 1 

(s-2)(s + l) 


a b 
s — 2 s + 1 


a(s + 1) + b(s — 2) 
is - 2 )(s + 1) 


( 12 ) 


where the coefficients a and b are to be determined. By equating numerators of the second 
and fourth members of Eq. (12), we obtain 


s — 1 = a(s + 1) + b(s — 2), 


an equation that must hold for all s. In particular, if we set s = 2, then it follows that a = 
Similarly, if we set s = —1, then we find that b = By substituting these values for a and b, 
respectively, we have 


Y(s) = 


1/3 2/3 

s-2 s + 1' 


(13) 


Finally, if we use the result of Example 5 of Section 6.1, it follows that le 2f has the transform 
1 (s — 2) _1 ; similarly, =e - ' has the transform |(y + l) -1 . Hence, by the linearity of the Laplace 
transform. 


y = <P(t) = \e 2t + \e 


-t 


has the transform (13) and is therefore the solution of the initial value problem (4), (5). 
Observe that it does satisfy the conditions of Corollary 6.2.2, as we assumed initially. Of 
course, this is the same solution that we obtained earlier. 


The same procedure can be applied to the general second order linear equation 
with constant coefficients 

ay" + by' + cy = f(t). (14) 

Assuming that the solution y = (p(t) satisfies the conditions of Corollary 6.2.2 for 
n — 2, we can take the transform of Eq. (14) and thereby obtain 

a[x 2 Y(s) - sy(0) - /(O)] + b[sY (s) - y(0)] + cY (s) = F(s), 


where F(s ) is the transform of /(f). By solving Eq. (15) for Y (s ) , we find that 


YQs) = 


(as + b)y( 0) + ay'( 0) 
as 2 + bs + c 


F(s) 


s 2 + bs + c 


(15) 


(16) 


The problem is then solved, provided that we can find the function y = <p(t) whose 
transform is Y (s). 

Even at this early stage of our discussion we can point out some of the essential fea- 
tures of the transform method. In the first place, the transform Y (s) of the unknown 
function y — <p(t) is found by solving an algebraic equation rather than a differential 
equation, Eq. (10) rather than Eq. (4) in Example 1, or in general Eq. (15) rather 
than Eq. (14). This is the key to the usefulness of Laplace transforms for solving 
linear, constant coefficient, ordinary differential equations — the problem is reduced 
from a differential equation to an algebraic one. Next, the solution satisfying given 
initial conditions is automatically found, so that the task of determining appropriate 
values for the arbitrary constants in the general solution does not arise. Further, as 
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indicated in Eq. (15), nonhomogeneous equations are handled in exactly the same 
way as homogeneous ones; it is not necessary to solve the corresponding homoge- 
neous equation first. Finally, the method can be applied in the same way to higher 
order equations, as long as we assume that the solution satisfies the conditions of the 
corollary for the appropriate value of n. 

Observe that the polynomial as 2 + bs + c in the denominator on the right side of 
Eq. (16) is precisely the characteristic polynomial associated with Eq. (14). Since 
the use of a partial fraction expansion of Y (?) to determine </> (t) requires us to factor 
this polynomial, the use of Laplace transforms does not avoid the necessity of finding 
roots of the characteristic equation. For equations of higher than second order this 
may require a numerical approximation, particularly if the roots are irrational or 
complex. 

The main difficulty that occurs in solving initial value problems by the transform 
method lies in the problem of determining the function y — 0 (?) corresponding to 
the transform Y (?) . This problem is known as the inversion problem for the Laplace 
transform; / (f) is called the inverse transform corresponding to Y (?) , and the process 
of finding <p{t) from Y (?) is known as inverting the transform. We also use the notation 
£ _1 {^(s)} t° denote the inverse transform of Y (?). There is a general formula for 
the inverse Laplace transform, but its use requires a familiarity with functions of a 
complex variable, and we do not consider it in this book. However, it is still possible 
to develop many important properties of the Laplace transform, and to solve many 
interesting problems, without the use of complex variables. 

In solving the initial value problem (4), (5), we did not consider the question of 
whether there may be functions other than the one given by Eq. (8) that also have 
the transform (13). We know, by Theorem 3.2.1, that the initial value problem has no 
other solutions. Consistent with this fact, it can be shown that if / and g are continuous 
functions with the same Laplace transform, then / and g must be identical. On the 
other hand, if / and g are only piecewise continuous, then they may differ at one or 
more points of discontinuity and yet have the same Laplace transform; see Example 6 
in Section 6.1. This lack of uniqueness of the inverse Laplace transform for piecewise 
continuous functions is of no practical significance in applications. 

Thus there is essentially a one-to-one correspondence between functions and their 
Laplace transforms. This fact suggests the compilation of a table, such as Table 6.2.1, 
giving the transforms of functions frequently encountered, and vice versa. The entries 
in the second column of Table 6.2.1 are the transforms of those in the first column. 
Perhaps more important, the functions in the first column are the inverse transforms 
of those in the second column. Thus, for example, if the transform of the solution 
of a differential equation is known, the solution itself can often be found merely 
by looking it up in the table. Some of the entries in Table 6.2.1 have been used as 
examples, or appear as problems in Section 6.1, while others will be developed later in 
the chapter. The third column of the table indicates where the derivation of the given 
transforms may be found. Although Table 6.2.1 is sufficient for the examples and 
problems in this book, much larger tables are also available (see the list of references 
at the end of the chapter). Transforms and inverse transforms can also be readily 
obtained electronically by using a computer algebra system. 

Frequently, a Laplace transform F(s) is expressible as a sum of several terms 


F(s) = F\ (?) + F 2 (s) H b F„(s). 


(17) 
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TABLE 6.2.1 Elementary Laplace Transforms 



^7 

£ 

7 

^4 

II 

F(s ) = £{/(t)} 

Notes 

1. 

1 

5 > 0 

Sec. 6.1; Ex. 4 

2. 

e at 

l 

, 5 > a 

s — a 

Sec. 6.1; Ex. 5 

3. 

t n , n = positive integer 

n\ 

r, S > 0 

5"+!’ 

Sec. 6.1; Prob. 27 

4. 

t p , p>- 1 

r(p + i) 

m » 5 > 0 

Sec. 6.1; Prob. 27 

5. 

sin at 

— s > 0 

s 2 + a 2 

Sec. 6.1; Ex. 7 

6. 

cos at 

s 

— s > 0 

s 2 + a 2 

Sec. 6.1; Prob. 6 

7. 

sinh at 

a 

, s > |a| 

s z — a 1 

Sec. 6.1; Prob. 8 

8. 

cosh at 

s 

, i > |a| 

s z — 

Sec. 6.1; Prob. 7 

9. 

e a ' sin bt 

b 

~ r, s > a 

(s — a) 2 + i> 2 

Sec. 6.1; Prob. 13 

10. 

e a ‘ cos bt 

s — a 

(s - a) 2 + b 2 ’ S>a 

Sec. 6.1; Prob. 14 

11. 

t n e a ‘, n = positive integer 

n\ 

r, s > a 

is - a) n+1 

Sec. 6.1; Prob. 18 

12. 

ll c (t) 

p~CS 

, s > 0 

5 

Sec. 6.3 

13. 

u c (t)f(t - c) 

e~ cs Fis) 

Sec. 6.3 

14. 

e c 'f{t) 

Fis — c) 

Sec. 6.3 

15. 

f(ct) 

l F 0- c>0 

Sec. 6.3; Prob. 25 

16. 

[ fit ~ f)g(r) dr 
./0 

Fis)Gis) 

Sec. 6.6 

17. 

S(f - c) 

e~ cs 

Sec. 6.5 

18. 

f n \t ) 

s n Fis ) - s n ~ 1 f( 0) f (n ~ l) i0) 

Sec. 6.2 

19. 

(-07(0 

F M is) 

Sec. 6.2; Prob. 28 


318 


Chapter 6. The Laplace Transform 


Suppose that/i(0 = £ (■?)}, ■ . . ,/„(?) = £ 1 {F n (s)}. Then the function 

f(f) =/i(0 + •••+/» (0 

has the Laplace transform F(s). By the uniqueness property stated previously, no 
other continuous function / has the same transform. Thus 

£ _1 {FW) = £ -1 {-Fi(s)} + • ■ ■ + C~ l {F n (.s)}-, (18) 

that is, the inverse Laplace transform is also a linear operator. 

In many problems it is convenient to make use of this property by decomposing a 
given transform into a sum of functions whose inverse transforms are already known 
or can be found in the table. Partial fraction expansions are particularly useful in this 
connection, and a general result covering many cases is given in Problem 38. Other 
useful properties of Laplace transforms are derived later in this chapter. 

As further illustrations of the technique of solving initial value problems by means 
of the Laplace transform and partial fraction expansions, consider the following 
examples. 


EXAMPLE 

2 


Find the solution of the differential equation 

y" + y = sin 2 1, 


satisfying the initial conditions 

y(0) = 2, /( 0) = 1. 


(19) 

(20) 


We assume that this initial value problem has a solution y = <f>(t), which with its first two 
derivatives satisfies the conditions of Corollary 6.2.2. Then, taking the Laplace transform of 
the differential equation, we have 

s 2 Y(s) - yy(0) - y'(0) + Y(s) = 2 /(s 2 + 4), 


where the transform of sin2r has been obtained from line 5 of Table 6.2.1. Substituting for 
y(0) and y'(0) from the initial conditions and solving for Y (s), we obtain 


2 s 3 + s 2 + 8s + 6 
(s 2 + l)(s 2 + 4) 


(21) 


Using partial fractions, we can write Y (s) in the form 

as + b cs + d (as + b)(s 2 + 4) + (cs + d)(s 2 + 1) 
s 2 + 1 s 2 + 4 (s 2 + 1)(j 2 + 4) 


( 22 ) 


By expanding the numerator on the right side of Eq. (22) and equating it to the numerator in 
Eq. (21), we find that 

2j 3 + s 2 + 8s + 6 = (a + c)J 3 + (b + d)s 2 + (4 a + c)J + (4 b + d) 


for all s. Then, comparing coefficients of like powers of s, we have 

a + c = 2, b + d = 1, 

4a + c = 8, Ab + d = 6. 

Consequently, a = 2, c = 0, b = 3 , and d = — |, from which it follows that 

2s 5/3 2/3 

s 2 + 1 + s 2 + 1 s 2 + A' 


Y(s) = 


(23) 
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I 


From lines 5 and 6 of Table 6.2.1, the solution of the given initial value problem is 

y = (pit) = 2 cos t + | sint — | sin2r. 


(24) 


EXAMPLE 


Find the solution of the initial value problem 

/ 4, -y = 0, 

T(0) = 0, /(0) = 1, y"( 0) = 0, 


/"( 0 ) = 0 . 


(25) 

(26) 


In this problem we need to assume that the solution y = 4>(t) satisfies the conditions of 
Corollary 6.2.2 for n = 4. The Laplace transform of the differential equation (25) is 

s 4 Y(i) - s 3 y(0) - s 2 j/(0) - sy"(0) - /"(()) - Y(s) = 0. 

Then, using the initial conditions (26) and solving for Y (s), we have 


Y(s) = 


s 4 — 1 


(27) 


A partial fraction expansion of Y (s) is 


as + b cs + d 

Y C 5 ) = ~2 T + 2 i 1 ’ 

s 2 - 1 s 2 + 1 


and it follows that 


(as + b)(s 2 + 1) + (cj + d)(s 2 — 1) = s 2 (28) 

for all 5. By setting s = 1 and s = —1, respectively, in Eq. (28), we obtain the pair of equations 

2 (a + b) = 1 , 2 (— a + b) = 1 , 

and therefore a = 0 and b = \. If we set s = 0 in Eq. (28), then b — d = 0, so d = ^. Finally, 
equating the coefficients of the cubic terms on each side of Eq. (28), we find that a + c = 0, so 
c = 0. Thus 


1/2 1/2 
Y(s)=^-^+ ' 


s 2 — 1 ' s 2 + 1 ’ 

and from lines 7 and 5 of Table 6.2.1, the solution of the initial value problem (25), (26) is 

sinh t + sin t 


(29) 


y = <t>(t) = 


(30) 


The most important elementary applications of the Laplace transform are in the 
study of mechanical vibrations and in the analysis of electric circuits; the govern- 
ing equations were derived in Section 3.7. A vibrating spring-mass system has the 
equation of motion 

d 2 u du 

m W + r di + ku = Fm ■ (31) 

where m is the mass, y the damping coefficient, k the spring constant, and F(t) the 
applied external force. The equation that describes an electric circuit containing an 
inductance L, a resistance R, and a capacitance C (an LRC circuit) is 


320 


Chapter 6. The Laplace Transform 


PROBLEMS 


where Q(t) is the charge on the capacitor and E(t) is the applied voltage. In terms 
of the current /(f) = dQ(t)/dt, we can differentiate Eq. (32) and write 


r d 2 I dl 

L~py + R~r 
dt 2 dt 


1 dE 

C dt 


{t). 


(33) 


Suitable initial conditions on u, Q, or I must also be prescribed. 

We have noted previously, in Section 3.7, that Eq. (31) for the spring-mass system 
and Eq. (32) or (33) for the electric circuit are identical mathematically, differing 
only in the interpretation of the constants and variables appearing in them. There 
are other physical problems that also lead to the same differential equation. Thus, 
once the mathematical problem is solved, its solution can be interpreted in terms of 
whichever corresponding physical problem is of immediate interest. 

In the problem lists following this and other sections in this chapter are numerous 
initial value problems for second order linear differential equations with constant 
coefficients. Many can be interpreted as models of particular physical systems, but 
usually we do not point this out explicitly. 


In each of Problems 1 through 10 find the inverse Laplace transform of the given function. 


1. F(s) = 

s 2 + 4 

2. F(s) = 

(s - l) 3 

3. F(s) = 

2 

4. F(s) = 

3s 

s 2 + 3s — 4 

s 2 — s — 6 

5. F(s) = 

2s + 2 

6. F(s) = 

2s -3 

s 2 + 2s + 5 

s 2 — 4 

7. F(s) = 

2s + 1 

8. F(s) = 

8s 2 -4s +12 

s 2 -2s + 2 

s(s 2 + 4) 

9. F(s) = 

1-2 s 

10. F(s) = 

2s -3 

s 2 + 4s + 5 

s 2 + 2s + 10 


In each of Problems 11 through 23 use the Laplace transform to solve the given initial value 
problem. 

y'(0) = -1 

y'(0) = o 
y'(0) = l 
y'(0) = i 
y'( 0 ) = o 

y'(0) = -l 

y(0) = 0, y'(0) = 1, /'(0) = 0, /"( 0) = 1 

y(0) = 1, y'(0) = 0, y"(0) = 1, /"(()) = 0 

y(0) = 1, /( 0) = 0, y"(0) = — 2, y'"(0) = 0 


11. y" -y - 6 y = 0; 

12. y" + 3/ + 2y = 0; 

13. y" - 2/ + 2y = 0; 

14. y” - 4 y + 4 y = 0; 

15. y” - 2 / + 4 y = 0; 

16. y" + 2/ + 5y = 0; 

17. y (4) - 4 /" + 6 y" - 4 / + y = 0; 

18. y (4) - y = 0; 

19. y (4) - 4y = 0; 


y(0) = 1, 
y(0) = l, 

y(0) = 0, 
y(0> = l, 
y(P) = 2, 
y( 0) = 2, 


20. y" + u> 2 y = cos2f, a> 2 / 4; y( 0) = 1, y'(0) = 0 
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21. y" - 2/ + 2y = cost; y(0) = 1, /( 0) = 0 

22. y" — 2/ + 2y = e - '; y(0) = 0, y'(0) = 1 

23. y" + 2/ + y = 4e~‘ ; y(0) = 2, y'(0) = —1 

In each of Problems 24 through 26 find the Laplace transform Y (s) = C{y } of the solution of 
the given initial value problem. A method of determining the inverse transform is developed 
in Section 6.3. 

{ 1, 0 < t < n, 

’ “ y(0) = 1, /(0) = 0 

0, n < t < oo; 

{ t, 0 < t < 1, 

’ “ , y( 0 ) = 0 , /( 0 ) = 0 

0, 1 < t < oo; 

[f, 0 < t < 1, 

26. y" + 4y = ~ y(0) = 0, /( 0) = 0 

II, 1 < t < oo; 

27. The Laplace transforms of certain functions can be found conveniently from their Taylor 
series expansions. 

(a) Using the Taylor series for sin t 


~ ( _l)« r 2«+t 

sin(= > 

trk (2/z + 1)! 


and assuming that the Laplace transform of this series can be computed term by term, 
verify that 

1 


£{sinf} = 


t 2 + l’ 


s > 1. 


(b) Let 


m = 


{(sinf)/f, t ^ 0, 

1, t = 0. 


Find the Taylor series for / about t = 0. Assuming that the Laplace transform of this 
function can be computed term by term, verify that 


C[f{t)} — arctan(l/i), s > 1. 


(c) The Bessel function of the first kind of order zero, Jo, has the Taylor series (see 
Section 5.7) 


Jo(t) = ^2 

n = 0 


(-1 yt 2n 

2 2n (ri\) 2 ' 


Assuming that the following Laplace transforms can be computed term by term, verify 
that 

£[Jo(t)} = (s 2 + 1)- 1/2 , s > 1, 

and 

£{J 0 (Vi)} = UV 17 ' 4 * 1 , s > 0. 
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Problems 28 through 36 are concerned with differentiation of the Laplace transform. 


28. Let 


F(s) = 



e s ‘f{t) dt. 


It is possible to show that as long as / satisfies the conditions of Theorem 6.1.2, it is 
legitimate to differentiate under the integral sign with respect to the parameter s when 
s > a. 

(a) Show that F'(s) = £{— f/(f)}. 

(b) Show that F tn> (s) = £{(—t) n f(t ) }; hence differentiating the Laplace transform corre- 
sponds to multiplying the original function by —t. 

In each of Problems 29 through 34 use the result of Problem 28 to find the Laplace transform 
of the given function; a and b are real numbers and n is a positive integer. 


29. /(f) = te a ' 

30. /(f) = f 2 sin bt 

31. /(f) = t n 

32. /(f) = t n e at 

33. /(f) = te a ‘ sin bt 

34. /(f) = te at cos bt 

35. Consider Bessel’s equation of order zero 



ty" +/ + ty = o. 


Recall from Section 5.4 that f = 0 is a regular singular point for this equation, and therefore 
solutions may become unbounded as f — >• 0. However, let us try to determine whether 
there are any solutions that remain finite at f = 0 and have finite derivatives there. As- 
suming that there is such a solution y — /(f), let Y(s) — £(/(f)}. 

(a) Show that Y (s) satisfies 

(1 +£ 2 )Y'(s) -t-sYfy) = 0. 

(b) Show that Y (s) = c(l + s 2 ) _1/2 , where c is an arbitrary constant. 

(c) Writing (1 + s 2 ) _1/2 = s _1 (l + s _2 )~ 1/2 , expanding in a binomial series valid for s > 1, 
and assuming that it is permissible to take the inverse transform term by term, show that 


y = c J2 


(-1 ) n t 2 ' 
2 2n (n\) 2 


cMt), 


where J 0 is the Bessel function of the first kind of order zero. Note that / 0 (0) = 1 and that 
Jo has finite derivatives of all orders at f = 0. It was shown in Section 5.7 that the second 
solution of this equation becomes unbounded as f -> 0. 

36. For each of the following initial value problems use the results of Problem 28 to find 
the differential equation satisfied by Y(y) = £{/(f) }, where y = /(f) is the solution of the 
given initial value problem. 

(a) y" — ty = 0; y(0) = 1, y'(0) = 0 (Airy’s equation) 

(b) (1 — t 2 )y" — 2ty' + a(a + l)y = 0; y(0) = 0, y'(0) = 1 (Legendre’s equation) 

Note that the differential equation for Y (s) is of first order in part (a), but of second order 
in part (b). This is due to the fact that t appears at most to the first power in the equation 
of part (a), whereas it appears to the second power in that of part (b). This illustrates that 
the Laplace transform is not often useful in solving differential equations with variable 
coefficients, unless all the coefficients are at most linear functions of the independent 
variable. 
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37. Suppose that 

g(t) = [ fir) dr. 

Jo 

If G(s ) and F(s) are the Laplace transforms of g(t) and /(f), respectively, show that 

G(s) = F(s)/s. 

38. In this problem we show how a general partial fraction expansion can be used to calculate 
many inverse Laplace transforms. Suppose that 

F(s) = P(s)!Q(s ), 

where Q(s) is a polynomial of degree n with distinct zeros r\,...,r n , and P(s) is a 
polynomial of degree less than n. In this case it is possible to show that P(s)/Q(s ) has a 
partial fraction expansion of the form 

Pis) A 1 A„ 

7TTT — 1 1 . (0 

Q(s) s-r x s — r n 

where the coefficients A 1? . . . ,A n must be determined. 

(a) Show that 

A k = P(r k )/Q\r k ), k = l,...,n. (ii) 

Hint: One way to do this is to multiply Eq. (i) by s — r k and then to take the limit as s -> r k . 

(b) Show that 

£ _ 1 {F(S')} = V ^P^-e rk ‘. (iii) 

y Q{r k ) y 


6.3 Step Functions 


In Section 6.2 we outlined the general procedure involved in solving initial value 
problems by means of the Laplace transform. Some of the most interesting elemen- 
tary applications of the transform method occur in the solution of linear differential 
equations with discontinuous or impulsive forcing functions. Equations of this type 
frequently arise in the analysis of the flow of current in electric circuits or the vibra- 
tions of mechanical systems. In this section and the following ones we develop some 
additional properties of the Laplace transform that are useful in the solution of such 
problems. Unless a specific statement is made to the contrary, all functions appearing 
below will be assumed to be piecewise continuous and of exponential order, so that 
their Laplace transforms exist, at least for s sufficiently large. 

To deal effectively with functions having jump discontinuities, it is very helpful to 
introduce a function known as the unit step function or Heaviside function. 

This function will be denoted by u c and is defined by 


u c (t) = 


0, l < c, 

1, t>c. 


c > 0. 


( 1 ) 
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The graph of y = u c (t ) is shown in Figure 6.3.1. We have somewhat arbitrarily as- 
signed the value one to u c at t = c. However, for a piecewise continuous function 
such as u c , recall that the value at a discontinuity point is usually irrelevant. The step 
can also be negative. For instance, Figure 6.3.2 shows the graph y = 1 — u c (t). 
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FIGURE 6.3.1 Graph of 

FIGURE 6.3.2 Graph of 


y = Ucit)- 

y= 1 -u c it). 


EXAMPLE 

1 


Sketch the graph of y = hit), where 

h{t) = u„{t) - u 2 *(f), t> 0. 


From the definition of u c (t) in Eq. (1), we have 


hit) = 


0-0 = 0 , 
1-0 = 1 , 
1-1 = 0 , 


0 < t < n, 
n < t < 2n, 
2n < t < oo. 


Thus the equation y = h{t) has the graph shown in Figure 6.3.3. This function can be thought 
of as a rectangular pulse. 
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FIGURE 6.3.3 Graph of y = w T (t) — u l7t {t). 


Consider the function 

EXAMPLE 

2 


/w = 


2, 

5, 

- 1 , 

1 , 


0 < t < 4, 
4 < t < 7, 
7 < t < 9, 
f> 9, 


( 2 ) 


whose graph is shown in Figure 6.3.4. Express/(t) in terms of u c (t). 


6.3 Step Functions 


325 


r 

5 

4 

3 

2 

1 


-1 


We start with the function /j (f) = 2, which agrees with /(f) on [0,4). To produce the jump 
of three units at f = 4, we add 3 « 4 (f) to/j(f), obtaining 

flit) = 2 + 3u 4 (t), 

which agrees with/(f) on [0, 7). The negative jump of six units at f = 7 corresponds to adding 
— 6z/ 7 (f), which gives 

flit) = 2 + 3 u 4 (t) — 6w 7 (t). 

Finally, we must add 2 u 9 (t) to match the jump of two units at t = 9. Thus we obtain 

fit) = 2 + 3z< 4 (f) — 6w 7 (f) + 2ug(t). (3) 
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4 

6 

8 

10 

12 t 









FIGURE 6.3.4 Graph of the function in Eq. (2). 


The Laplace transform of u c is easily determined: 


l 


C{u c {t)} = / e s ‘u c (t)dt = 


[ 


s > 0. 


( 4 ) 


For a given function / defined for t > 0, we will often want to consider the related 
function g defined by 

0, t < c, 

fit - c), t > c. 


y = g(f ) = 


which represents a translation of / a distance c in the positive t direction; see Fig- 
ure 6.3.5. In terms of the unit step function we can write git) in the convenient 
form 

g(t) = u c it) f it -c). 


The unit step function is particularly important in transform use because of the fol- 
lowing relation between the transform of /(f) and that of its translation n c (f)/(f — c). 
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Theorem 6.3.1 


EXAMPLE 

3 




FIGURE 6.3.5 A translation of the given function, (a) y = /(f); ( b ) y = u c (t)f(t — c). 


If F(s ) = £{f(t)} exists for s > a > 0, and if c is a positive constant, then 

£{u c {i)f{t-c)}=e~ cs C{f(t)} = e- cs F{s), s > a. (5) 

Conversely, if /(f) = £ _1 {F(5)J, then 

Uc(t)f(t-c) = £~ 1 {e~ cs F(s)}. (6) 


Theorem 6.3.1 simply states that the translation of /(f) a distance c in the positive t 
direction corresponds to the multiplication of F(s) by e~ cs . To prove Theorem 6.3.1, 
it is sufficient to compute the transform of u c (t)f(t — c ): 

POO 

£{u c (t)f(t — c)} = / e~ st u c {t)f{t — c) dt 
Jo 

/ oo 

e~ st f(t — c) dt. 

Introducing a new integration variable / = t — c, we have 

POO POO 

£{u c (t)f(t -c)}= e-« +c)1 /(£) = e~ cs / e~*m d% 

Jo Jo 

= e~ cs F{s). 

Thus Eq. (5) is established; Eq. (6) follows by taking the inverse transform of both 
sides of Eq. (5). 

A simple example of this theorem occurs if we take f(t ) = 1. Recalling that 
£{ 1} = 1/s, we immediately have from Eq. (5) that £{u c (t) \ = e~ cs /s. This result 
agrees with that of Eq. (4). Examples 3 and 4 illustrate further how Theorem 6.3.1 
can be used in the calculation of transforms and inverse transforms. 


If the function / is defined by 


/« = 


{ sin f, 

sinf + cos(f — 7 t/4), 


0 < t < 7t/4, 
t > 7t/4, 


find £{f(t ) }. The graph of y = /(f) is shown in Figure 6.3.6. 
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FIGURE 6.3.6 Graph of the function in Example 3. 


Note that /(f) = sinf + g(t), where 
git) = 

Thus 


0, t < 7r/4, 

cos(f — jr/4), t > n / 4. 


git) = r< ir/4 (0cos(f-jr/4), 


and 


C{ fit)} = £{sin r) + C{u„n{t) cos (r — 7r/4)} 
= £{sinf) + e -ir ' !/4 £{cosf}. 

Introducing the transforms of sin t and cos t, we obtain 


C{fit)} = 


1 


+ e 


-ns / 4 _ 


1 + se ~ nsl 4 


s 2 + 1 s 2 + 1 s 2 + 1 

You should compare this method with the calculation of C[f(t)} directly from the definition. 


EXAMPLE 

4 


Find the inverse transform of 


From the linearity of the inverse transform we have 

/(f) = £- 1 {F(,)} = £- 1 {l}-£- 1 {^} 

= f- u 2 (t)(t-2). 

The function / may also be written as 


/(f) = 


0 < t < 2, 
t > 2. 


The following theorem contains another very useful property of Laplace trans- 
forms that is somewhat analogous to that given in Theorem 6.3.1. 
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Theorem 6.3.2 ^ F{s) = £{/(f)} exists for s > a > 0, and if c is a constant, then 

C{e ct f(t)} = F(s — c), s > a + c. 
Conversely, if/ (t) = £ _1 {F(s)}, then 

eF t f{t) = C~\F{s-c)}. 


(7) 

( 8 ) 


According to Theorem 6.3.2, multiplication of f(t) by e ct results in a translation of 
the transform F(s) a distance c in the positive s direction, and conversely. To prove 
this theorem, we evaluate £{e cl f(t)}. Thus 

/* oo /»oo 

£{e ct f(t)}= e~ st e ct f{t)dt= e~^[ f (t) dt 

Jo Jo 

= F(s - c ), 

which is Eq. (7). The restriction s > a + c follows from the observation that, ac- 
cording to hypothesis (ii) of Theorem 6.1.2, |/(f)| < Ke at ; hence \e cl f(t)\ < Ke la+C)l . 
Equation (8) is obtained by taking the inverse transform of Eq. (7), and the proof is 
complete. 

The principal application of Theorem 6.3.2 is in the evaluation of certain inverse 
transforms, as illustrated by Example 5. 


Find the inverse transform of 


EXAMPLE 


G(s) = 


1 


s 2 — 4s + 5 

By completing the square in the denominator, we can write 

1 


G(s) = 


= F{s - 2), 


(s -2) 2 + l 

where F(s) — ( s 2 + l) -1 . Since £ _1 {C(s)} = sinf, it follows from Theorem 6.3.2 that 

g(t) — Zr/GO)} = e 2 ' sint. 


The results of this section are often useful in solving differential equations, partic- 
ularly those that have discontinuous forcing functions. The next section is devoted 
to examples illustrating this fact. 


PROBLEMS In each of Problems 1 through 6 sketch the graph of the given function on the interval t > 0. 

1. g(t) = u.\ {t) + 2 u 3 (t) - 6m 4 (0 2. g(t) = (f - 3)w 2 (0 - (f - 2)it 3 (f) 

3 . g(t) = f(t — n)u n (t), where/(t) = f 2 4. g(f) =f(t — 3)ui(t), where f(t) = sin t 

5. g{t) = f(t - l)M 2 (f), where /(f) = 2 1 

6. git) = (f - l)Mi(f) - 2(f - 2 )h 2 (/) + (f - 3)113(0 
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In each of Problems 7 through 12: 

(a) Sketch the graph of the given function. 

(b) Express /(f) in terms of the unit step function u c (t). 


7. fit) = 


9. f{t) = 


1 , 

2 ) 


0 < f < 3, 

3 < f < 5, 

5 < f < 7, 

f > 7. 

8. 

/(0 = 

1 , 

- 1 , 

1, 

-1, 

0, 

0 < f < 1, 

1 < t < 2, 
2<t <3, 
3 < t < 4, 

f > 4. 

0 < f < 2, 

10. 

fit) = ■ 

1 

0 < f < 2, 

* ^ n 


11. fit) = 


12. f(t) = 


f, 0 < t < 1, 

t — 1, 1 < t < 2, 

f - 2, 2 < f < 3, 

0, f > 3. 

In each of Problems 13 through 18 find the Laplace transform of the given function. 


t, 0 < t < 2, 

2, 2 < f < 5, 

7 — t, 5 < t <7, 

0, f > 7. 


13. /(f) = 


0, f < 2 

it - 2) 2 , f > 2 


14. /(f) = 


0, t < 1 

f 2 - 2f + 2, f > 1 


0, f < jr 

15. /(f) = ■ t — n, n < t < 2n 
0, t >2n 

17. fit ) = (f - 3)M 2 (f) - (f - 2)w 3 (f) 


16. /(f) = «i(f) + 2 H 3 (f) — 6 z/ 4 (f) 


18. /(f) = f — «i(f)(f — 1), f > 0 


In each of Problems 19 through 24 find the inverse Laplace transform of the given function. 


19. Fis) — 
21. Fis) = 
23. Fis) = 


3! 


(s - 2) 4 

2(s - l)e _2s 
s 2 — 2s + 2 

is — 2)e _J 
j 2 — 4 j + 3 


20. Fis) = 
22. F(s) = 
24. Fis) = 


s 2 + s — 2 
2e~ 2s 


+ e~ 2s - e~ is - e- 


25. Suppose that Fis) = £{/(f)} exists fori > a > 0. 

(a) Show that if c is a positive constant, then 

jC{/(cf)} = -F (-^ , s > ca. 

(b) Show that if A: is a positive constant, then 

= !/(£). 

(c) Show that if n and b are constants with a > 0, then 

C~ l {Fias + b)) = -e~ b,/a f ( -) . 

a \a ) 


330 


Chapter 6. The Laplace Transform 


In each of Problems 26 through 29 use the results of Problem 25 to find the inverse Laplace 
transform of the given function. 


26. 


F(s) = 


2” +1 n! 

s n+1 


27. 


F(s) = 


2s + 1 
4 s 2 + 4s + 5 


28. 


F(s) = 


1 

9s 2 — 12s + 3 


29. F(s) = 


2s- 1 


In each of Problems 30 through 33 find the Laplace transform of the given function. In Problem 
33 assume that term-by-term integration of the infinite series is permissible. 


30. f(t) = 


0 < t < 1 
t > 1 


31. fit) = 


1, 

0 , 

1, 

o, 


0 < t < 1 

1 < t < 2 

2 < t < 3 
t > 3 


2n+l 

32. fit) = 1 - Miff) + • • • + U 2n it) - U 2 „+l(t) =1+ ^(-1 ) k u k (t) 

k = 1 
oo 

33. fit) = 1 + ^ (— 1 ) k u k it). See Figure 6.3.7. 

*:=i 



1 2 

FIGURE 6.3.7 


3 4 5 

A square wave. 


34. Let / satisfy f{t+T)= fit) for all t > 0 and for some fixed positive number T; f is said 
to be periodic with period T on 0 < t < oo. Show that 


C{ fit)) = 



e s 'f(t) dt 


1 - e~ sT 


In each of Problems 35 through 38, use the result of Problem 34 to find the Laplace transform 
of the given function. 


35. fit) = 


0 < t < 1, 

1 < t < 2; 


36. fit) = 



0< t < 1, 
1 < t < 2; 


fit + 2) =fit). 

Compare with Problem 33. 


fit + 2) = fit). 
See Figure 6.3.8. 



FIGURE 6.3.8 A square wave. 
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37. fit) = t, 0 < t < 1; 
f(t+l)=f(t). 

See Figure 6.3.9. 



FIGURE 6.3.9 A sawtooth wave. 


38. fit) = sin t, 0 < t < 7r; 
fit + n) = f {t). 

See Figure 6.3.10. 



n 2n 3n t 

FIGURE 6.3.10 A rectified sine wave. 


39. (a) If fit) = 1 — u\ it), find £{/(?)}; compare with Problem 30. Sketch the graph of 
y=m. 


(b) Let git) = J fi%)d$ 
y = git) and find £{g(f)}. 


, where the function/ is defined in part (a). Sketch the graph of 


(c) Let hit ) = git) — u\it)git — 1), where g is defined in part (b). Sketch the graph of 
y = hit) and find C[hit)}. 


40. Consider the function p defined by 


Pit) = 



0< t < 1, 
1 < t < 2; 


pit + 2) =pit). 


(a) Sketch the graph of y = pit). 

(b) Find £{p(f) ) by noting that p is the periodic extension of the function h in Prob- 
lem 39(c) and then using the result of Problem 34. 

(c) Find C{pit)} by noting that 

Pit) = f fit) dt, 

Jo 

where / is the function in Problem 36, and then using Theorem 6.2.1. 


6.4 Differential Equations with Discontinuous Forcing Functions 


In this section we turn our attention to some examples in which the nonhomogeneous 
term, or forcing function, is discontinuous. 




Find the solution of the differential equation 

2y" + y' + 2y = git), 


where 


git) = U 5 (t) - U 2 o(t) = 


5 < t < 20, 

0 < t < 5 and t > 20. 


( 1 ) 


( 2 ) 
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Assume that the initial conditions are 

y(0) = 0, 


/( 0 ) = 0 . 


(3) 


This problem governs the charge on the capacitor in a simple electric circuit with a unit 
voltage pulse for 5 < t < 20. Alternatively, _y may represent the response of a damped oscillator 
subject to the applied force g(t). 

The Laplace transform of Eq. (1) is 

2 s 2 Y(s) - 2sy(0) - 2/(0) + sY(s) - y(0) + 2Y(s) = jC{ms (0) - C{ii 2 o(t)} 


= (e 5s — e 

Introducing the initial values (3) and solving for Y(s ), we obtain 

Y (s) = 


,S )A- 


s(2s 2 + s + 2) 
To find y = it is convenient to write Y (s) as 


Y(s) = ( e - 5s - e~ 20s )H(s). 


where 


H(s) = 


1 


s(2s 2 +5 + 2) 

Then, if h(t) = Cr l { H ( 5 )}, we have 

y = (p(t) = Us(t)h(t — 5) — u 2 o{t)h(t — 20). 


(4) 

(5) 

( 6 ) 

(7) 


Observe that we have used Theorem 6.3.1 to write the inverse transforms of e 5s H(s) and 
e - 20 s H(s), respectively. Finally, to determine h(t), we use the partial fraction expansion of 
H(s): 


TT , a bs + c 
H{S) ~~s + 2s 2 +s + 2- 

Upon determining the coefficients, we find that a = \ , b = — 1, and c = — \ . Thus 

hm-W-LL i_ 

s 2s 2 + s + 2 


( 8 ) 


1/2 

s 


1^ (■ 5 + j) + I 

2 , 


(9) 


( s + ?) + if 

so that, by referring to lines 9 and 10 of Table 6.2.1, we obtain 

h(t) = 1 — 1 ^e _ ' /4 cos(V , 15f/4) + (VT5/15)e” f/4 s in(-s/l5f/4)j . (10) 

In Figure 6.4.1 the graph of y = 4>(t) from Eqs. (7) and (10) shows that the solution consists 
of three distinct parts. For 0 < t < 5 the differential equation is 


2y" + / + 2y = 0 


( 11 ) 


and the initial conditions are given by Eq. (3). Since the initial conditions impart no energy to 
the system, and since there is no external forcing, the system remains at rest; that is, y = 0 for 
0 < t < 5. This can be confirmed by solving Eq. (11) subject to the initial conditions (3). In 
particular, evaluating the solution and its derivative at t = 5, or, more precisely, as t approaches 
5 from below, we have 

y(5) = 0, /( 5) = 0. (12) 

Once t > 5, the differential equation becomes 
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2y"+y'+2y = l, (13) 

whose solution is the sum of a constant (the response to the constant forcing function) and 
a damped oscillation (the solution of the corresponding homogeneous equation). The plot 
in Figure 6.4.1 shows this behavior clearly for the interval 5 < t < 20. An expression for this 
portion of the solution can be found by solving the differential equation (13) subject to the 
initial conditions (12). Finally, for t > 20 the differential equation becomes Eq. (11) again, 
and the initial conditions are obtained by evaluating the solution of Eqs. (13), (12) and its 
derivative at t = 20. These values are 

V (20) = 0.50162, / (20) = 0.01125. (14) 

The initial value problem (11), (14) contains no external forcing, so its solution is a damped 
oscillation about y = 0, as can be seen in Figure 6.4.1. 
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FIGURE 6.4.1 Solution of the initial value problem (1 ), (2), (3). 


Although it may be helpful to visualize the solution shown in Figure 6.4.1 as composed of 
solutions of three separate initial value problems in three separate intervals, it is somewhat 
tedious to find the solution by solving these separate problems. Laplace transform methods 
provide a much more convenient and elegant approach to this problem and to others having 
discontinuous forcing functions. 


The effect of the discontinuity in the forcing function can be seen if we examine the 
solution </>(f ) of Example 1 more closely. According to the existence and uniqueness 
Theorem 3.2.1, the solution 0 and its first two derivatives are continuous except 
possibly at the points t — 5 and t = 20, where g is discontinuous. This can also be 
seen at once from Eq. (7). One can also show by direct computation from Eq. (7) 
that 0 and 0' are continuous even at t = 5 and t = 20. However, if we calculate 0", 
we find that 

lim 0"(f) — 0, lim 0"(f) = 1/2. 
t — >-5 — t — >-5+ 
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Consequently, cp"(t) has a jump of 1/2 at t = 5. In a similar way we can show that 
has a jump of —1/2 at t = 20. Thus the jump in the forcing term g(t) at these 
points is balanced by a corresponding jump in the highest order term 2 y" on the left 
side of the equation. 

Consider now the general second order linear equation 

y" + p(t)y' + q{t)y = git), (15) 

where p and q are continuous on some interval a < t < ft, but g is only piecewise 
continuous there. If y = i/r(f) is a solution of Eq. (15), then x// and xjr' are continuous 
on a < t < p, but i/r" has jump discontinuities at the same points as g. Similar remarks 
apply to higher order equations; the highest derivative of the solution appearing in 
the differential equation has jump discontinuities at the same points as the forcing 
function, but the solution itself and its lower derivatives are continuous even at those 
points. 


EXAMPLE 

2 


Describe the qualitative nature of the solution of the initial value problem 


where 



/ 

+ 

ii 

= g(t). 

(16) 

y(0) = 

o, 

o 

II 

o 

(17) 


0, 


0 < t < 5, 


git) = 

it- 

- 5)/5, 

5 < t < 10, 

(18) 


l, 


t > 10, 



and then find the solution. 

In this example the forcing function has the graph shown in Figure 6.4.2 and is known as 
ramp loading. It is relatively easy to identify the general form of the solution. For t < 5 the 
solution is simply y = 0. On the other hand, for t > 10 the solution has the form 


y = Ci cos 2f + c 2 sin 2r + 1/4. 


(19) 


The constant 1/4 is a particular solution of the nonhomogeneous equation, while the other 
two terms are the general solution of the corresponding homogeneous equation. Thus the 
solution (19) is a simple harmonic oscillation about y — 1/4. Similarly, in the intermediate 
range 5 < t < 10, the solution is an oscillation about a certain linear function. In an engineering 
context, for example, we might be interested in knowing the amplitude of the eventual steady 
oscillation. 

To solve the problem, it is convenient to write 

git) = [«5(0(r - 5) - «io(0(f - 10)] /5, (20) 

as you may verify. Then we take the Laplace transform of the differential equation and use 
the initial conditions, thereby obtaining 

(j 2 + 4) Y(s) = (e~ 5s - e~ 10s )/5s 2 


or 

Y(s) = (e- 5 * - e~ Ws )H(s)/ 5, 

(21) 

where 

H(S) ~ s 2 (s 2 + 4) ' 

(22) 
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y*- 

l - 

0.5 - 

FIGURE 6.4.2 


y = g(t) 


5 10 15 20 t 

Ramp loading; y = g(t) from Eq. (18). 


Thus the solution of the initial value problem (16), (17), (18) is 

y = </>(t) = [us(t)h(t — 5) — uio(t)h(t — 10)] /5, (23) 

where h(t) is the inverse transform of H(s). The partial fraction expansion of H(s ) is 

1/4 1/4 

H(s) =J T - (24 

s 1 s l + 4 

and it then follows from lines 3 and 5 of Table 6.2.1 that 

h(t) = if — | sin 2f. (25) 


The graph of y = 4>{t) is shown in Figure 6.4.3. Observe that it has the qualitative form that 
we indicated earlier. To find the amplitude of the eventual steady oscillation, it is sufficient to 
locate one of the maximum or minimum points for t > 10. Setting the derivative of the solution 
(23) equal to zero, we find that the first maximum is located approximately at (10.642, 0.2979), 
so the amplitude of the oscillation is approximately 0.0479. 
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FIGURE 6.4.3 
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Solution of the initial value problem (16), (17), (18). 


Note that in this example the forcing function g is continuous but g' is discontinuous at 
t — 5 and t — 10. It follows that the solution ip and its first two derivatives are continuous 
everywhere, but <p"' has discontinuities at t = 5 and at t = 10 that match the discontinuities in 
g' at those points. 
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PROBLEMS 


41 

41 

41 

41 

41 

41 

41 

41 

41 

41 

41 

41 

41 


41 


41 


In each of Problems 1 through 13: 

(a) Find the solution of the given initial value problem. 

(b) Draw the graphs of the solution and of the forcing function; explain how they are related. 


1. 

2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 

9. 

10 . 

11 . 

12 . 

13. 

14. 

15. 

16. 


y"+y=f{t)\ 3/(0) = 0, y'(0) = 1; /(f) = 

y" + 2y' + 2y = h(t)- y(0) = 0, /(0) = 1; h(t) = 


1, 0 < t < 3n 
0, 3?r < t < oo 


1, n < t < 2n 

0, 0 < t < n and t > 2n 


y" + 4y = sinf — zz 2lr (f) sin(f — 2 jt); y(0) = 0, y’(0) = 0 

y" + 4y = sin t + u n (t) sin(t — n)\ y (0) = 0, y'(0) = 0 


1, 0 < f < 10 

0, f > 10 


y" + 3y' + 2y = /(f); y( 0) = 0, y'(0) = 0; /(f) = 

y" + 3/ + 2 y = u 2 (f); y(0) = 0, y'(0) = 1 

y" + y = u 3n (ty, 3(0) = 1, y'(0) = 0 


y" + y = g(0; y(0) = 0, y'(0) = 1; g(t) = 


y" + 4 y = u„ (f) - u 3n (f); y(0) = 0, y'{ 0) = 0 

y< 4 > - y = Ml (f) - u 2 (t ); y(0) = 0, y'(0) = 0, y"(0) = 0, y'"( 0) = 0 

y< 4) + 5 y" + Ay = 1 - u„ (f ); y(0) = 0, y' (0) = 0, y"(0) = 0, /"( 0) = 0 


/( 0) = 

0 


L/2, 

VI 

O 

< 6 

= 1 3 - 

f > 6 


[ 

sinf, 

0 < f < TC 

g(0 = 

0, 


t >n 


Find an expression involving u c {t) for a function / that ramps up from zero at f = f 0 to the 
value h at f = to + 

Find an expression involving n c (f) for a function g that ramps up from zero at f = to to the 
value h at f = fo + k and then ramps back down to zero at f = t 0 + 2k. 

A certain spring-mass system satisfies the initial value problem 


u" + |Z<' + ii = A:g(f), z<(0) = 0, zz'(0) = 0, 


where g(f) = zz 3/2 (f) — 115 / 2(0 and k > 0 is a parameter. 

(a) Sketch the graph of g(f). Observe that it is a pulse of unit magnitude extending over 
one time unit. 

(b) Solve the initial value problem. 

(c) Plot the solution for k = 1/2, k = 1, and k = 2. Describe the principal features of the 
solution and how they depend on k. 

(d) Find, to two decimal places, the smallest value of k for which the solution zz(f) reaches 
the value 2. 

(e) Suppose k = 2. Find the time r after which |zz(f)| < 0.1 for all f > r. 

17. Modify the problem in Example 2 of this section by replacing the given forcing function 
g ( 0 by 


/(f) = [zz 5 (f ) (f - 5) - u 5+k (0(t - 5 - k)] /k. 
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(a) Sketch the graph of /(f) and describe how it depends on k. For what value of k is /(f) 
identical to g(f) in the example? 

(b) Solve the initial value problem 

y" + 4y = /(f), y(0) = 0, /(0) = 0. 

(c) The solution in part (b) depends on k, but for sufficiently large f the solution is always 
a simple harmonic oscillation about y = 1/4. Try to decide how the amplitude of this 
eventual oscillation depends on k. Then confirm your conclusion by plotting the solution 
for a few different values of k. 

18. Consider the initial value problem 

/+ \? + 4y=Mf), y(0) = 0, /(0) = 0, 

where 

[ 1/2 k, A-k <t <4 + k 
f k (t ) = \ 

|0, 0<f<4 — k and f > 4 + k 


and 0 < k < 4. 

(a) Sketch the graph of /*(f). Observe that the area under the graph is independent of k. 
If f k (t) represents a force, this means that the product of the magnitude of the force and 
the time interval during which it acts does not depend on k. 

(b) Write f k (t) in terms of the unit step function and then solve the given initial value 
problem. 

(c) Plot the solution for k = 2, k = 1, and k =\. Describe how the solution depends 
on k. 

Resonance and Beats. In Section 3.8 we observed that an undamped harmonic oscillator 
(such as a spring-mass system) with a sinusoidal forcing term experiences resonance if 
the frequency of the forcing term is the same as the natural frequency. If the forcing 
frequency is slightly different from the natural frequency, then the system exhibits a beat. 
In Problems 19 through 23 we explore the effect of some nonsinusoidal periodic forcing 
functions. 

19. Consider the initial value problem 

y" + y=f(t), y(0) = 0, /(0) = 0, 


where 

n 

fit) = u 0 (t) + 2 ^(-1 ) k u kn (t). 

k=l 

(a) Draw the graph of /(f) on an interval such as 0 < t < 6n. 

(b) Find the solution of the initial value problem. 

(c) Let n = 15 and plot the graph of the solution for 0 < f < 60. Describe the solution 
and explain why it behaves as it does. 

(d) Investigate how the solution changes as n increases. What happens as n — > oo? 

20. Consider the initial value problem 

y + o.iy + v = /«, y( 0 ) = o, /(0) = o, 

where /(f) is the same as in Problem 19. 
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(a) Plot the graph of the solution. Use a large enough value of n and a long enough 
/-interval so that the transient part of the solution has become negligible and the steady 
state is clearly shown. 

(b) Estimate the amplitude and frequency of the steady state part of the solution. 

(c) Compare the results of part (b) with those from Section 3.8 for a sinusoidally forced 
oscillator. 

21. Consider the initial value problem 

y" + y = g(t), yi 0) = o, /(0) = o, 

where 

n 

git) = «o(0 + ^(- \) k u kn it). 

k=l 

(a) Draw the graph of g(t) on an interval such as 0 < / < 6n. Compare the graph with 
that of /(f) in Problem 19(a). 

(b) Find the solution of the initial value problem. 

(c) Let n = 15 and plot the graph of the solution for 0 < f < 60. Describe the solution 
and explain why it behaves as it does. Compare it with the solution of Problem 19. 

(d) Investigate how the solution changes as n increases. What happens as n — > oo? 

22. Consider the initial value problem 

y" + O.iy + y = git), y( 0) = 0, /(0) = 0, 

where git) is the same as in Problem 21. 

(a) Plot the graph of the solution. Use a large enough value of n and a long enough 
f-interval so that the transient part of the solution has become negligible and the steady 
state is clearly shown. 

(b) Estimate the amplitude and frequency of the steady state part of the solution. 

(c) Compare the results of part (b) with those from Problem 20 and from Section 3.8 for 
a sinusoidally forced oscillator. 

23. Consider the initial value problem 

y" +y = hit), _y(0) = 0, y'(0) = 0, 

where 

n 

fit) = M 0 (f) + 2 ^(-l)V U/4 (f). 

k=l 

Observe that this problem is identical to Problem 19 except that the frequency of the 
forcing term has been increased somewhat. 

(a) Find the solution of this initial value problem. 

(b) Let n > 33 and plot the solution for 0 < t < 90 or longer. Your plot should show a 
clearly recognizable beat. 

(c) From the graph in part (b) estimate the “slow period” and the “fast period” for this 
oscillator. 

(d) For a sinusoidally forced oscillator, it was shown in Section 3.8 that the “slow fre- 
quency” is given by \a> — a>o|/2, where a> 0 is the natural frequency of the system and co is 
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the forcing frequency. Similarly, the “fast frequency” is (co + coo) /2. Use these expressions 
to calculate the “fast period” and the “slow period” for the oscillator in this problem. How 
well do the results compare with your estimates from part (c)? 


6.5 Impulse Functions 


In some applications it is necessary to deal with phenomena of an impulsive nature — 
for example, voltages or forces of large magnitude that act over very short time 
intervals. Such problems often lead to differential equations of the form 

ay" + by' + cy = g(t), (1) 


where g(t) is large during a short interval to — r < t < to + x and is otherwise zero. 
The integral /( r), defined by 

/•‘o+r 

I(t)= / g(t)dt, ( 2 ) 

J to-z 

or, since g(t) = 0 outside of the interval (to — r , to + t), 

/ OO 

g(t) dt, (3) 

-OO 

is a measure of the strength of the forcing function. In a mechanical system, where 
g(t) is a force, /( r) is the total impulse of the force g(t) over the time interval 
(to — r, to + r). Similarly, if y is the current in an electric circuit and g(t) is the time 
derivative of the voltage, then /( r) represents the total voltage impressed on the 
circuit during the interval (to — r, to + r). 

In particular, let us suppose that to is zero and that g(t) is given by 


g(t) = d T (f) = 


l/2r, 

0 , 


— x < t < r, 
t < — r or f > r, 


( 4 ) 


where r is a small positive constant (see Figure 6.5.1). According to Eq. (2) or (3), 
it follows immediately that in this case /( r) = 1 independent of the value of r, as 
long as r / 0. Now let us idealize the forcing function d T by prescribing it to act 
over shorter and shorter time intervals; that is, we require that r — >■ 0, as indicated in 
Figure 6.5.2. As a result of this limiting operation, we obtain 

lim d T (t ) = 0, t / 0. (5) 

T — >-0 


y 
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1 2t 
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T ? 


FIGURE 6.5.1 Graph of y = d r (t). 
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ii 

III I ! 


t 

FIGURE 6.5.2 Graphs of y = d z ( t ) as r — > 0. 


Further, since 7(r) = 1 for each r ^ 0, it follows that 


lim /( r) = 1. (6) 

T — ^0 

Equations (5) and (6) can be used to define an idealized unit impulse function 5 , 
which imparts an impulse of magnitude one at t — 0 but is zero for all values of t 
other than zero. That is, the “function” 5 is defined to have the properties 


5(0 = 0, t ^ 0; 


(7) 



5(0 dt = 1. 


( 8 ) 


There is no ordinary function of the kind studied in elementary calculus that satisfies 
both Eqs. (7) and (8). The “function” 5, defined by those equations, is an example of 
what are known as generalized functions; it is usually called the Dirac 2 delta function. 
Since 5 ( t ) corresponds to a unit impulse at t = 0, a unit impulse at an arbitrary point 
t = to is given by 5(f — to). From Eqs. (7) and (8) it follows that 


8(t - t (i ) = 0, t / to; 


(9) 



8(t — to) dt = 1. 


( 10 ) 


The delta function does not satisfy the conditions of Theorem 6.1.2, but its Laplace 
transform can nevertheless be formally defined. Since 5 ( t ) is defined as the limit of 
d T ( t ) as r — >■ 0, it is natural to define the Laplace transform of 5 as a similar limit of 


2 Paul A. M. Dirac (1902-1984), English mathematical physicist, received his Ph.D. from Cambridge in 
1926 and was professor of mathematics there until 1969. He was awarded the Nobel Prize in 1933 (with 
Erwin Schrodinger) for fundamental work in quantum mechanics. His most celebrated result was the 
relativistic equation for the electron, published in 1928. From this equation he predicted the existence of 
an “anti-electron,” or positron, which was first observed in 1932. Following his retirement from Cambridge, 
Dirac moved to the United States and held a research professorship at Florida State University. 
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the transform of d T . In particular, we will assume that to > 0 and define £{8(t — to)} 
by the equation 


£{8(t — t 0 )} = lim £{d z (t — t 0 )}. 

r — >0 


( 11 ) 


To evaluate the limit in Eq. (11), we first observe that if r < f 0 , which must eventually 
be the case as r — »■ 0, then f 0 — r > 0. Since d z (t — t 0 ) is nonzero only in the interval 
from to — r to to + r, we have 


C{d T (t- t 0 )} = 



st d z (t — t 0 ) dt 


e st d T (t — f 0 ) dt. 


Substituting for d z (t — to) from Eq. (4), we obtain 


£{d z (t — t 0 )} = 


j rto+* 

2r lo-T 


dt = e 

2 s x 


t=t 0 +T 

t=t 0 -T 


= —e~ sta {e ST - e~ ST ) 

2s T 

or 

, sinh sr ,, 

C{d T (t-to)}= ( 12 ) 

sr 

The quotient (sinh5t)/5t is indeterminate as r — >■ 0, but its limit can be evaluated by 
L’Hospital’s rule. We obtain 

sinh sr s cosh sr 

lim = lim = 1. 

t->o sr t—>o s 

Then from Eq. (11) it follows that 

C{8(t - to)} = e~ st \ (13) 


Equation (13) defines £{8{t — to)} for any to > 0. We extend this result, to allow to to 
be zero, by letting to -> 0 on the right side of Eq. (13); thus 

£{8(t)} = lim e~ st ° = 1. (14) 

f 0 ->0 


In a similar way it is possible to define the integral of the product of the delta 
function and any continuous function /. We have 



<5(t — to)f(t) dt — lim 



d r (t — to)f(t) dt. 


(15) 


Using the definition (4) of d z (t) and the mean value theorem for integrals, we find 
that 
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where to — r < t* < to + r . Hence t* — > to as r —> 0, and it follows from Eq. (15) that 



S(t- to) f{t) dt=f(t 0 ). 


(16) 


The following example illustrates the use of the delta function in solving an initial 
value problem with an impulsive forcing function. 


EXAMPLE 

1 


Find the solution of the initial value problem 


2y" +y' + 2y = S(t - 5), (17) 

}’(0) = 0, /(0) = 0. (18) 

This initial value problem arises from the study of the same electrical circuit or mechanical 
oscillator as in Example 1 of Section 6.4. The only difference is in the forcing term. 

To solve the given problem, we take the Laplace transform of the differential equation and 
use the initial conditions, obtaining 

(2i 2 +i + 2)y(j) = e“ 5s . 


Thus 


Y(s) = 


2s 2 + s + 2 2 ( s+ i ) 2 +if 

By Theorem 6.3.2 or from line 9 of Table 6.2.1, 

1 


£ - 


, , 4 ^, 

(»+i) +H I ^5 4 


Hence, by Theorem 6.3.1, we have 


y = C ^YCs)} = -J= u s (t)e ( ' 5,/4 sin (t - 5), 


(19) 


(20) 


( 21 ) 


y 






0.3 






0.2 






0.1 


1 

[ 

1 

1 > 



5 

10 

!5 y 

20 t 

-0.1 







FIGURE 6.5.3 Solution of the initial value problem (17), (18). 
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which is the formal solution of the given problem. It is also possible to write y in the form 


y = 


0 , 

VI5 


t < 5, 


2 5J/4 , ■ ^5 


( 22 ) 


The graph of Eq. (22) is shown in Figure 6.5.3. Since the initial conditions at t = 0 are 
homogeneous and there is no external excitation until t = 5, there is no response in the interval 
0 < t < 5. The impulse at t = 5 produces a decaying oscillation that persists indefinitely. The 
response is continuous at t = 5 despite the singularity in the forcing function at that point. 
However, the first derivative of the solution has a jump discontinuity at t = 5, and the second 
derivative has an infinite discontinuity there. This is required by the differential equation (17), 
since a singularity on one side of the equation must be balanced by a corresponding singularity 
on the other side. 


In dealing with problems with impulsive forcing the use of the delta function usually 
simplifies the mathematical calculations, often quite significantly. However, if the 
actual excitation extends over a short, but nonzero, time interval, then an error will 
be introduced by modeling the excitation as taking place instantaneously. This error 
may be negligible, but in a practical problem it should not be dismissed without 
consideration. In Problem 16 you are asked to investigate this issue for a simple 
harmonic oscillator. 


PROBLEMS 


41 

41 

41 

41 

41 

41 

41 

41 

41 

41 

41 

41 

41 


In each of Problems 1 through 12: 

(a) Find the solution of the given initial value problem. 

(b) Draw a graph of the solution. 

1. y" + 2y' + 2y = ,5(f-jr); y(0) = 1, y'(0) = 0 

2. y" + 4y = 8(t — n) — 8(t — 2jt); y(0) = 0, y'(0) = 0 

3. y" + 3/ + 2y = S(t - 5) + u w (t); y(0) = 0, y'(0) = 1/2 

4. y"-y = -205(1-3); y(0) = 1, y'(0) = 0 

5. y" + 2 y * 1 11 + 3y = sinf + 8(t — 3jt); y(0) = 0, y'(0) = 0 

6. y" + 4 y = 8(t - 4jt); y(0) = 1/2, /(0) = 0 

7- y" +y = 8(t — 27r)c°sr; y(0) = 0, y'(0) = 1 

8. y" + 4y = 28(t - jr/4); y(0) = 0, /( 0) = 0 

9. y" +y= u n/2 (t ) + 38 (t - 3n/2) - M 2ir (t); y(0) = 0, /(0) = 0 

10. 2y" + y + 4y = 8(t - n/6) sin f; y(0) = 0, y'(0) = 0 

11. y" + 2 y + 2y = cos t + <5(f - jt/ 2); y(0) = 0, y'(0) = 0 

12. y (4) — y = 8(t — 1); y(0) = 0, y'(0) = 0, y"(0) = 0, y"'(0) = 0 

13. Consider again the system in Example 1 of this section, in which an oscillation is excited 
by a unit impulse at t = 5. Suppose that it is desired to bring the system to rest again after 
exactly one cycle — that is, when the response first returns to equilibrium moving in the 
positive direction. 

(a) Determine the impulse k8(t — to) that should be applied to the system in order to 
accomplish this objective. Note that k is the magnitude of the impulse and t 0 is the time 
of its application. 
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(b) Solve the resulting initial value problem and plot its solution to confirm that it behaves 
in the specified manner. 

14. Consider the initial value problem 

y" + yy' + y = &(t - 1), y(0) = o, /(()) = o, 

where y is the damping coefficient (or resistance). 

(a) Let y = \- Find the solution of the initial value problem and plot its graph. 

(b) Find the time t\ at which the solution attains its maximum value. Also find the maxi- 
mum value yi of the solution. 

(c) Let y = | and repeat parts (a) and (b). 

(d) Determine how t\ and yj vary as y decreases. What are the values of fi and yi when 
Y = 0? 

15. Consider the initial value problem 

y" + yy' + y=k8(t- 1), y(0) = 0, y'(0) = 0, 

where k is the magnitude of an impulse at t = 1 and y is the damping coefficient (or 
resistance). 

(a) Let y — Find the value of k for which the response has a peak value of 2; call this 
value k\ . 

(b) Repeat part (a) for y = j. 

(c) Determine how k\ varies as y decreases. What is the value of k\ when y = 0? 

16. Consider the initial value problem 

/ + y = fk(t), y(0) = 0, y'(0) = 0, 

where fift) = [z< 4 _* (f) — iu+kit)]/2k with 0 < k < 1. 

(a) Find the solution y = fit, k) of the initial value problem. 

(b) Calculate lim <p(t,k) from the solution found in part (a). 

k^0 

(c) Observe that lim f k (t) = 8(t — 4). Find the solution (j> 0 {t) of the given initial value 

k — >-0 

problem with/tR) replaced by 5(r — 4). Is it true that ij>o(t) = lim <p(t,k)7 

>-0 

(d) Plot <p(t, 1/2), tpit, 1/4), and <pa(t) on the same axes. Describe the relation between 
<t>(t,k ) and cf>o(t). 

Problems 17 through 22 deal with the effect of a sequence of impulses on an undamped 
oscillator. Suppose that 

y" +y =f(t), y (0) = 0, y'(0) = 0. 


41 

41 


For each of the following choices for/R): 

(a) Try to predict the nature of the solution without solving the problem. 

(b) Test your prediction by finding the solution and drawing its graph. 

(c) Determine what happens after the sequence of impulses ends. 

20 20 

17. f(t) =Y.8(t-kn) 18. fit ) = £ (-1)* +1 <5R - krc) 

k = 1 k = 1 


20 


19. fit) =£«(*- kjr/2) 

k=l 


20 

41 20. fit) = J2i-l) k ^8it - kn/2) 

k=l 
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15 40 

21. f(t) = £«[*- (2k - 1 )n] 22. f(t) = £ (-1 ) k+1 S(t - llk/4) 

k = 1 k = 1 

23. The position of a certain lightly damped oscillator satisfies the initial value problem 

20 

y" + 0.1/ + y = ^(-l/ +1 5(r - kit), y(0) = 0, /( 0) = 0. 

k=l 

Observe that, except for the damping term, this problem is the same as Problem 18. 

(a) Try to predict the nature of the solution without solving the problem. 

(b) Test your prediction by finding the solution and drawing its graph. 

(c) Determine what happens after the sequence of impulses ends. 

24. Proceed as in Problem 23 for the oscillator satisfying 

15 

y" + 0.1/ + y = 8\t - (2k - 1)jt], y(0) = 0, y'(0) = 0. 

*=i 

Observe that, except for the damping term, this problem is the same as Problem 21. 

25. (a) By the method of variation of parameters, show that the solution of the initial value 
problem 

y" + 2/ + 2y=f(t); y( 0) = 0, /(0) = 0 


is 

y = f e~ (t ~ r) f(z) sin(t — r) dr. 

Jo 

(b) Show that if f(t) = S(t — n), then the solution of part (a) reduces to 

y = w jr (f)e -(,-ir) sin(f — n). 

(c) Use a Laplace transform to solve the given initial value problem with 
f(t) = S(t — n) and confirm that the solution agrees with the result of part (b). 
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Sometimes it is possible to identify a Laplace transform H(s) as the product of two 
other transforms F(s) and G(.v), the latter transforms corresponding to known func- 
tions/ and g, respectively. In this event, we might anticipate that H(s) would be the 
transform of the product of / and g. However, this is not the case; in other words, 
the Laplace transform cannot be commuted with ordinary multiplication. On the 
other hand, if an appropriately defined “generalized product” is introduced, then the 
situation changes, as stated in the following theorem. 


Theorem 6.6.1 ^ = £{/(0} and G(s) = £{g(f)} both exist for s > a > 0, then 


H(s) = F(s)G(s) = £{h(t )}, 


s > a, 


(i) 
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where 

fit - x)gix)dx = f fix) g(t - x)dx. (2) 

Jo 

The function h is known as the convolution of / and g; the integrals in Eq. (2) are 
known as convolution integrals. 



The equality of the two integrals in Eq. (2) follows by making the change of vari- 
able t — r = £ in the first integral. Before giving the proof of this theorem, let us 
make some observations about the convolution integral. According to this theorem, 
the transform of the convolution of two functions, rather than the transform of their 
ordinary product, is given by the product of the separate transforms. It is conven- 
tional to emphasize that the convolution integral can be thought of as a “generalized 
product” by writing 

hit) = if *g)(t). (3) 

In particular, the notation if * g)(t) serves to indicate the first integral appearing in 

Eq. (2). 

The convolution f * g has many of the properties of ordinary multiplication. For 
example, it is relatively simple to show that 


f*g = g*f 

(commutative law) 

( 4 ) 

f*(gl+gl)=f*gl+f*g2 

(distributive law) 

( 5 ) 

if * g) *h=f * ig*h) 

(associative law) 

(6) 

/*0 = 0*/ = 0 . 


( 7 ) 


In Eq. (7) the zeros denote not the number 0 but the function that has the value 0 
for each value of t. The proofs of these properties are left to you as exercises. 

However, there are other properties of ordinary multiplication that the convolution 
integral does not have. For example, it is not true in general that / * 1 is equal to /. 
To see this, note that 

(/*1)(0= f fit - x) -ldx = f fit — x) dx. 

Jo Jo 

If, for example, fit) = cos t, then 

t=i 

(/*l)(f) = 1 cos(f — r) dx = — sin(f — r) 

Jo r=0 

= — sin 0 + sin t 

— sin t . 

Clearly, if * l)(f) ^ /(f) in this case. Similarly, it may not be true that / * / is non- 
negative. See Problem 3 for an example. 

Convolution integrals arise in various applications in which the behavior of the 
system at time t depends not only on its state at time t but also on its past history. 
Systems of this kind are sometimes called hereditary systems and occur in such diverse 
fields as neutron transport, viscoelasticity, and population dynamics, among others. 
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Turning now to the proof of Theorem 6.6.1, we note first that if 


and 


F(s) = 


G(s) = 



e-*mdt 


e sr g(T)dr, 


then 


/»oo /» oo 

F(s)G(s)= e~*mdS e-"g( r) 
JO J 0 


Jr. 


( 8 ) 


Since the integrand of the first integral does not depend on the integration variable 
of the second, we can write F(s)G(s) as an iterated integral 


/*oo r /*oo 

F(s)G(s) = I e~ ST g( r) / 

J o L Jo 

/»oo r />oo 

= / g(r) / d% 

Jo L jo 


Jr 


Jr. 


( 9 ) 


The latter integral can be put into a more convenient form by introducing a change 
of variable. Let £ = t — r, for fixed r, so that J£ = dt. Further, £ = 0 corresponds to 
t = x and £ = oo corresponds to f = oo; then the integral with respect to £ in Eq. (9) 
is transformed into one with respect to t: 


F(s)G(s ) = 




e rt /(f — r) Jf 


Jr. 


(10) 


The iterated integral on the right side of Eq. (10) is carried out over the shaded wedge- 
shaped region extending to infinity in the fr-plane shown in Figure 6.6.1. Assuming 
that the order of integration can be reversed, we finally obtain 


F(s)G(s) = 



fit- r)g(r) Jr 


dt 


( 11 ) 



FIGURE 6.6.1 Region of integration in F(s)G(s). 
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EXAMPLE 

1 


EXAMPLE 

2 


or 

pOO 

F(s)G(s)= / e~ s, h(t)dt 

Jo 

= mm ( 12 ) 

where h(t) is defined by Eq. (2). This completes the proof of Theorem 6.6.1. 


Find the inverse transform of 


ms) = 


a 

s 2 (s 2 + a 2 ) 


(13) 


ft is convenient to think of H(s) as the product of s~ 2 and a/(s 2 + a 2 ), which, according to 
lines 3 and 5 of Table 6.2.1, are the transforms of t and sin at, respectively. Hence, by Theorem 
6.6.1, the inverse transform of H(s) is 


h{t) = / (t — z) sinar dr = 

Jo a ~ 

You can verify that the same result is obtained if h(t) is written in the alternative form 


at — sin at 


(14) 


li(t) = 


/" tsi 

Jo 


sina(f — r) dr, 


which confirms Eq. (2) in this case. Of course, h(t) can also be found by expanding H{s) in 
partial fractions. 


Find the solution of the initial value problem 

y" + 4y = g(t), (15) 

y(0) = 3, /( 0) = -1. (16) 

By taking the Laplace transform of the differential equation and using the initial conditions, 
we obtain 

s 2 Y(s) — 3s + 1 4- 4Y (j) = G(s) 


or 


Y(s) = 


3s — 1 G(s) 
s 2 + 4 s 2 + 4 


(17) 


Observe that the first and second terms on the right side of Eq. (17) contain the dependence 
of Y ( 5 ) on the initial conditions and forcing function, respectively. It is convenient to write 
Y (s) in the form 


Y (s) = 3 


s 

s 2 + 4 


1 

2 


2 1 
s 2 + 4 + 2 


2 

s 2 + 4 


G(s). 


(18) 


Then, using lines 5 and 6 of Table 6.2.1 and Theorem 6.6.1, we obtain 


y = 3 cos 2r — i sin 2t + ( 


/' 


sin 2 (t — r)g(r) dr. 


(19) 


If a specific forcing function g is given, then the integral in Eq. (19) can be evaluated (by 
numerical means, if necessary). 


Example 2 illustrates the power of the convolution integral as a tool for writing 
the solution of an initial value problem in terms of an integral. In fact, it is possible 
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to proceed in much the same way in more general problems. Consider the problem 
consisting of the differential equation 

ay" + by' + cy = g(t), (20) 

where a, b, and c are real constants and g is a given function, together with the initial 
conditions 

y(0)=y 0 , /(0) = 3 >q. (21) 

The transform approach yields some important insights concerning the structure of 
the solution of any problem of this type. 

The initial value problem (20), (21) is often referred to as an input-output problem. 
The coefficients a, b , and c describe the properties of some physical system, and g{t) 
is the input to the system. The values yo and y' Q describe the initial state, and the 
solution y is the output at time t. 

By taking the Laplace transform of Eq. (20) and using the initial conditions (21), 
we obtain 

(as 2 + bs + c)T(s) — (us + b)yo — ay' Q = G(s ). 


If we let 


<Ks) = 


(as + b)y 0 + ay' 0 
as 2 + bs + c 


then we can write 


TCs) = 


G(s) 

as 2 + bs + c 


Consequently, 


Y(s) — &(s) + 'I'(s). 

y = (p(G + $(t). 


( 22 ) 

(23) 

(24) 


where <j>(t) — C 1 {4>(j)} and i fr(t) = C 1 {'k(s)}. Observe that cp(t) is the solution of 
the initial value problem 

ay" + by 1 + cy = 0, y(0)=y 0 , /(0)=>’o, (25) 


obtained from Eqs. (20) and (21) by setting g(t) equal to zero. Similarly, i jr(t) is the 
solution of 


ay" + by' + cy = g(t), y(0) = 0, /(0) = 0, (26) 


in which the initial values yo and y' 0 are each replaced by zero. 

Once specific values of a, b , and c are given, we can find 4>(t) — £ -1 {0(.s)} by using 
Table 6.2.1, possibly in conjunction with a translation or a partial fraction expansion. 
To find i/r(t) = (^)J, it is convenient to write 'b(s) as 


T(.s) = H(s)G(s), (27) 

where H(s) — (as 2 + bs + c) _l . The function H is known as the transfer function 3 
and depends only on the properties of the system under consideration; that is, H(s) is 
determined entirely by the coefficients a, b, and c. On the other hand, G(s ) depends 


3 This terminology arises from the fact that H(s) is the ratio of the transforms of the output and the input 
of the problem ( 26 ). 
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only on the external excitation git) that is applied to the system. By the convolution 
theorem we can write 

i lr{t) = C~ l {H{s)Gis)} = f h(t — r)g(r) dr, (28) 

Jo 

where h(t ) = { H (,v) } , and git) is the given forcing function. 

To obtain a better understanding of the significance of h{t), we consider the case in 
which G(s) = 1; consequently, g(t) = 8(t) and 'I'(s) = H(s). This means that y = h{t ) 
is the solution of the initial value problem 

ay" + by' +cy = 8(t), y(0) = 0, /( 0) = 0, (29) 

obtained from Eq. (26) by replacing g(t) by 8(t). Thus h(t) is the response of the 
system to a unit impulse applied at t = 0, and it is natural to call hit) the impulse 
response of the system. Equation (28) then says that ft it) is the convolution of the 
impulse response and the forcing function. 

Referring to Example 2, we note that in that case, the transfer function is 
H{s) — 1 /{s 2 3 + 4) and the impulse response is h{t) — (sin2f)/2. Also, the first two 
terms on the right side of Eq. (19) constitute the function </>(f ), the solution of the 
corresponding homogeneous equation that satisfies the given initial conditions. 


PROBLEMS 


1. Establish the commutative, distributive, and associative properties of the convolution 
integral. 

(a) f*g = g*f 

(b) / * (gi + gi) = f * gi +f * g2 

(c) f * ig * h) = (f * g) * h 

2. Find an example different from the one in the text showing that if * 1 )(t) need not be 
equal to fit). 

3. Show, by means of the example fit) = sin ?, that / * / is not necessarily nonnegative. 


In each of Problems 4 through 7 find the Laplace transform of the given function. 

4. fit) = f it — t) 2 cos2r dr 5. fit) = f sin r rfr 

Jo Jo 


-f 


6. f(t) = I (t — z)e r dz 


-f* 


7 . fit) = t sin(r — r) cos r dr 


In each of Problems 8 through 11 find the inverse Laplace transform of the given function by 
using the convolution theorem. 

8 - ^O) = a, l 9. Fis) = 


10. Fis) = 


5 4 (j 2 + 1) 
1 


11. Fis) = 


is + l)(s 2 + 4) 
Gis) 


is + l) 2 (s 2 + 4) ' 5 2 + l 

12. (a) If / it) = t m and git) = t n , where m and n are positive integers, show that 


f*g = t‘ 


f.m+n+1 


/' 


n (l — u) n du. 
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(b) Use the convolution theorem to show that 


f 

Jo 


u m (1 — u) n du = 


(m + n + 1)! 


(c) Extend the result of part (b) to the case where m and n are positive numbers but not 
necessarily integers. 

In each of Problems 13 through 20 express the solution of the given initial value problem in 
terms of a convolution integral. 


13. 

/' 

+ 

e o 

ii 

*-*• 

O 

= 0, /(0) 

= 1 



14. 

y" 

+ 2 y' + 2 y = sin at; 

o 

II 

o 

/(0) = o 



15. 

4 y 

" + 4 y + 11 y = g(0; 

o 

II 

o 

y'(0) = o 



16. 

y" 

+ y' +\y = l-n„(0; 

y(0) = 

1, /(0) = 

-1 


17. 

y" 

+ 4/ + 4 y = g(t ); 

y( 0) = 2 , 

y' (0) = -3 



18. 

y" 

+ 3 y' + 2 y = cos at; 

y( 0) = l, 

y' (0) = o 



19. 

y <4) - y = g(t)', v(0) = 

= 0, y'(0): 

= 0, y"( 0) 

= 0, 

Y"( 0) 

20. 

y< 4) + 5 y" + 4y = g(t); 

y(0) = l, 

y'(0) = o. 

y"( 0) 

= 0, 

21. 

Consider the equation 








0(0+ I 

Jo 

i 

kit-W&di; = 

m. 


/"( 0 ) = 0 


in which / and k are known functions, and 0 is to be determined. Since the unknown 
function ij> appears under an integral sign, the given equation is called an integral equation; 
in particular, it belongs to a class of integral equations known as Volterra integral equations. 
Take the Laplace transform of the given integral equation and obtain an expression for 
C[4>(t)} in terms of the transforms C{f(t)} and £[k(t)} of the given functions/ and k. The 
inverse transform of £{0(t)} is the solution of the original integral equation. 

22. Consider the Volterra integral equation (see Problem 21) 


010 + [ (f-£)0(£)rff = sin2t. 

Jo 

(a) Solve the integral equation (i) by using the Laplace transform. 

(b) By differentiating Eq. (i) twice, show that 0(0 satisfies the differential equation 

<p"(t) + 0(0 = — 4sin2r. 

Show also that the initial conditions are 


(i) 


0(0) = 0, 0'(O) = 2. 

(c) Solve the initial value problem in part (b) and verify that the solution is the same as 
the one in part (a). 

In each of Problems 23 through 25: 

(a) Solve the given Volterra integral equation by using the Laplace transform. 

(b) Convert the integral equation into an initial value problem, as in Problem 22(b). 

(c) Solve the initial value problem in part (b) and verify that the solution is the same as the 
one in part (a). 
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23. 

25. 


m + [ (*-$)*($)# = i 

Jo 

<Kt ) + 2 f cos (f - £)</>(?) d£ = e - ' 
Jo 


24. </>(f) — [ (t -?)</>(?) d? = l 
Jo 


There are also equations, known as integro-differential equations, in which both derivatives 

and integrals of the unknown function appear. In each of Problems 26 through 28: 

(a) Solve the given integro-differential equation by using the Laplace transform. 

(b) By differentiating the integro-differential equation a sufficient number of times, convert 

it into an initial value problem. 

(c) Solve the initial value problem in part (b) and verify that the solution is the same as the 

one in part (a). 

26. <f>'{t) + f (t - ?)0(?) d£ = t, 0( 0) = 0 

27. <f>'(t) -1 [ (f - f) 2 0(? ) df = — f, d>(0) = 1 

28. <*>'(0 + 0(0= / sin(f-f)0(£)d£, 0(0) = 1 

Jo 

29. The Tautochrone. A problem of interest in the history of mathematics is that of finding 
the tautochrone 4 — the curve down which a particle will slide freely under gravity alone, 
reaching the bottom in the same time regardless of its starting point on the curve. This 
problem arose in the construction of a clock pendulum whose period is independent of 
the amplitude of its motion. The tautochrone was found by Christian Huygens (1629- 
1695) in 1673 by geometrical methods, and later by Leibniz and Jakob Bernoulli using 
analytical arguments. Bernoulli’s solution (in 1690) was one of the first occasions on 
which a differential equation was explicitly solved. The geometric configuration is shown 
in Figure 6.6.2. 


y 


P(a, b) 


C 



x 


FIGURE 6.6.2 The tautochrone. 


The starting point P(a, b) is joined to the terminal point (0, 0) by the arc C. Arc length s 
is measured from the origin, and f(y) denotes the rate of change of s with respect to y: 


f(y ) = 


ds 

dy 



(i) 


4 The word “tautochrone” comes from the Greek words tauto , meaning same, and chronos , meaning time. 
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Then it follows from the principle of conservation of energy that the time T ( b ) required 
for a particle to slide from P to the origin is 


T(b) = 


1 f b f(y ) 
yJTg Jo yjb -y 


(ii) 


(a) Assume that T(b) = 7o, a constant, for each b. By taking the Laplace transform of 
Eq. (ii) in this case, and using the convolution theorem, show that 


then show that 



m = 


a/2 g T 0 
X y/y' 


Hint: See Problem 27 of Section 6.1. 

(b) Combining Eqs. (i) and (iv), show that 


dx 2a — y 
dy~y y 


(hi) 


(iv) 


(v) 


where a = gT^/n 2 . 

(c) Use the substitution y — 2a sin 2 (0/2) to solve Eq. (v), and show that 


x = c*(# + sin#), y = a(\ — cos#). 


(vi) 
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is an arc of a cycloid. 
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There are many physical problems that involve a number of separate elements linked 
together in some manner. For example, electrical networks have this character, as 
do many problems in mechanics and in other fields. In these and similar cases, the 
corresponding mathematical problem consists of a system of two or more differential 
equations, which can always be written as first order equations. In this chapter we 
focus on systems of first order linear equations, and in particular equations having 
constant coefficients, utilizing some of the elementary aspects of linear algebra to 
unify the presentation. In many respects this chapter follows the same lines as the 
treatment of second order linear equations in Chapter 3. 


7.1 Introduction 


Systems of simultaneous ordinary differential equations arise naturally in problems 
involving several dependent variables, each of which is a function of the same single 
independent variable. We will denote the independent variable by t, and will let 
xi,X 2 ,X 3 , . . . represent dependent variables that are functions of t. Differentiation 
with respect to t will be denoted by a prime. 

For example, consider the spring-mass system in Figure 7.1.1. The two masses 
move on a frictionless surface under the influence of external forces F\{t) and FTr), 
and they are also constrained by the three springs whose constants are ki,k. 2 , and 
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respectively. Using arguments similar to those in Section 3.7, we find the following 
equations for the coordinates xi and X 2 of the two masses: 


mi 


d 2 x i 
dt 2 


m 2 


d 2 x 2 
dt 2 


- k 2 (x 2 - Xi) - kix 1 + Fi{t) 

= -(ki + k 2 )X\ + k 2 x 2 + Fiit), 
= k 2 x 2 - k 2 (x 2 - xi) + F 2 (t) 


(1) 


= k 2 x i - (k 2 + k 3 )x 2 + F 2 (t). 
A derivation of Eqs. (1) is outlined in Problem 17. 


l F 1 it) | F 2 {t) 

I ► I ► 


k\ 

*NNN SNh 

' kn ' U 

1 

m l 

i 


i 

m 2 



^www^ 

1 1 

1 X-. 1 Xo 

1 1 w 1 1 _ 


FIGURE 7.1.1 A two-mass, three-spring system. 


Next, consider the parallel LRC circuit shown in Figure 7.1.2. Let V be the voltage 
drop across the capacitor and I the current through the inductor. Then, referring to 
Section 3.7 and to Problem 19 of this section, we can show that the voltage and current 
are described by the system of equations 


dl _ V 
dt L ’ 
dV _ I V 
~dt ~ ~C ~ RC ’ 

where L is the inductance, C is the capacitance, and R is the resistance. 


( 2 ) 



FIGURE 7.1.2 A parallel LRC circuit. 


One reason why systems of first order equations are particularly important is that 
equations of higher order can always be transformed into such systems. This is usually 
required if a numerical approach is planned, because almost all codes for generat- 
ing numerical approximations to solutions of differential equations are written for 
systems of first order equations. The following example illustrates how easy it is to 
make the transformation. 
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EXAMPLE 

1 


The motion of a certain spring-mass system (see Example 3 of Section 3.7) is described by the 
second order differential equation 

u" + 0.125m' + m = 0. (3) 


Rewrite this equation as a system of first order equations. 

Let x\ = m and X 2 — u' . Then it follows that x\ = x 2 . Further, u" = x r 2 . Then, by substituting 
for u, u', and u" in Eq. (3), we obtain 

x' 2 + 0.125x2 + X\ — 0. 


Thus x\ and x 2 satisfy the following system of two first order differential equations: 

x\ = x 2 , 

x' 2 = —x\ — 0.125x2. 


(4) 


The general equation of motion of a spring-mass system 

mu!' + yu' + ku — F(t) (5) 

can be transformed into a system of first order equations in the same manner. If we 
let x\ = u and x 2 = u', and proceed as in Example 1, we quickly obtain the system 


X, = —(k/m)x i — (y /m)X 2 + F(t)/m. 

To transform an arbitrary nth order equation 

y w = F{t,y,y’,...,/ n - l) ) (7) 

into a system of n first order equations, we extend the method of Example 1 by 
introducing the variables xi,X 2 , . . . ,x„ defined by 

xi =y, x 2 =y, x 3 =y", ..., x n =y n - x \ (8) 


It then follows immediately that 

x[ = x 2 , 

A_ = 

x n- 1 = X ni 

and, from Eq. (7), 

x 'n = F(t,X i,X 2 ,...,X„). 

Equations (9) and (10) are a special case of the more general system 

x\ = Fi(t,xi,x 2 ,...,x„), 
x' 2 = F 2 (t.x i,x 2 ,...,x„), 

X'n = F n (t,X i,X 2 ,...,X„). 


( 9 ) 


(10) 


( 11 ) 
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In a similar way the system (1) can be reduced to a system of four first order equations 
of the form (11), and the system (2) is already in this form. In fact, systems of the 
form (11) include almost all cases of interest, so much of the more advanced theory 
of differential equations is devoted to such systems. 

A solution of the system (11) on the interval I: a < t < fl is a set of n functions 

*l = 0l(O, *2 = <fc(0, x n = (j) n (t) (12) 

that are differentiable at all points in the interval I and that satisfy the system of equa- 
tions (11) at all points in this interval. In addition to the given system of differential 
equations, there may also be given initial conditions of the form 

*l(f 0 )=*l, X2(t 0 )=X°, ..., x n (t 0 )=x°, (13) 

that to is a specified value of t in /, and xj, . . . , xjj are prescribed numbers. The 
differential equations (11) and the initial conditions (13) together form an initial 
value problem. 

A solution (12) can be viewed as a set of parametric equations in an n-dimensional 
space. For a given value of f, Eqs. (12) give values for the coordinates xi, . . . ,x„ 
of a point in the space. As t changes, the coordinates in general also change. The 
collection of points corresponding to a < t < ft forms a curve in the space. It is 
often helpful to think of the curve as the trajectory, or path, of a particle moving in 
accordance with the system of differential equations (11). The initial conditions (13) 
determine the starting point of the moving particle. 

The following conditions on F \ , F>, . . . , F„, which are easily checked in specific 
problems, are sufficient to ensure that the initial value problem (11), (13) has a 
unique solution. Theorem 7.1.1 is analogous to Theorem 2.4.2, the existence and 
uniqueness theorem for a single first order equation. 


Theorem 7.1.1 Let eac h °f the functions F\,. . . ,F n and the partial derivatives 3Fi/3xi, 
...,3Fi/3x„, . . . , 3F„/3xi, . . . , 3F„/3x„ be continuous in a region R of 
txiX 2 ■ ■ -x„-space defined by a < t < /J, ai < x\ < /h, . . . ,a n <x n < /8„, and let the 
point (to.XpXj, . . . ,x°) be in R. Then there is an interval 1 1 — F < h in which there 
exists a unique solution x\ = <p\ (/),..., x„ = 4> n (t) of the system of differential equa- 
tions (11) that also satisfies the initial conditions (13). 


The proof of this theorem can be constructed by generalizing the argument in 
Section 2.8, but we do not give it here. However, note that, in the hypotheses of the 
theorem, nothing is said about the partial derivatives of F \, . . . , F n with respect to the 
independent variable t . Also, in the conclusion, the length 2h of the interval in which 
the solution exists is not specified exactly, and in some cases it may be very short. 
Finally, the same result can be established on the basis of somewhat weaker but more 
complicated hypotheses, so the theorem as stated is not the most general one known, 
and the given conditions are sufficient, but not necessary, for the conclusion to hold. 

If each of the functions F\, . . . ,F n in Eqs. (11) is a linear function of the dependent 
variables x\, . . . ,x„, then the system of equations is said to be linear; otherwise, it is 
nonlinear. Thus the most general system of n first order linear equations has the form 
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PROBLEMS 


x\ = P\ \(t)X\ H b P\n(t)X n + gl(t), 

x' 2 = p 2 l(t)Xl H b P 2 n(t)x n + g 2 (t), ^ 

X n = p n \ (t)X\ H b Pnn(t)x n + g„(t). 

If each of the functions giit), . . . ,g n (t ) is zero for all t in the interval /, then the 
system (14) is said to be homogeneous; otherwise, it is nonhomogeneous. Observe 
that the systems (1) and (2) are both linear. The system (1) is nonhomogeneous unless 
Fi{t) = F 2 (t) — 0, while the system (2) is homogeneous. For the linear system (14), 
the existence and uniqueness theorem is simpler and also has a stronger conclusion. 
It is analogous to Theorems 2.4.1 and 3.2.1. 


If the functions pn, p\ 2 , . . . ,p nn , gi,---,gn are continuous on an open interval 
1: a < t < /l, then there exists a unique solution xi = </>i(f), • • ■ ,x n = (pnit) of the 
system (14) that also satisfies the initial conditions (13), where to is any point in /, 
and x°, . . . , x° n are any prescribed numbers. Moreover, the solution exists through- 
out the interval /. 


Note that, in contrast to the situation for a nonlinear system, the existence and 
uniqueness of the solution of a linear system are guaranteed throughout the interval 
in which the hypotheses are satisfied. Furthermore, for a linear system the initial 
values . . . ,x° at t = t 0 are completely arbitrary, whereas in the nonlinear case the 
initial point must lie in the region R defined in Theorem 7.1.1. 

The rest of this chapter is devoted to systems of linear first order equations (non- 
linear systems are included in the discussion in Chapters 8 and 9). Our presentation 
makes use of matrix notation and assumes that you have some familiarity with the 
properties of matrices. The basic facts about matrices are summarized in Sections 7.2 
and 7.3, and some more advanced material is reviewed as needed in later sections. 


In each of Problems 1 through 4 transform the given equation into a system of first order 
equations. 

1. u" + 0.5w' + 2u = 0 2. u" + 0.5m' + 2u = 3 sin t 

3. t * 1 2 * * 5 6 7 u" + tu! + (t 2 — 0.25 )u = 0 4. i/ (4) — u = 0 

In each of Problems 5 and 6 transform the given initial value problem into an initial value 

problem for two first order equations. 

5. u" + 0.25m' + 4a = 2 cos 3f, u(0) — 1, w'(0) = — 2 

6. u" + p{t)u' + q(t)u — g(t), u)0) = uq, u'(O) = u' 0 

7. Systems of first order equations can sometimes be transformed into a single equation of 
higher order. Consider the system 

x[ = —2xi + x 2 , x' 2 — X\ — 2x 2 . 

(a) Solve the first equation for x 2 and substitute into the second equation, thereby ob- 
taining a second order equation for X\. Solve this equation for x 1 and then determine x 2 
also. 
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(b) Find the solution of the given system that also satisfies the initial conditions xi(0) = 2, 
x 2 ( 0 ) = 3. 

(c) Sketch the curve, for t > 0, given parametrically by the expressions for x\ and x 2 
obtained in part (b). 


In each of Problems 8 through 12 proceed as in Problem 7. 

(a) Transform the given system into a single equation of second order. 

(b) Find x\ and x 2 that also satisfy the given initial conditions. 

(c) Sketch the graph of the solution in the xix 2 -plane for t > 0. 


8. x[ = 3xi — 2 x 2 , xi(0) = 3 

x, = 2xi — 2 x 2 , x 2 (0) = 1 

10. x[ — x\ — 2x 2 , xi(0) = —1 

x 2 = 3xi — 4 x 2 , x 2 (0) = 2 

12. x! x = — 0.5xi + 2 x 2 , Xj(0) = —2 


9. Xj = 1.25xi + 0.75x 2 , xi(0) = —2 

x! 2 = 0.75xi + 1.25x 2 , x 2 (0) = 1 

11. x\ = 2x 2 , xi(0) = 3 

x 2 = — 2xi, x 2 (0) = 4 


x ' 2 = — 2xi — 0.5x 2 , x 2 (0) = 2 

13. Transform Eqs. (2) for the parallel circuit into a single second order equation. 

14. Show that if an, a\ 2 , a 2 1 , and a 22 are constants with a \ 2 and a 2 1 not both zero, and if the 
functions gi and g 2 are differentiable, then the initial value problem 


x[ - duXi + a 12 x 2 + gi(t), xi(0) = x° 

x 2 — a 21 X! + a 22 x 2 + g 2 (t), x 2 (0)=X2 


can be transformed into an initial value problem for a single second order equation. Can 
the same procedure be carried out if a n , ■ ■ . ,a 22 are functions of f? 

15. Consider the linear homogeneous system 


x ' = Pn(t)x + p 12 (t)y, 
y' = P2i(t)x + p 22 (t)y. 


Show that if x = xi (t), y = y t (t) and x = x 2 (f), y = y 2 {t) are two solutions of the given 
system, thenx = cixi(t) + c 2 x 2 (t), y = ciyi(r) + c 2 y 2 (t) is also a solution for any constants 
Ci and c 2 . This is the principle of superposition. 

16. Let x = Xi(t), y = yi(7) and x = x 2 (f), y — y 2 (t) be any two solutions of the linear nonho- 
mogeneous system 


x' = Pn(t)x + Pn(t)y + gi(t), 

y = P2i(t)x + p 22 {t)y + g 2 (t). 


Show that x = Xi(t) — x 2 (t),y = yi(t) — y 2 {t) is a solution of the corresponding homoge- 
neous system. 

17. Equations (1) can be derived by drawing a free-body diagram showing the forces acting 
on each mass. Figure 7.1.3fl shows the situation when the displacements xi and x 2 of the 
two masses are both positive (to the right) andx 2 > X\. Then springs 1 and 2 are elongated 
and spring 3 is compressed, giving rise to forces as shown in Figure 7.1.36. Use Newton’s 
law (F = ma ) to derive Eqs. (1). 

18. Transform the system (1) into a system of first order equations by letting y 2 — X\ , y 2 = x 2 , 
y 2 = Xj, and V 4 = x 2 . 
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FIGURE 7.1.3 (a) The displacements X\ and x 2 are both positive, (b) The tree-body diagram 
for the spring-mass system. 


Electric Circuits. The theory of electric circuits, such as that shown in Figure 7.1.2, consisting 
of inductors, resistors, and capacitors, is based on Kirchhoff’ s laws: (1) The net flow of current 
into each node (or junction) is zero, and (2) the net voltage drop around each closed loop 
is zero. In addition to Kirchhoffs laws, we also have the relation between the current 7 in 
amperes through each circuit element and the voltage drop V in volts across the element: 

V = 777, R = resistance in ohms; 

=7, C = capacitance in farads 1 ; 
at 

d.1 

L— = V, L = inductance in henrys. 
dt 

Kirchhoffs laws and the current-voltage relation for each circuit element provide a system of 
algebraic and differential equations from which the voltage and current throughout the circuit 
can be determined. Problems 19 through 21 illustrate the procedure just described. 

19. Consider the circuit shown in Figure 7.1.2. Let 7 1; 7 2 , and 7 3 be the currents through 
the capacitor, resistor, and inductor, respectively. Likewise, let V\, V 2 , and V 3 be the 
corresponding voltage drops. The arrows denote the arbitrarily chosen directions in which 
currents and voltage drops will be taken to be positive. 

(a) Applying Kirchhoffs second law to the upper loop in the circuit, show that 

Vi - v 2 = 0. (i) 


In a similar way, show that 

V 2 -V 3 = 0. (ii) 

(b) Applying Kirchhoffs first law to either node in the circuit, show that 

h+I 2 + h = 0. (iii) 

(c) Use the current-voltage relation through each element in the circuit to obtain the 
equations 

CV[=h, V 2 = RI 2 , 7.7' = V 3 . (iv) 


Actual capacitors typically have capacitances measured in microfarads. We use farad as the unit for 
numerical convenience. 
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(d) Eliminate Vt, V 3 , I\, and I 2 among Eqs. (i) through (iv) to obtain 

CV'i = -h ~ LI 3 = Vi. (V) 

Observe that if we omit the subscripts in Eqs. (v), then we have the system (2) of this 
section. 


20. Consider the circuit shown in Figure 7.1.4. Use the method outlined in Problem 19 to 
show that the current I through the inductor and the voltage V across the capacitor satisfy 
the system of differential equations 


dl 

dt 


-I-V, 


dV 

dt 


21 -V. 


R = 1 ohm 



L = 1 henry 


FIGURE 7.1.4 The circuit in Problem 20. 


21. Consider the circuit shown in Figure 7.1.5. Use the method outlined in Problem 19 to 
show that the current I through the inductor and the voltage V across the capacitor satisfy 
the system of differential equations 


L 


dl 

dt 


= -RJ-V, 



= 1 - 


V 

Ri' 


L 



FIGURE 7.1.5 The circuit in Problem 21. 


22. Consider the two interconnected tanks shown in Figure 7.1.6. Tank 1 initially contains 
30 gal of water and 25 oz of salt, and Tank 2 initially contains 20 gal of water and 15 oz 
of salt. Water containing 1 oz/gal of salt flows into Tank 1 at a rate of 1.5 gal/min. The 
mixture flows from Tank 1 to Tank 2 at a rate of 3 gal/min. Water containing 3 oz/gal of 
salt also flows into Tank 2 at a rate of 1 gal/min (from the outside). The mixture drains 
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1.5 gal/min 


1 gal/min 



FIGURE 7.1.6 Two interconnected tanks (Problem 22). 


from Tank 2 at a rate of 4 gal/min, of which some flows back into Tank 1 at a rate of 1.5 
gal/min, while the remainder leaves the system. 

(a) Let Qi(t) and Q 2 (t ), respectively, be the amount of salt in each tank at time t. Write 
down differential equations and initial conditions that model the flow process. Observe 
that the system of differential equations is nonhomogeneous. 

(b) Find the values of Qi and Q 2 for which the system is in equilibrium, that is, does not 
change with time. Let Qf and Qf be the equilibrium values. Can you predict which tank 
will approach its equilibrium state more rapidly? 

(c) Letxi = Qi(t) — Qf andx 2 = Qi(t) — Qf - Determine an initial value problem forxi 
and x 2 . Observe that the system of equations for x\ and x 2 is homogeneous. 

23. Consider two interconnected tanks similar to those in Figure 7.1.6. Tank 1 initially contains 
60 gal of water and Q J oz of salt, and Tank 2 initially contains 100 gal of water and Q\ 
oz of salt. Water containing q\ oz/gal of salt flows into Tank 1 at a rate of 3 gal/min. The 
mixture in Tank 1 flows out at a rate of 4 gal/min, of which half flows into Tank 2, while 
the remainder leaves the system. Water containing q 2 oz/gal of salt also flows into Tank 2 
from the outside at the rate of 1 gal/min. The mixture in Tank 2 leaves the tank at a rate 
of 3 gal/min, of which some flows back into Tank 1 at a rate of 1 gal/min, while the rest 
leaves the system. 

(a) Draw a diagram that depicts the flow process described above. Let Qi(t) and Q 2 (t), 
respectively, be the amount of salt in each tank at time t . Write down differential equations 
and initial conditions for <2i and Q 2 that model the flow process. 

(b) Find the equilibrium values Qf and Qf in terms of the concentrations q\ and q 2 . 

(c) Is it possible (by adjusting q\ and q 2 ) to obtain Qf = 60 and Qf = 50 as an equilibrium 
state? 

(d) Describe which equilibrium states are possible for this system for various values of q\ 
and q 2 . 
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7.2 Review of Matrices 


/ a n 

an 

U\n \ 


^21 

an 

^2 n 

(i) 


ami 

ttmn/ 



For both theoretical and computational reasons, it is advisable to bring some of the 
results of matrix algebra 2 * * * * to bear on the initial value problem for a system of linear 
differential equations. For reference purposes this section and the next are devoted 
to a brief summary of the facts that will be needed later. More details can be found 
in any elementary book on linear algebra. We assume, however, that you are familiar 
with determinants and how to evaluate them. 

We designate matrices by boldfaced capitals A, B, C, . . . , occasionally using bold- 
faced Greek capitals 4> , ^ , A matrix A consists of a rectangular array of numbers, 

or elements, arranged in m rows and n columns — that is, 


A = 


We speak of A as anm x n matrix. Although later in the chapter we will often assume 
that the elements of certain matrices are real numbers, in this section we assume that 
the elements of matrices may be complex numbers. The element lying in the zth 
row and j th column is designated by a,y, the first subscript identifying its row and the 
second its column. Sometimes the notation (fly) is used to denote the matrix whose 
generic element is a, ; . 

Associated with each matrix A is the matrix A r , which is known as the transpose 
of A and is obtained from A by interchanging the rows and columns of A. Thus, if 
A = (fly), then A 7 = (fly,-). Also, we will denote by fl,y the complex conjugate of fly, 
and by A the matrix obtained from A by replacing each element fly by its conjugate 
fl,y. The matrix A is called the conjugate of A. It will also be necessary to consider 

— T 

the transpose of the conjugate matrix A . This matrix is called the adjoint of A and 
will be denoted by A* . 

For example, let 


Then 


A 7 = 


A = 


2 - i -5 - 


3 

4 + 3/ 


A = 


2 / 


3 2 + i 

4 - 3 i -5 - 2 i 


X* = 


3 4 - 3 i 

2 + i —5 — 2 i 


2 The properties of matrices were first extensively explored in 1858 in a paper by the English algebraist 

Arthur Cayley (1821-1895), although the word “matrix” was introduced by his good friend James Sylvester 

(1814-1897) in 1850. Cayley did some of his best mathematical work while practicing law from 1849 to 

1863; he then became professor of mathematics at Cambridge, a position he held for the rest of his life. 

After Cayley's groundbreaking work, the development of matrix theory proceeded rapidly, with significant 

contributions by Charles Hermite, Georg Frobenius, and Camille Jordan, among others. 
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We are particularly interested in two somewhat special kinds of matrices: square 
matrices, which have the same number of rows and columns — that is, m = n; and 
vectors (or column vectors), which can be thought of as n x 1 matrices, or matrices 
having only one column. Square matrices having n rows and n columns are said to 
be of order n. We denote (column) vectors by boldfaced lowercase letters, x, y, £, 

The transpose x T of an n x 1 column vector is a 1 x n row vector — that is, the 

matrix consisting of one row whose elements are the same as the elements in the 
corresponding positions of x. 

Properties of Matrices. 

1. Equality. Two m x n matrices A and B are said to be equal if all corresponding 
elements are equal — that is, if a y = b t j for each i and j. 

2. Zero. The symbol 0 will be used to denote the matrix (or vector), each of whose 
elements is zero. 

3. Addition. The sum of two m x n matrices A and B is defined as the matrix 
obtained by adding corresponding elements: 

A + B = (fly) + ( bjj ) = (fly + bij). (2) 

With this definition, it follows that matrix addition is commutative and associative, 
so that 

A + B = B + A, A + (B + C) = (A + B) + C. (3) 

4. Multiplication by a Number. The product of a matrix A by a complex number 
a is defined as follows: 

aA = a(fly) = (afly); (4) 

that is, each element of A is multiplied by a. The distributive laws 

a(A + B) = aA + aB, (a + /3)A — aA + f}A (5) 

are satisfied for this type of multiplication. In particular, the negative of A, denoted 
by —A, is defined by 

—A = (— 1)A. (6) 

5. Subtraction. The difference A — B of two m x n matrices is defined by 

A — B = A + (— B). (7) 

Thus 

A - B = (fly) - (bij) = (fly - bjj), (8) 

which is similar to Eq. (2). 

6. Multiplication. The product AB of two matrices is defined whenever the num- 
ber of columns in the first factor is the same as the number of rows in the second. If 
A and B are m x n and n x r matrices, respectively, then the product C = AB is an 
m x r matrix. The element in the /th row and j th column of C is found by multiplying 
each element of the /th row of A by the corresponding element of the ;th column of 
B and then adding the resulting products. In symbols, 

n 

Cij — J2 a ikbkj- 
k= 1 


( 9 ) 
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By direct calculation, it can be shown that matrix multiplication satisfies the associa- 
tive law 

(AB)C = A(BC) (10) 

and the distributive law 

A(B + C) = AB + AC. (11) 

However, in general, matrix multiplication is not commutative. For both products 
AB and BA to exist and to be of the same size, it is necessary that A and B be square 
matrices of the same order. Even in that case the two products are usually unequal, 
so that, in general, 

AB / BA. (12) 


EXAMPLE 

1 


To illustrate the multiplication of matrices, and also the fact that matrix multiplication is not 
necessarily commutative, consider the matrices 



ll 

-2 

A 



1 

-A 

A = 

0 

2 

-i 

> » = 

1 

-1 

0 


l 2 

1 

i 


l 2 

-1 

ij 


From the definition of multiplication given in Eq. (9) we have 



<2 

-2 + 2 


1 + 2-1 

-1 + 0 + 

AB = 

0 

+ 2-2 


0-2 + 1 

0 + 0- 


l 4 

+ 1 + 2 


2-1-1 

-2 + 0 + 


( 2 

2 




= 

0 

-1 - 


I- 



l 2 

0 - 

A 



Similarly, we find that 









(0 

-3 0\ 




BA = 

1 

-4 2 





l 4 

-5 4) 



Clearly, AB + BA. 


7. Multiplication of Vectors. There are several ways of forming a product of two 
vectors x and y, each with n components. One is a direct extension to n dimensions 
of the familiar dot product from physics and calculus; we denote it by \ T y and write 

n 

x r y = (13) 

!= 1 

The result of Eq. (13) is a (complex) number, and it follows directly from Eq. (13) 
that 

x r y = y r x, x r (y + z) = x r y + x r z, (ax) T y = a(x r y) = x r (ay). (14) 
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There is another vector product that is also defined for any two vectors having the 
same number of components. This product, denoted by (x,y), is called the scalar or 
inner product, and is defined by 

n 

( x >y ) = Z^- ( 15 ) 

i=i 

The scalar product is also a (complex) number, and by comparing Eqs. (13) and (15), 
we see that 

(x,y)=x r y. (16) 

Thus, if all the elements of y are real, then the two products (13) and (15) are identical. 
From Eq. (15) it follows that 

(x,y) = (y,x), (X,y + z) = (x,y) + (x,z), 

(17) 

(ax,y) = a(x,y), (x,ay) = or(x,y). 


Note that even if the vector x has elements with nonzero imaginary parts, the scalar 
product of x with itself yields a nonnegative real number 


n n 

(x,x) = 1-El 2 - 

i = 1 i= 1 


(18) 


The nonnegative quantity (x,x) 1/2 , often denoted by ||x||, is called the length, or 
magnitude, of x. If (x, y) = 0, then the two vectors x and y are said to be orthogonal. 
For example, the unit vectors i, j, k of three-dimensional vector geometry form an 
orthogonal set. On the other hand, if some of the elements of x are not real, then the 
product 

n 


T 

X X 


= E- 


For example, let 




i = 1 



( 1 ) 


^2 — f 

X = 

-2 

. y = 

i 


v 1 + h 


\ 3 / 


(19) 


Then 


x r y = (i)(2 - i) + (-2)(t) + (1 + 0(3) = 4 + 3 i, 
(x,y) = (0(2 + 0 + (— 2)(— 0 + (1 + 0(3) = 2 + 7q 
x T \ = (if + (— 2) 2 + (1 + if = 3 + 2 i, 

(x,x) = (0(-0 + (-2) (-2) + (1 + Od - 0 = 7. 


8. Identity. The multiplicative identity, or simply the identity matrix I, is given by 


(1 0 ■■■ 0 \ 

0 1 ... 0 


\o 0 ... 1/ 


( 20 ) 
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From the definition of matrix multiplication we have 

AI = IA = A (21) 

for any (square) matrix A. Flence the commutative law does hold for square matrices 
if one of the matrices is the identity. 

9. Inverse. The square matrix A is said to be nonsingular or invertible if there is 
another matrix B such that AB = I and BA = I, where I is the identity. If there is 
such a B, it can be shown that there is only one. It is called the multiplicative inverse, 
or simply the inverse, of A, and we write B = A -1 . Then 

AA ^ 1 = A -1 A = I. (22) 


Matrices that do not have an inverse are called singular or noninvertible. 

There are various ways to compute A -1 from A, assuming that it exists. One way 
involves the use of determinants. Associated with each element fly of a given matrix 
is the minor My, which is the determinant of the matrix obtained by deleting the ith 
row and y'th column of the original matrix — that is, the row and column containing 
Ay. Also associated with each element fly is the cofactor C,y defined by the equation 


Cy = (-l)^My. 

If B = A -1 , then it can be shown that the general element by is given by 

l, ~ det A' 


(23) 


(24) 


Although Eq. (24) is not an efficient way 3 to calculate A -1 , it does suggest a con- 
dition that A must satisfy for it to have an inverse. In fact, the condition is both 
necessary and sufficient: A is nonsingular if and only if det A ^ 0. If det A = 0, then 
A is singular. 

Another and usually better way to compute A -1 is by means of elementary row 
operations. There are three such operations: 


1. Interchange of two rows. 

2. Multiplication of a row by a nonzero scalar. 

3. Addition of any multiple of one row to another row. 

The transformation of a matrix by a sequence of elementary row operations is re- 
ferred to as row reduction or Gaussian 4 elimination. Any nonsingular matrix A can 
be transformed into the identity I by a systematic sequence of these operations. It is 
possible to show that if the same sequence of operations is then performed on I, it is 


3 For large n the number of multiplications required to evaluate A -1 by Eq. (24) is proportional to n\. 
If one uses more efficient methods, such as the row reduction procedure described later, the number of 
multiplications is proportional only to n 3 . Even for small values of n (such as n = 4), determinants are 
not an economical tool in calculating inverses, and row reduction methods are preferred. 

4 Carl Friedrich Gauss (1777-1855) spent most of his life at Gottingen and made major contributions to 
many areas of mathematics, including number theory, algebra, non-Euclidean and differential geometry, 
and analysis, as well as to more applied fields such as geodesy, statistics, and celestial mechanics. He is 
generally considered to be among the half-dozen best mathematicians of all time. 
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transformed into A -1 . It is most efficient to perform the sequence of operations on 
both matrices at the same time by forming the augmented matrix A 1 1. The following 
example illustrates the calculation of an inverse matrix in this way. 


EXAMPLE 

2 


Find the inverse of 


/I -1 -lV 


A = 


3 

V 2 


-1 

2 


We begin by forming the augmented matrix A 1 1: 


/I -1 -1 | 


A 1 1 = 


3 -1 

\ 2 2 



The matrix A can be transformed into I by the following sequence of operations, and at the 
same time, I is transformed into A -1 . The result of each step appears below the statement, 

(a) Obtain zeros in the off-diagonal positions in the first column by adding (—3) times the 
first row to the second row and adding (—2) times the first row to the third row. 

/I —1 —1 | 1 0 0\ 

0 2 5 I -3 1 0 

v 0 4 5 | -2 0 1/ 


(b) Obtain a 1 in the diagonal position in the second column by multiplying the second row 

by \- 


n 

-1 

-1 

1 

0 

0 

1 

5 

3 

i 

2 1 

2 

2 

\0 

4 

5 1 

-2 

0 


(c) Obtain zeros in the off-diagonal positions in the second column by adding the second row 
to the first row and adding (—4) times the second row to the third row. 



n 

0 

1 1 

1 

2 

1 

2 

°\ 


0 

1 

f 1 

3 

2 

1 

2 

° 


\0 

0 - 

-5 I 

4 

-2 

i) 

(d) Obtain a 1 in the diagonal position 

in the 

third 

column 

by multiplying the third row by 

H)- 








n 

0 

1 1 

1 

2 

1 

2 

°\ 


0 

1 

1 1 

3 

2 

1 

2 

0 


Vo 

0 

1 1 

4 

5 

2 

5 
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(e) Obtain zeros in the off-diagonal positions in the third column by adding (— |) times the 
third row to the first row and adding (— |) times the third row to the second row. 



I 7_ _J_ X 

in in in 


The last of these matrices is 1 1 A x , a fact that can be verified by direct multiplication with 
the original matrix A. 


This example was made slightly simpler by the fact that the original matrix A had 
a 1 in the upper left corner (an = 1). If this is not the case, then the first step is to 
produce a 1 there by multiplying the first row by 1/an, as long as an / 0. If an = 0, 
then the first row must be interchanged with some other row to bring a nonzero 
element into the upper left position before proceeding. 


Matrix Functions. We sometimes need to consider vectors or matrices whose elements 
are functions of a real variable t. We write 





U n (t) • 

• a\ n it)\ 

x(f) = 


MO = 




\X n (t) / 


ycimi it) 

ttmn (0 / 


(25) 


respectively. 

The matrix A (t) is said to be continuous at t = to or on an interval a < t < /3 if each 
element of A is a continuous function at the given point or on the given interval. 
Similarly, A (t) is said to be differentiable if each of its elements is differentiable, and 
its derivative dA/dt is defined by 


dA ( dajj\ 
dt \ dt ) ’ 


(26) 


that is, each element of dA/dt is the derivative of the corresponding element of A. 
In the same way, the integral of a matrix function is defined as 


f 


A (t) dt = 



(27) 


For example, if 


A(0 


/sin t 

V 1 


t 

cost 


/cos t 

V 0 




A(f) dt = 



then 


A'(t) = 
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PROBLEMS 


Many of the rules of elementary calculus extend easily to matrix functions; in partic- 
ular, 

d dA 

— (CA) = C— — , where C is a constant matrix; (28) 


s (A+B) 


dA dB 

dt dt 


(29) 


d 

dt 


dB 

iAB^A- 



(30) 


In Eqs. (28) and (30), care must be taken in each term to avoid interchanging the 
order of multiplication. The definitions expressed by Eqs. (26) and (27) also apply 
as special cases to vectors. 



2. If A = 


(a) A — 2B 
(c) AB 


(b) 3A + B 
(d) BA 



'-2 

1 

2 \ 


/ 1 

2 

3\ 

If A = 

1 

0 

-3 

and B = 

3 

-1 

-1 


l 2 

-1 

1 / 


l - 2 

1 

o ) 

(a) A r 






(b) 

B r 

(e) A r 

+ B r 





(d) 

(A + 


find 


4. If A = 


3-2; 


1 + i 


2 — i —2 + 3 i 


, find 


(a) A r (b) A (c) A* 




2 


1 2 

1 

-A 

5. If A = 

2 

-1 

2 and B = 

-2 

3 

3 


l 1 

2 


\ 1 

0 

2/ 


verify that 2 (A + B) = 2 A + 2B. 



l 1 

-2 

°\ 

1 2 

1 

-A 

'2 

1 

°\ 

6. If A = 

3 

2 

-1 , B = 

-2 

3 

3 , and C = 

1 

2 

2 


\ 2 

0 

3/ 

\ 1 

0 

2/ 

1° 

1 

-v 


verify that 


(a) (AB)C = A(BC) (b) (A + B) + C = A + (B + C) 

(c) A(B + C) = AB + AC 


372 


Chapter 7. Systems of First Order Linear Equations 


7. Prove each of the following laws of matrix algebra: 


(a) A + B = B + A 
(c) a(A + B) = aA + aB 
(e) A(BC) = (AB)C 



f 2 ) 

(-1 + i\ 

8. Ifx = 

3 i and y = 

2 


\1 - 7 

l 3 -«7 


(a) x r y 
(c) (x,y) 


9. Ifx = 

r- 2, i 

and y = 


(a) x r 3 

^ 2 > 
= y r x 


V + 2i ) 


(b) A + (B + C) = (A + B) + C 
(d) ( a + /S)A = kA + /3A 
(f) A(B + C) = AB + AC 


(b) y r y 
(d) (y,y) 

show that 


(b) (x,y) = (y, x) 


In each of Problems 10 through 19 either compute the inverse of the given matrix, or else show 
that it is singular. 




f 1 

2 

3 \ 


r \ 

1 


12 . 

2 

4 

5 

13 . 

2 

-1 

1 


( 3 

5 

*J 


l 1 

1 

V 



1 1 

2 

A 



<2 

1 

°\ 


14 . 

-2 

1 

8 


15 . 

0 

2 

1 



l 1 

-2 

-V 




0 

V 



/I 

-1 

-A 



( 2 

3 

1 N 


16 . 

2 

1 

° 


17 . 

-1 

2 

1 



l 3 

-2 




l 4 

-1 

-b 

1 


/ 1 

0 

0 

-1\ 


/ 1 

-1 

2 

0\ 


0 

-1 

1 

0 


-1 

2 

-4 

2 

18 . 

-1 

0 

1 

0 

19 . 

1 

0 

1 

3 


l o 

1 

-1 

V 


l- 2 

2 

0 

-V 


20. Prove that if there are two matrices B and C such that AB = I and AC = I, then B = C. 
This shows that a matrix A can have only one inverse. 



^ e l 

2e-< 

e 2 ‘\ 

/ 2e< 

e 1 

3e 2t \ 

21. If A(f) = 

2e* 

e~‘ 

—e 2t I and B(f) = 

—e l 

2e~ l 

e 2t 


\-e< 

3e~‘ 

2e 2t ) 

K Zef 

—e~ l 

-e 2t ) 


(a) A + 3B 
(c) dA/dt 


(b) AB 

(d) f A(f) dt 

Jo 
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In each of Problems 22 through 24 verify that the given vector satisfies the given differential 
equation. 




(1 

1 

A 


( 6 \ 

( °\ 

24. x' = 

2 

1 

-l x, 

x = 

-8 e-' + 2 

1 


1° 

-1 




-1/ 


In each of Problems 25 and 26 verify that the given matrix satisfies the given differential 
equation. 


25. 






r \ 

-1 

4 \ 


f e‘ 

e~ 2 ‘ 

e 3 '\ 

II 

* 

<N 

3 

2 

-1 

#(0 = 

-4e‘ 

—e~ 2 ‘ 

2e 3 ' 


l 2 

1 



v “ e ' 

—e~ 2 ‘ 

e 3l J 


7.3 Systems of Linear Algebraic Equations; Linear Independence, 

Eigenvalues, Eigenvectors 

In this section we review some results from linear algebra that are important for the 
solution of systems of linear differential equations. Some of these results are easily 
proved and others are not; since we are interested simply in summarizing some useful 
information in compact form, we give no indication of proofs in either case. All the 
results in this section depend on some basic facts about the solution of systems of 
linear algebraic equations. 

Systems of Linear Algebraic Equations. A set of n simultaneous linear algebraic equations 
in n variables 


ciuXi + cinxi H b ainX„ = bi, 


Cln\X\ “h Ctn2X2 “b * * * “b O-nnXn — bn 


( 1 ) 


can be written as 


Ax = b, 


( 2 ) 
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where the n x n matrix A and the vector b are given, and the components of x are 
to be determined. If b = 0, the system is said to be homogeneous; otherwise, it is 

nonhomogeneous. 

If the coefficient matrix A is nonsingular — that is, if det A is not zero — then there 
is a unique solution of the system (2). Since A is nonsingular. A -1 exists, and the 
solution can be found by multiplying each side of Eq. (2) on the left by A -1 ; thus 

x = A _1 b. (3) 

In particular, the homogeneous problem Ax = 0, corresponding to b = 0 in Eq. (2), 
has only the trivial solution x = 0. 

On the other hand, if A is singular — that is, if det A is zero — then solutions of 
Eq. (2) either do not exist, or do exist but are not unique. Since A is singular. A -1 
does not exist, so Eq. (3) is no longer valid. The homogeneous system 

Ax = 0 (4) 

has (infinitely many) nonzero solutions in addition to the trivial solution. The situ- 
ation for the nonhomogeneous system (2) is more complicated. This system has no 
solution unless the vector b satisfies a certain further condition. This condition is that 

(b,y) = 0, (5) 

for all vectors y satisfying A*y = 0, where A* is the adjoint of A. If condition (5) is 
met, then the system (2) has (infinitely many) solutions. These solutions are of the 
form 

x = x (0, + S, (6) 

where x (0) is a particular solution of Eq. (2), and £ is the most general solution of the 
homogeneous system (4). Note the resemblance between Eq. (6) and the solution of 
a nonhomogeneous linear differential equation. The proofs of some of the preceding 
statements are outlined in Problems 26 through 30. 

The results in the preceding paragraph are important as a means of classifying the 
solutions of linear systems. However, for solving particular systems, it is generally 
best to use row reduction to transform the system into a much simpler one from which 
the solution(s), if there are any, can be written down easily. To do this efficiently, we 
can form the augmented matrix 


''fill • 

Ain 

* 

\flnl 

Ann 

: 

b n / 


by adjoining the vector b to the coefficient matrix A as an additional column. The 
dashed line replaces the equals sign and is said to partition the augmented matrix. We 
now perform row operations on the augmented matrix so as to transform A into an 
upper triangular matrix — that is, a matrix whose elements below the main diagonal 
are all zero. Once this is done, it is easy to see whether the system has solutions, and 
to find them if it does. Observe that elementary row operations on the augmented 
matrix (7) correspond to legitimate operations on the equations in the system (1). 
The following examples illustrate the process. 
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EXAMPLE 

1 


Solve the system of equations 

x\ — 2x2 + 3x3 = 7, 

—X\ + X 2 — 2 x 3 = — 5, ( 8 ) 

2xi — X 2 — X 3 = 4. 

The augmented matrix for the system (8) is 

/ 1 -2 3 | 7\ 

-1 l-2|-5. (9) 

V 2 -1 -1 I 4/ 

We now perform row operations on the matrix (9) with a view to introducing zeros in the lower 
left part of the matrix. Each step is described and the result recorded below. 

(a) Add the first row to the second row and add (—2) times the first row to the third row. 



(1 

-2 

3 1 

A 


0 

-1 

1 1 

2 


V° 

3 

-7 I 

-10/ 

(b) Multiply the second row by —1. 


(l 

-2 

3 1 

A 


0 

1 

-1 1 

“A 


A 

3 

-7 I 

—10/ 

(c) Add (—3) times the second row to the third row. 



/I 

-2 

3 1 

A 


0 

1 

-1 1 

-2 


V° 

0 

-4 I 

-V 

(d) Divide the third row by —4. 


/1 

-2 

3 1 

A 


0 

1 

-1 1 

-2 


v° 

0 

1 1 

1 ) 


The matrix obtained in this manner corresponds to the system of equations 

Xi — 2x2 + 3x3 = 7, 

x 2 — x 3 = —2, (10) 

X3 = 1, 


which is equivalent to the original system (8). Note that the coefficients in Eqs. (10) form 
a triangular matrix. From the last of Eqs. (10) we have X3 = 1, from the second equation 
x 2 = — 2 + X3 = — 1, and from the first equation xi = 7 + 2x 2 — 3.X3 = 2. Thus we obtain 
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I 


EXAMPLE 

2 


which is the solution of the given system (8). Incidentally, since the solution is unique, we 
conclude that the coefficient matrix is nonsingular. 


Discuss solutions of the system 


x\ — 2x2 + 3x3 = b\, 

—X\ + x 2 - 2 x 3 = b 2 , (11) 

2xi — X 2 + 3x3 = b 3 

for various values of bi,b 2 , and b 3 . 

Observe that the coefficients in the system (11) are the same as those in the system (8) 
except for the coefficient of x 3 in the third equation. The augmented matrix for the system 
(11) is 

/ 1 -2 3 | bA 

-1 1 -2 | b 2 \. (12) 

V 2 -1 3 | b 3 ) 

By performing steps (a), (b), and (c) as in Example 1, we transform the matrix (12) into 

/l —2 3 | b t \ 

0 1 -1 | -b, - b 2 . (13) 

^0 0 0 | b\ + 3b 2 + b 3 J 

The equation corresponding to the third row of the matrix (13) is 

b 2 + 3 b 2 + b 3 = 0; (14) 

thus the system (11) has no solution unless the condition (14) is satisfied by b 2 , b 2 , and b 3 . It 
is possible to show that this condition is just Eq. (5) for the system (11). 

Let us now assume that b 3 = 2,b 2 = l, and b 3 — —5, in which case Eq. (14) is satisfied. Then 
the first two rows of the matrix (13) correspond to the equations 


Xi — 2x2 + 3x3 — 2, 

x 2 x 3 = -3. 


(15) 


To solve the system (15), we can choose one of the unknowns arbitrarily and then solve for 
the other two. If we let x 3 = a, where a is arbitrary, it then follows that 

X 2 = a — 3, 

X\ = 2(a — 3) — 3a + 2 = —a — 4. 


If we write the solution in vector notation, we have 




(16) 


It is easy to verify that the second term on the right side of Eq. (16) is a solution of the nonho- 
mogeneous system (11) and that the first term is the most general solution of the homogeneous 
system corresponding to (11). 
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Row reduction is also useful in solving homogeneous systems and systems in which 
the number of equations is different from the number of unknowns. 

Linear Independence. A set of k vectors x (1) , . . . , \ <k> is said to be linearly dependent if 
there exists a set of (complex) numbers c\, , c k , at least one of which is nonzero, 
such that 

Cl x (1) + • ■ ■ + c k x (k) = 0. (17) 

In other words, x (1) , . . . , x (k) are linearly dependent if there is a linear relation among 
them. On the other hand, if the only set ci, . . . , c k for which Eq. (17) is satisfied is 
Ci = C 2 = • • • = c k = 0, then x (1) , . . . ,x (k) are said to be linearly independent. 

Consider now a set of n vectors, each of which has n components. Let x lt — xf be 
the 7th component of the vector x® , and let X = (Xy). Then Eq. (17) can be written 
as 


H b x^c n ^ 

K^nCl + ■ • • + 4 n) C„/ 


( X\\C\ b X\ n c„\ 


yx^iCi H h x nn c n J 


= Xc = 0. 


(18) 


If detX ^ 0, then the only solution of Eq. (18) is c = 0, but if detX = 0, there are 
nonzero solutions. Thus the set of vectors x (1) , . . . , x (,!) is linearly independent if and 
only if det X ^ 0. 


EXAMPLE 

3 


Determine whether the vectors 



f r \ 

(2\ 

f - 4 \ 

x« = 

2 , x® = 

1 , x® = 

1 



w 



(19) 


are linearly independent or linearly dependent. If they are linearly dependent, find a linear 
relation among them. 

To determine whether x (1) ,x (2) , and x® are linearly dependent, we seek constants ci, C 2 , and 
c 3 such that 

CiX (1) + c 2 x (2) + c 3 x (3) = 0. (20) 


Equation (20) can also be written in the form 


/ 

1 

2 

“ 4 ^ 

( Ci 


2 

1 

1 

Cl 


-1 

3 

-W 

V 3 



( 21 ) 


and solved by means of elementary row operations starting from the augmented matrix 


/ 1 

2 

-4 I 

°\ 


2 

1 

1 1 

0 ■ 

(22) 

v- 1 

3 

-11 | 

o ) 



We proceed as in Examples 1 and 2. 
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(a) Add (—2) times the first row to the second row, and add the first row to the third row. 

/I 2 -4 | 0\ 

0-3 9 | 0 ] 

v 0 5 -15 | 0/ 


(b) Divide the second row by —3; then add (—5) times the second row to the third row. 

( 1 2 —4 | 0\ 

0 1 —3 | 0 j 

0 0 0 | 0 / 


Thus we obtain the equivalent system 


ci + 2 c 2 - 4c 3 = 0, 
c 2 - 3c 3 = 0. 


(23) 


From the second of Eqs. (23) we have c 2 = 3c 3 , and from the first we obtain 
Ci = 4 c 3 — 2 c 2 = — 2c 3 . Thus we have solved for c 3 and c 2 in terms of c 3 , with the latter remain- 
ing arbitrary. If we choose c 3 = — 1 for convenience, then ci = 2 and c 2 = —3. In this case the 
relation (20) becomes 

2x (1) - 3 x< 2) - x (3) = 0, 


and the given vectors are linearly dependent. 

Alternatively, we can compute det (x,y), whose columns are the components of x (1) , x (2) , and 
x (3) , respectively. Thus 


det (.r,y) = 


1 

2 

-1 


2 -4 
1 1 

3 -11 


and direct calculation shows that it is zero. Hence x (1 \ x (2) , and x (3) are linearly dependent. 
However, if the coefficients ci, c 2 , and c 3 are required, we still need to solve Eq. (20) to find 
them. 


Frequently, it is useful to think of the columns (or rows) of a matrix A as vectors. 
These column (or row) vectors are linearly independent if and only if det A / 0. 
Further, if C = AB, then it can be shown that det C = (det A) (det B). Therefore, if 
the columns (or rows) of both A and B are linearly independent, then the columns 
(or rows) of C are also linearly independent. 

Now let us extend the concepts of linear dependence and independence to a set 
of vector functions x (1) (f), . . . ,x (k \t) defined on an interval a < t < p. The vectors 
x (1) (f), . . . , x® (t) are said to be linearly dependent on a < t < p if there exists a set of 
constants ci, . . . , c*, not all of which are zero, such that c 3 x (1) (?) + ■■■ + C£X (/c) (t) = 0 
for all t in the interval. Otherwise, x (1) (f), . . . ,x (<:) (f) are said to be linearly indepen- 
dent. Note that if x (1) (f), . . . , are linearly dependent on an interval, they are 

linearly dependent at each point in the interval. However, if x (1) (t), . . . , \ ,k) (t) are 
linearly independent on an interval, they may or may not be linearly independent at 
each point; they may, in fact, be linearly dependent at each point, but with different 
sets of constants at different points. See Problem 15 for an example. 
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Eigenvalues and Eigenvectors. The equation 

Ax = y (24) 

can be viewed as a linear transformation that maps (or transforms) a given vector x 
into a new vector y. Vectors that are transformed into multiples of themselves are 
important in many applications. 5 To find such vectors, we set y = ax, where X is a 
scalar proportionality factor, and seek solutions of the equation 

Ax = A.x, (25) 

or 

(A - Xl)x = 0. (26) 


The latter equation has nonzero solutions if and only if X is chosen so that 

det(A - /,!) = 0. (27) 


Values of X that satisfy Eq. (27) are called eigenvalues of the matrix A, and the 
nonzero solutions of Eq. (25) or (26) that are obtained by using such a value of X are 
called the eigenvectors corresponding to that eigenvalue. 

If A is a 2 x 2 matrix, then Eq. (26) has the form 


(a n ~X a 12 \ /iA _ /0\ 
\ a 2 i «22 - X) \x 2 J \0j 


(28) 


and Eq. (27) becomes 


(an — X)(a 2 2 — X) — 012^21 = 0. 


(29) 


The following example illustrates how eigenvalues and eigenvectors are found. 


EXAMPLE 

4 


Find the eigenvalues and eigenvectors of the matrix 



The eigenvalues X and eigenvectors x satisfy the equation (A — AI)x = 0, or 

3 — X -1 \ /xA _ /0 
4 -2-itj \x 2 j ~ \0 


The eigenvalues are the roots of the equation 


det(A — A.I) 


3 — X -1 
4 —2 — X 


X 2 — X — 2 = 0. 


Thus the eigenvalues are X\ = 2 and X 2 = — 1. 


(30) 


(31) 


(32) 


5 For example, this problem is encountered in finding the principal axes of stress or strain in an elastic 
body, and in finding the modes of free vibration in a conservative system with a finite number of degrees 
of freedom. 
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To find the eigenvectors, we return to Eq. (31) and replace X by each of the eigenvalues in 
turn. For X = 2 we have 

C ::)(:)-(:) 


(33) 


Flence each row of this vector equation leads to the condition x\ — X 2 = 0, so x\ and x 2 are 
equal but their value is not determined. If x 1 = c, then x 2 = c also, and the eigenvector x (1) is 


IT® - , 


c / 0. 


(34) 


Thus there is an infinite family of eigenvectors, indexed by the arbitrary constant c, correspond- 
ing to the eigenvalue Ai. We will choose a single member of this family as a representative of 
the rest; in this example it seems simplest to let c = 1. Then, instead of Eq. (34), we write 


(35) 


and remember that any nonzero multiple of this vector is also an eigenvector. We say that x (1) 
is the eigenvector corresponding to the eigenvalue A| = 2. 

Now, setting X = — 1 in Eq. (31), we obtain 


4 -1\ Xl 
4 -l) U 


(36) 


Again we obtain a single condition on X\ and x 2 , namely, 4x t — x 2 = 0. Thus the eigenvector 
corresponding to the eigenvalue X 2 = — 1 is 


M) _ 


(37) 


or any nonzero multiple of this vector. 


As Example 4 illustrates, eigenvectors are determined only up to an arbitrary 
nonzero multiplicative constant; if this constant is specified in some way, then the 
eigenvectors are said to be normalized. In Example 4, we chose the constant c so 
that the components of the eigenvectors would be small integers. However, any 
other choice of c is equally valid, although perhaps less convenient. Sometimes it 
is useful to normalize an eigenvector x by choosing the constant so that its length 
||x|| = (x, x)^~ = 1. 

Equation (27) is a polynomial equation of degree n in A, so there are n eigenvalues 
Ai, . . . , X n , some of which may be repeated. If a given eigenvalue appears m times as 
a root of Eq. (27), then that eigenvalue is said to have algebraic multiplicity m. Each 
eigenvalue has at least one associated eigenvector, and an eigenvalue of algebraic 
multiplicity m may have q linearly independent eigenvectors. The number q is called 
the geometric multiplicity of the eigenvalue, and it is possible to show that 

1 < q < m. (38) 

Further, examples demonstrate that q may be any integer in this interval. If each 
eigenvalue of A is simple (has algebraic multiplicity one), then each eigenvalue also 
has geometric multiplicity one. 
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It is possible to show that if and A 2 are two eigenvalues of A, then their corre- 
sponding eigenvectors x (1) and \ a> are linearly independent (Problem 34). This result 
extends to any set Ai , . . . , A.* of distinct eigenvalues: their eigenvectors x (1) , . . . ,x (i) 
are linearly independent. Thus, if each eigenvalue of an n x n matrix is simple, then 
the n eigenvectors of A, one for each eigenvalue, are linearly independent. On the 
other hand, if A has one or more repeated eigenvalues, then there may be fewer than n 
linearly independent eigenvectors associated with A, since for a repeated eigenvalue 
we may have q < m. As we will see in Section 7.8, this fact may lead to complications 
later on in the solution of systems of differential equations. 


EXAMPLE 

5 


Find the eigenvalues and eigenvectors of the matrix 


A = 


/0 

1 

V 1 


1 

0 

1 



The eigenvalues X and eigenvectors x satisfy the equation (A — AI)x = 0, or 


( -x 

1 

A 

'x\ \ 

1 

—X 

A 

x 2 

V 1 

1 

-x) 

w 


The eigenvalues are the roots of the equation 



det(A — AI) 


—X 1 1 

1 —X 1 
1 1 —X 


—X 3 + 3A + 2 = 0. 


(39) 


(40) 


(41) 


The roots of Eq. (41) are Ai = 2, X 2 = — 1, and A 3 = —1. Thus 2 is a simple eigenvalue, and —1 
is an eigenvalue of algebraic multiplicity 2, or a double eigenvalue. 

To find the eigenvector x (1) corresponding to the eigenvalue A 3 , we substitute A. = 2 in 
Eq. (40); this gives the system 


I- 

-2 

1 

A 

^'1 \ 


1 

-2 

1 


\ 

1 

1 

-v 

W 


We can reduce this to the equivalent system 

/2 -1 -l\ /xA 

0 1 -1 U 

\0 0 0 ) [x 3 J 




(42) 


(43) 


by elementary row operations. Solving this system yields the eigenvector 


x 


(i) _ 



For X = — 1, Eqs. (40) reduce immediately to the single equation 


(44) 


X\ + X 2 + X 3 = 0. 


(45) 
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Thus values for two of the quantities X\ , x 2 , X3 can be chosen arbitrarily, and the third is 
determined from Eq. (45). For example, if xi = c\ and X 2 = C 2 , then X 3 = —c\ — c 2 . In vector 
notation we have 



/ 

C1 \ 

/ 

1 \ 

/ 

0 \ 

X = 


C2 = C\ 


0 +C 2 




V 

-Cl -c 2 ) 

V 

-1 ) 

\ 

J 


For example, by choosing c\ = 1 and C 2 = 0, we obtain the eigenvector 



(46) 


(47) 


Any nonzero multiple of x <2) is also an eigenvector, but a second independent eigenvector can 
be found by making another choice of C\ and C 2 — for instance, c\ = 0 and c 2 = 1. In this case 
we obtain 


x 


( 3 ) _ 



(48) 


which is linearly independent of x (2) . Therefore, in this example, two linearly independent 
eigenvectors are associated with the double eigenvalue. 


An important special class of matrices, called self-adjoint or Hermitian matrices, 
are those for which A* = A; that is, = n,y. Hermitian matrices include as a subclass 
real symmetric matrices — that is, matrices that have real elements and for which 
A t = A. The eigenvalues and eigenvectors of Hermitian matrices always have the 
following useful properties: 

1. All eigenvalues are real. 

2. There always exists a full set of n linearly independent eigenvectors, regardless of the 
algebraic multiplicities of the eigenvalues. 

3. If x (1) and x (2) are eigenvectors that correspond to different eigenvalues, then 
(x ( 1 , ,x (2) ) = 0. Thus, if all eigenvalues are simple, then the associated eigenvectors form 
an orthogonal set of vectors. 

4. Corresponding to an eigenvalue of algebraic multiplicity m, it is possible to choose m 
eigenvectors that are mutually orthogonal. Thus the full set of n eigenvectors can always 
be chosen to be orthogonal as well as linearly independent. 

The proofs of statements 1 and 3 above are outlined in Problems 32 and 33. Ex- 
ample 5 involves a real symmetric matrix and illustrates properties 1, 2, and 3, but 
the choice we have made for x (2) and x (3) does not illustrate property 4. However, 
it is always possible to choose an x (2) and x (3) so that (x a> ,x (?y> ) = 0. For instance, in 
Example 5 we could have chosen x (2) as before and x (3) by using ci = 1 and co = —2 
in Eq. (46). In this way we obtain 
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x (2) = 


' 1 ’ 

0 

II 

cn - 

' 1 ’ 

-2 

, -1 , 


, 1 , 


as the eigenvectors associated with the eigenvalue X — — 1. These eigenvectors are 
orthogonal to each other as well as to the eigenvector x (1) that corresponds to the 
eigenvalue X — 2. 


PROBLEMS 


In each of Problems 7 through 1 1 determine whether the members of the given set of vectors 
are linearly independent. If they are linearly dependent, find a linear relation among them. 
The vectors are written as row vectors to save space but may be considered as column vectors; 
that is, the transposes of the given vectors may be used instead of the vectors themselves. 


7. 

X® 

= (1,1,0), 

x® = 

(0,1,1), 

x® = 

(1,0,1) 




8. 

X® 

= (2,1,0), 

X® = 

(0,1,0), 

X® = 

(-1,2,0) 




9. 

X® 

= (1,2, 2, 3), 

X® 

= (-1,0,3, 

1), 

x® = (-2, - 

1,1,0), 

X® 

= (-3, 0,-1, 3) 

10. 

X® 

= (1,2, -1,0), 

x® = (2,3,1, 

-1), 

x® = (— 1, 

0,2,2), 

X® 

= (3,-1, 1,3) 

11. 

X® 

= d,2, -2), 

X® 

= (3,1,0), 

X® 

= (2, —1, 1), 

x® = 

(4,3, 

-2) 


12. Suppose that each of the vectors x® , . . . , x <m) has n components, where n < m. Show that 
x (1) , . . . , x (m> are linearly dependent. 

In each of Problems 13 and 14 determine whether the members of the given set of vectors are 
linearly independent for — oo < t < oo. If they are linearly dependent, find the linear relation 
among them. As in Problems 7 through 11, the vectors are written as row vectors to save space. 

13. x®(f) = (e - ',2e~0, x®(f) = (e~* ,e~*), x®(f) = (3e"',0) 

14. x®(f) = (2 sin f, sin f), x®(f) = (sinf, 2sinf) 

15. Let 

x (1) (0 = ^ , x®(t) = 

Show that x® (t) and x® (f) are linearly dependent at each point in the interval 0 < t < 1. 
Nevertheless, show that x® (f) and x® (t) are linearly independent on 0 < t < 1. 



In each of Problems 1 through 6 either solve the given system of equations, or else show that 
there is no solution. 

1. X\ — Xj, = 0 
3xi + xi + X 3 = 1 
— X\ A X2 T 2x3 — 2 


3. X\ + 2x2 — X 3 = 2 

2xi + X 2 + X 3 = 1 

x\— X 2 + 2 x 3 = — 1 
5. xi — X 3 = 0 

3xi + X 2 + X 3 = 0 

— X\ -|- X 2 T 2 x 3 = 0 


2 . Xi + 2 .X 2 — X 3 = 1 
2 .X 3 -|- X 2 T X 3 = 1 
Xi — X 2 + 2 .X 3 = 1 

4. xi + 2x2 — X3 = 0 

2xi + X2 + X3 = 0 
Xl — X2 + 2x3 = 0 

6 . Xi + 2 .X 2 — X 3 = —2 

— 2xi — 4x 2 + 2x 3 = 4 
2xi + 4x 2 — 2x 3 = —4 
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In each of Problems 16 through 25 find all eigenvalues and eigenvectors of the given matrix. 




tl 

0 

0N \ 


< 3 

2 

2 \ 

22. 

2 

1 

-2 

23. 

1 

4 

1 


l 3 

2 



l - 2 

-4 

-1/ 



f H/9 

-2/9 

8/9\ 


^3 

2 

4 \ 

24. 

-2/9 

2/9 

10/9 

25. 

2 

0 

2 


i, 8/9 

10/9 

5/9/ 


l 4 

2 

V 


Problems 26 through 30 deal with the problem of solving Ax = b when det A = 0. 

26. (a) Suppose that A is a real-valued n x n matrix. Show that (Ax,y) = (x,A r y) for any 
vectors x and y. 

Hint: You may find it simpler to consider first the case n = 2; then extend the result to an 
arbitrary value of n. 

(b) If A is not necessarily real, show that (Ax, y) = (x, A*y) for any vectors x and y. 

(c) If A is Hermitian, show that (Ax, y) = (x, Ay) for any vectors x and y. 

27. Suppose that, for a given matrix A, there is a nonzero vector x such that Ax = 0. Show 
that there is also a nonzero vector y such that A*y = 0. 

28. Suppose that det A = 0 and that Ax = b has solutions. Show that (b, y) = 0, where y is any 
solution of A*y = 0. Verify that this statement is true for the set of equations in Example 2. 
Hint: Use the result of Problem 26(b). 

29. Suppose that det A = 0 and that x = x (0) is a solution of Ax = b. Show that if £ is a solution 
of A£ = 0 and a is any constant, then x = x (0) + is also a solution of Ax = b. 

30. Suppose that det A = 0 and that y is a solution of A*y = 0. Show that if (b,y) = 0 for 
every such y, then Ax = b has solutions. Note that this is the converse of Problem 28; the 
form of the solution is given by Problem 29. 

Hint: What does the relation A*y = 0 say about the rows of A? Again, it may be helpful 
to consider the case n = 2 first. 

31. Prove that 1 = 0 is an eigenvalue of A if and only if A is singular. 

32. In this problem we show that the eigenvalues of a Hermitian matrix A are real. Let x be 
an eigenvector corresponding to the eigenvalue 1. 

(a) Show that (Ax,x) = (x, Ax). Hint: See Problem 26(c). 

(b) Show that l(x, x) = l(x, x). Hint: Recall that Ax = lx. 

(c) Show that 1 = 1; that is, the eigenvalue 1 is real. 
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33. Show that if M and k 2 are eigenvalues of a Hermitian matrix A, and if A 7 ^ then the 
corresponding eigenvectors x (1) and x (2) are orthogonal. 

Hint: Use the results of Problems 26(c) and 32 to show that (Ai — A 2 )(x ( 1 ) ,x (2 ') = 0. 

34. Show that if and X 2 are eigenvalues of any matrix A, and if Ai 7 ^ X 2 , then the corre- 
sponding eigenvectors x (1) and x <2) are linearly independent. 

Hint: Start from C!X (1 > + c 2 x (2) = 0; multiply by A to obtain C|A|X ni + c 2 X 2 x (2) = 0. Then 
show that ci = c 2 = 0 . 


7.4 Basic Theory of Systems of First Order Linear Equations 


The general theory of a system of n first order linear equations 
x\ = pu(t)xi H b pin(t)x n + gi(0, 


x' n = Pnl(t)X\ H b Pnn{t)Xn + gn(0 


(1) 


closely parallels that of a single linear equation of /zth order. The discussion in this 
section therefore follows the same general lines as that in Sections 3.2 and 4.1. To 
discuss the system (1) most effectively, we write it in matrix notation. That is, we 
consider x\ = </>i(t), ...,x n = 4> n (t) to be components of a vector x = <f>(t)\ similarly, 
gi (t), . . . ,g n (t) are components of a vector g(t), and pn(t), . . . , p nn (t ) are elements of 
an n x n matrix P(f). Equation ( 1 ) then takes the form 

x' = P(0x + g(f). (2) 


The use of vectors and matrices not only saves a great deal of space and facilitates 
calculations but also emphasizes the similarity between systems of equations and 
single (scalar) equations. 

A vector x = (j>(t) is said to be a solution of Eq. (2) if its components satisfy the sys- 
tem of equations (1). Throughout this section we assume that P and g are continuous 
on some interval a < t < /l; that is, each of the scalar functions pu, . . . ,p nn , gi,...,g n 
is continuous there. According to Theorem 7.1.2, this is sufficient to guarantee the 
existence of solutions of Eq. (2) on the interval a < t < ft. 

It is convenient to consider first the homogeneous equation 

x' = P(f)x (3) 


obtained from Eq. (2) by setting g (t) — 0. Once the homogeneous equation has been 
solved, there are several methods that can be used to solve the nonhomogeneous 
equation (2); this is taken up in Section 7.9. We use the notation 


^ 11 ( 0 ^ 


^u-lf)^ 

*21 (t) 

, ..., x (k \t) = 

*2 k(t) 

\x n \(t); 


\Xnkit)) 


( 4 ) 


to designate specific solutions of the system (3). Note that Xjj(t) = x^ (t) refers to 
the zth component of the /th solution \ <J> (t). The main facts about the structure 
of solutions of the system (3) are stated in Theorems 7.4.1 to 7.4.4. They closely 
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Theorem 7.4.1 


resemble the corresponding theorems in Sections 3.2 and 4.1; some of the proofs are 
left to you as exercises. 


If the vector functions x (1) and x (2) are solutions of the system (3), then the linear 
combination cix (1) + C 2 X (2) is also a solution for any constants c\ and C 2 - 


This is the principle of superposition; it is proved simply by differentiating 
cix (1) + c 2 x (2) and using the fact that x (1) and x (2) satisfy Eq. (3). By repeated appli- 
cation of Theorem 7.4.1 we reach the conclusion that if x (1) , . . . ,x (i) are solutions of 
Eq. (3), then 

x = cix (1) (f) H b CfcX 1 {k) (f) (5) 


is also a solution for any constants ci, . . . , c*. As an example, it can be verified that 


x (1) (f) 






( 6 ) 


satisfy the equation 



(7) 


According to Theorem 7.4.1, 


x = Cl 



■ c 2 



e 


-t 


= Cl X (1) (t) + c 2 x (2) (0 


( 8 ) 


also satisfies Eq. (7). 

As we indicated previously, by repeatedly applying Theorem 7.4.1, it follows that 
every finite linear combination of solutions of Eq. (3) is also a solution. The question 
that now arises is whether all solutions of Eq. (3) can be found in this way. By analogy 
with previous cases, it is reasonable to expect that for the system (3) of n first order 
equations, it is sufficient to form linear combinations of n properly chosen solutions. 
Therefore let x (1) , . . . ,x ( ' !) be n solutions of the system (3), and consider the matrix 
X(r) whose columns are the vectors x (l) (f), . . . ,x (n \t): 


(x\\{t) 


X(t) = 


\Xnl(t) 


Xl n(f)\ 
Xfi/i (t) } 


(9) 


Recall from Section 7.3 that the columns of X(r) are linearly independent for a given 
value of t if and only if det X / 0 for that value of t. This determinant is called the 
Wronskian of the n solutions x (1) , . . . ,x (M) and is also denoted by W[x (1) , . . . ,x (n) ]; 
that is, 

W[x (1) , . . . ,x w ](0 = detX(f). (10) 

The solutions x (1) , . . . ,x (,I) are then linearly independent at a point if and only if 
W[ x (1) , . . . ,x w ] is not zero there. 
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Theorem 7.4.3 


If the vector functions x (1) , . . . , x (,I) are linearly independent solutions of the system 
(3) for each point in the interval a <t<p, then each solution x = 0(f) of the system 
(3) can be expressed as a linear combination of x (1) , . . . ,x (n) 

4>(t) = cix (1) (f) H b c n x (n \t) (11) 


in exactly one way. 


Before proving Theorem 7.4.2, note that according to Theorem 7.4.1, all expressions 
of the form (11) are solutions of the system (3), while by Theorem 7.4.2 all solutions 
of Eq. (3) can be written in the form (11). If the constants c\,...,c n are thought of as 
arbitrary, then Eq. (11) includes all solutions of the system (3), and it is customary to 
call it the general solution. Any set of solutions x (1) , . . . , x (n> of Eq. (3) that is linearly 
independent at each point in the interval a < t < is said to be a fundamental set 
of solutions for that interval. 

To prove Theorem 7.4.2, we will show, given any solution 0 of Eq. (3), that 

0(f) = cix ( h(f) + f- c„x (n) (f) for suitable values of ci, . . . ,c„. Let t = t 0 be some 

point in the interval a < t < and let § = 0(fo). We now wish to determine whether 

there is any solution of the form x = cix (1) (f) H + c n x (n) (f) that also satisfies the 

same initial condition x(fo) = £. That is, we wish to know whether there are values 
of ci, . . . ,c„ such that 

Cix (1) (f 0 ) H b c„x (n \to) = £, (12) 


or, in scalar form, 


ClXn(fo) + ■ • • + C n Xin(t 0 ) = £l, 


Cl-Lilfk)) + • • • + o) — Hn. 


(13) 


The necessary and sufficient condition that Eqs. (13) possess a unique solution 
ci, . . ,,c„ is precisely the nonvanishing of the determinant of coefficients, which is 
the Wronskian W[ x (1) , . . . ,x (n) ] evaluated at t = to. The hypothesis thatx (1) , . . . ,x (n) 
are linearly independent throughout a < t < ft guarantees that W[x (1) , . . . ,x w ] is 
not zero at f = f 0 , and therefore there is a (unique) solution of Eq. (3) of the form 
x = cix (1) (f) + • • • + c„x (,I) (f) that also satisfies the initial condition (12). By the 
uniqueness part of Theorem 7.1.2, this solution is identical to 0(f), and hence 
0(f) = cix (1) (f) + b c n x (n) {t), as was to be proved. 


If x (1) , . . . , x (n) are solutions of Eq . (3) on the interval a < t < ft , then in this interval 
W[x (1) , . . . ,x (n) ] either is identically zero or else never vanishes. 


The significance of Theorem 7.4.3 lies in the fact that it relieves us of the necessity 
of examining W[x (1) , . . . ,x (n) ] at all points in the interval of interest, and enables us 
to determine whether x (1) , . . . ,x (,,) form a fundamental set of solutions merely by 
evaluating their Wronskian at any convenient point in the interval. 
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Theorem 7.4.3 is proved by first establishing that the Wronskian of x (1) , . . . ,x (n) 
satisfies the differential equation (see Problem 2) 


dW 

dt 


(Pn+P22 H \-Pnn)W. 


(14) 


Hence 


W (t) = c exp 


/ 


[Pll(0 H ’rPnn(t)] dt 


(15) 


Theorem 7.4.4 


where c is an arbitrary constant, and the conclusion of the theorem follows imme- 
diately. The expression for W(t) in Eq. (15) is known as Abel's formula; note the 
similarity to Eq. (27) of Section 3.2. 

Alternatively, Theorem 7.4.3 can be established by showing that if n solutions 
x (1) , . . . ,x <n) of Eq. (3) are linearly dependent at one point t = t 0 , then they must 
be linearly dependent at each point in a < t < (see Problem 8). Consequently, if 
x (1) , . . . , \ (n> are linearly independent at one point, they must be linearly independent 
at each point in the interval. 

The next theorei 
set of solutions. 


Let 


e (1) = 


ites that the system 

/n 


/°\ 

0 


1 

0 

, e (2) = 

0 

w 


\o) 


P («) _ 

, • • • 9 C 


/0\ 

0 

0 

w 


further let x (1) , . . . ,x (n) be the solutions of the system (3) that satisfy the initial 
conditions 

x (1) (f 0 ) = e (1) , ..., x<">«b) = e<">, (16) 

respectively, where to is any point in a < t < j3. Then x (1) , , . . ,x (n) form a funda- 
mental set of solutions of the system (3). 


To prove this theorem, note that the existence and uniqueness of the solutions 
x (1 ), . . . ,x (n) mentioned in Theorem 7.4.4 are ensured by Theorem 7.1.2. It is not 
hard to see that the Wronskian of these solutions is equal to 1 when t = to ; therefore 
x (1 \ . . . , x (n) are a fundamental set of solutions. 

Once one fundamental set of solutions has been found, other sets can be gener- 
ated by forming (independent) linear combinations of the first set. For theoretical 
purposes the set given by Theorem 7.4.4 is usually the simplest. 

To summarize: 

1. Any set of n linearly independent solutions of the system x' = P(t)x constitutes a funda- 
mental set of solutions. 

2. Under the conditions given in this section, such fundamental sets always exist. 

3. Every solution of the system x' = P(t)x can be represented as a linear combination of any 
fundamental set of solutions. 
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PROBLEMS 


1. Prove the statement following Theorem 7.4.1 for an arbitrary value of the integer k. 

2. In this problem we outline a proof of Theorem 7.4.3 in the case n = 2. Let x (1) and x (2) 
be solutions of Eq. (3) for a < t < ft, and let W be the Wronskian of x (1) and x (2) . 

(a) Show that 


dx^ 

dxf* 


X® 

xf 

dt 

dt 

+ 

dx 

dxi 2) 

X? 

x, f 



l 


dt 

dt 


(b) Using Eq. (3), show that 


dW 

dt 


= (Pn + P22W. 


(c) Find W (t) by solving the differential equation obtained in part (b). Use this expression 
to obtain the conclusion stated in Theorem 7.4.3. 

(d) Prove Theorem 7.4.3 for an arbitrary value of n by generalizing the procedure of parts 
(a), (b), and (c). 

3. Show that the Wronskians of two fundamental sets of solutions of the system (3) can differ 
at most by a multiplicative constant. 

Hint: Use Eq. (15). 

4. If x\ = y and x 2 = y', then the second order equation 


/' +P(t)/ + q(t)y = 0 


(i) 


corresponds to the system 


A = x 2 , 

x' 2 = —q{t)x 1 — p{t)x 2 - (ii) 


Show that if x (1) and x (2) are a fundamental set of solutions of Eqs. (ii), and if y (>) and y <2) 
are a fundamental set of solutions of Eq. (i), then W[y (1) , y (2) ] = cW[x (1) , x (2) ], where c is a 
nonzero constant. 

Hint: y (1, (f) and y {T> (t) must be linear combinations of xn(f) and x\ 2 (t). 

5. Show that the general solution of x' = P(f)x + g(f) is the sum of any particular solution x <p> 
of this equation and the general solution x (c) of the corresponding homogeneous equation. 

6. Consider the vectors x (1, (0 = and x a) (t) = 

(a) Compute the Wronskian of x (1> and x (2) . 

(b) In what intervals are x (1) and x (2) linearly independent? 

(c) What conclusion can be drawn about the coefficients in the system of homogeneous 
differential equations satisfied by x (1) and x (2) ? 

(d) Find this system of equations and verify the conclusions of part (c). 



7. Consider the vectors x (1) (t) 
in Problem 6. 



and x <2> (t) 



, and answer the same questions as 
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The following two problems indicate an alternative derivation of Theorem 7.4.2. 

8. Let x (1) , . . . ,x (m) be solutions of x' = P(r)x on the interval a < t < f). Assume that P is 
continuous and let to be an arbitrary point in the given interval. Show that x (1) , . . . ,x (m) 
are linearly dependent for a < t < fi if (and only if) x (1) (f 0 ), . . . ,x (m, (f 0 ) are linearly de- 
pendent. In other words x (1) , . . . , x (m) are linearly dependent on the interval (a, fi) if they 
are linearly dependent at any point in it. 

Hint: There are constants such that Cix (1) (r 0 ) + • • • + c m x <m) (t 0 ) = 0. Let 

z (?) = Cix (1) (f) + • • • + c m x (m) (f), and use the uniqueness theorem to show that z (t) = 0 
for each t in a < t < fi. 

9. Let x (1) , . . . , x w be linearly independent solutions of x' = P(f)x, where P is continuous on 

a < t < fi. 

(a) Show that any solution x = z (t) can be written in the form 

z (f) = CiX (1) (t) -I 1- c„x (H) (0 


for suitable constants c\,...,c n . 

Hint: Use the result of Problem 12 of Section 7.3, and also Problem 8 above. 

(b) Show that the expression for the solution z (t) in part (a) is unique; that is, if 
z (t) = Aqx (1) (0 + • • • + k„x m (t), then k\ = Ci, . . . , k n = c n . 

Hint: Show that (Ay — ci)x (1) (?) + ••• + ( k n — c„)x (n> ( t ) = 0 for each tina < t < fi, and use 
the linear independence of x (1) , . . . ,x ( " ) . 


7.5 Homogeneous Linear Systems with Constant Coefficients 


We will concentrate most of our attention on systems of homogeneous linear equa- 
tions with constant coefficients — that is, systems of the form 

x' = Ax, (1) 


where A is a constant n x n matrix. Unless stated otherwise, we will assume further 
that all the elements of A are real (rather than complex) numbers. 

If n = 1, then the system reduces to a single first order equation 


dx 

— = ax, 
dt 


( 2 ) 


whose solution is x = ce at . In Section 2.5 we noted that x = 0 is the only equilibrium 
solution if a ^ 0. Other solutions approach x — 0 if a < 0, and in this case we say 
that x = 0 is an asymptotically stable equilibrium solution. On the other hand, if 
a > 0, then x — 0 is unstable, since other solutions depart from it. For systems of n 
equations, the situation is somewhat analogous but more complicated. Equilibrium 
solutions are found by solving Ax = 0. We usually assume that det A ^0, so x = 0 
is the only equilibrium solution. An important question is whether other solutions 
approach this equilibrium solution or depart from it as f increases; in other words, is 
x = 0 asymptotically stable or unstable? Or are there still other possibilities? 

The case n = 2 is particularly important and lends itself to visualization in the 
xiX 2 -plane, called the phase plane. By evaluating Ax at a large number of points 
and plotting the resulting vectors, we obtain a direction field of tangent vectors to 
solutions of the system of differential equations. A qualitative understanding of the 
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behavior of solutions can usually be gained from a direction field. More precise 
information results from including in the plot some solution curves, or trajectories. 
A plot that shows a representative sample of trajectories for a given system is called 
a phase portrait. Examples of direction fields and phase portraits appear later in this 
section. 

To find solutions of the system (1), we will try to generalize the treatment of second 
order linear equations in Section 3.1. Thus we assume that a solution will involve an 
exponential function e rt . Further, solutions of Eq. (1) are vectors, so let us multiply 
e rt by a constant vector £. Thus we seek solutions of Eq. (1) of the form 

x = §e rt , (3) 

where the exponent r and the vector § are to be determined. Substituting from 
Eq. (3) for x in the system (1) gives 

rl-e r ‘ = A i-e rt . 

Upon canceling the nonzero scalar factor e rt , we obtain A£ = r£, or 

(A - rl)$ = 0, (4) 

where I is the n x n identity matrix. Thus, to solve the system of differential equations 
(1), we must solve the system of algebraic equations (4). This latter problem is 
precisely the one that determines the eigenvalues and eigenvectors of the matrix A. 
Therefore the vector x given by Eq. (3) is a solution of Eq. (1), provided that r is an 
eigenvalue and § an associated eigenvector of the coefficient matrix A. 

The following two examples illustrate the solution procedure in the case of 
2x2 coefficient matrices. We also show how to construct the corresponding phase 
portraits. Later in the section we return to a further discussion of the general n x n 
system. 


Consider the system 

EXAMPLE 

1 



(5) 


Plot a direction field and determine the qualitative behavior of solutions. Then find the general 
solution and draw several trajectories. 

A direction field for this system is shown in Figure 7.5.1. From this figure it is easy to see 
that a typical solution departs from the neighborhood of the origin and ultimately has a slope 
of approximately 2 in either the first or the third quadrant. 

To find solutions explicitly, we assume that x = §e" and substitute for x in Eq. (5). We are 
led to the system of algebraic equations 


1 -r 
4 


1 

1 — r 



( 6 ) 


Equations (6) have a nontrivial solution if and only if the determinant of coefficients is zero. 
Thus allowable values of r are found from the equation 


= (1 - r) 2 - 4 


= r 2 - 2r - 3 = 0. 


1 — r 
4 


(7) 
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FIGURE 7.5.1 Direction field for the system (5). 


Equation (7) has the roots r! = 3 and r 2 = —1; these are the eigenvalues of the coefficient 
matrix in Eq. (5). If r = 3, then the system (6) reduces to the single equation 


— 2$i + §2 — 0 . 

Thus f 2 = 2fi, and the eigenvector corresponding to r\ = 3 can be taken as 

Similarly, corresponding to r 2 = —1, we find that § 2 = — 2£ 1; so the eigenvector is 

The corresponding solutions of the differential equation are 


x (1 >(t) = Q e 3 '. 

The Wronskian of these solutions is 

W[x (1) ,x (2) ](f) = 


x (2) (rt = 


e Jl e 1 
2e 3 ' -2e-< 


= — 4e 21 , 


( 8 ) 


( 9 ) 


( 10 ) 


( 11 ) 


( 12 ) 


which is never zero. Hence the solutions x (1) and x (2) form a fundamental set, and the general 
solution of the system (5) is 


x = ClX fl) (0 + c 2 x (/, (0 


( 2 ), 


= Cl Ur 3 ' + C2 - 2 e " 


(13) 


where Ci and c 2 are arbitrary constants. 
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To visualize the solution (13), it is helpful to consider its graph in the xiX 2 -plane for various 
values of the constants c\ and C 2 . We start with x = cix (1) (f) or, in scalar form, 

X\ = cie 3 ', x 2 = 2c 1 e 3t . 

By eliminating t between these two equations, we see that this solution lies on the straight line 
x 2 = 2x\\ see Figure 7.5.2 a. This is the line through the origin in the direction of the eigenvector 
£ (1) . If we look on the solution as the trajectory of a moving particle, then the particle is in the 
first quadrant when c\ > 0 and in the third quadrant when ci < 0. In either case the particle 
departs from the origin as t increases. Next consider x = c 2 x <2) (t), or 

x\ = c 2 e~‘, x 2 = —2c 2 e~'. 

This solution lies on the line x 2 = — 2xj, whose direction is determined by the eigenvector £ <2) . 
The solution is in the fourth quadrant when C 2 > 0 and in the second quadrant when C 2 < 0, as 
shown in Figure 1.5.2a. In both cases the particle moves toward the origin as t increases. The 
solution (13) is a combination of x a) (0 and x (2) (f). For large t the term cix (1) (t) is dominant 
and the term c 2 x (2 ^ ( t ) becomes negligible. Thus all solutions for which c i ^0 are asymptotic to 
the line x 2 = 2xi as t — > oo. Similarly, all solutions for which c 2 ^ 0 are asymptotic to the line 
x 2 = —2x\ as t -*■ —oo. The graphs of several solutions are shown in Figure 1.5.2a. The pattern 
of trajectories in this figure is typical of all 2 x 2 systems x' = Ax for which the eigenvalues 
are real and of opposite signs. The origin is called a saddle point in this case. Saddle points 
are always unstable because almost all trajectories depart from them as t increases. 

In the preceding paragraph we have described how to draw by hand a qualitatively correct 
sketch of the trajectories of a system such as Eq. (5), once the eigenvalues and eigenvectors 
have been determined. However, to produce a detailed and accurate drawing, such as Figure 
1.5.2a and other figures that appear later in this chapter, a computer is extremely helpful, if 
not indispensable. 

As an alternative to Figure 1.5.2a , you can also plot x\ or x 2 as a function of t; some typical 
plots of x\ versus t are shown in Figure 1.5.2b , and those of x 2 versus t are similar. For certain 
initial conditions it follows that Ci = OinEq. (13), so that xi = c 2 e~‘ andxi —>■ Oast — ► oo. One 
such graph is shown in Figure 1.5.2b. corresponding to a trajectory that approaches the origin 



FIGURE 7.5.2 (a) Trajectories of the system (5); the origin is a saddle point. ( b ) Plots of x 2 
versus t for the system (5). 
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in Figure 7.5.2 a. For most initial conditions, however, c\ ^ 0 and x\ is given by x\ = cie 3 ' + 
C 2 e~'. Then the presence of the positive exponential term causes x\ to grow exponentially in 
magnitude as t increases. Several graphs of this type are shown in Figure 7.5.2 b, corresponding 
to trajectories that depart from the neighborhood of the origin in Figure 1.5.2a. It is important 
to understand the relation between parts (a) and (b) of Figure 7.5.2 and other similar figures 
that appear later, since you may want to visualize solutions either in the xj^-plane or as 
functions of the independent variable t. 


Consider the system 

EXAMPLE 

2 




(14) 


Draw a direction field for this system; then find its general solution and plot several trajectories 
in the phase plane. 

The direction field for the system ( 14) in Figure 7.5.3 shows clearly that all solutions approach 
the origin. To find the solutions, we assume that x = £e"; then we obtain the algebraic system 



The eigenvalues satisfy 


(-3 - r )(- 2 - r) - 2 = r 2 + 5r + 4 

= (r + l)(r + 4) = 0, 


so r\ = — 1 and ri = —4. For r = —1, Eq. (15) becomes 






(15) 


(16) 


(17) 


//////////z 
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, \ \ A | /. / 

. \ \ \ / / 

» M 1 / ^ 
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FIGURE 7.5.3 Direction field for the system (14). 
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Hence £2 = \/2£i, and the eigenvector £ (1) corresponding to the eigenvalue r 1 = —1 can be 
taken as 


s (1) = 



(18) 


Similarly, corresponding to the eigenvalue r 2 = —4 we have £1 = — y/2 £ 2 , so the eigenvector is 


. ( 2 ) / ^ 


£ = 


Thus a fundamental set of solutions of the system (14) is 


1 

V2 


x a \t) = I ' 1 e~‘, x (2) (0 = 


-V2 


and the general solution is 


— r,v (1) CM 4- — , 


V2 


-V2\ 


(19) 


( 20 ) 


( 21 ) 


Graphs of the solution (21) for several values of c\ and C 2 are shown in Figure 7.5.4n. 
The solution x (1 >(f) approaches the origin along the line x 2 = s/2x\, and the solution x (2) (t) 
approaches the origin along the line x\ = — V 2 X 2 . The directions of these lines are determined 
by the eigenvectors £ (1 > and £ (2) , respectively. In general, we have a combination of these 
two fundamental solutions. As t — > 00 , the solution x (2) (f) is negligible compared to x (1) (t). 
Thus, unless ci = 0, the solution (21) approaches the origin tangent to the line x 2 = V2x\ . 
The pattern of trajectories shown in Figure 7.5.4« is typical of all 2 x 2 systems x' = Ax for 
which the eigenvalues are real, different, and of the same sign. The origin is called a node 
for such a system. If the eigenvalues were positive rather than negative, then the trajectories 
would be similar but traversed in the outward direction. Nodes are asymptotically stable if 
the eigenvalues are negative and unstable if the eigenvalues are positive. 



FIGURE 7.5.4 (a) Trajectories of the system (14); the origin is a node. ( b ) Plots of X\ versus 
t for the system (14). 
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Although Figure 7.5.4u was computer-generated, a qualitatively correct sketch of the tra- 
jectories can be drawn quickly by hand on the basis of a knowledge of the eigenvalues and 
eigenvectors. 

Some typical plots of x\ versus t are shown in Figure 7.5.4 b. Observe that each of the 
graphs approaches the r-axis asymptotically as t increases, corresponding to a trajectory that 
approaches the origin in Figure 7.5.2 a. The behavior of x? as a function of t is similar. 


The two preceding examples illustrate the two main cases for 2x2 systems having 
eigenvalues that are real and different: the eigenvalues have either opposite signs 
(Example 1) or the same sign (Example 2). The other possibility is that zero is an 
eigenvalue, but in this case it follows that det A = 0, which violates the assumption 
made at the beginning of this section. However, see Problems 7 and 8. 

Returning to the general system (1), we proceed as in the examples. To find solu- 
tions of the differential equation (1), we must find the eigenvalues and eigenvectors 
of A from the associated algebraic system (4). The eigenvalues r\, . . . , r n (which need 
not all be different) are roots of the nth degree polynomial equation 

det (A - r\) = 0. (22) 

The nature of the eigenvalues and the corresponding eigenvectors determines the 
nature of the general solution of the system (1). If we assume that A is a real- valued 
matrix, there are three possibilities for the eigenvalues of A: 

1. All eigenvalues are real and different from each other. 

2. Some eigenvalues occur in complex conjugate pairs. 

3. Some eigenvalues are repeated. 

If the eigenvalues are all real and different, as in the two preceding examples, 
then associated with each eigenvalue r, is a real eigenvector § and the n eigen- 
vectors (■ (l) , . . . , § (,l) are linearly independent. The corresponding solutions of the 
differential system (1) are 

x a \t) = § a) e nt , ..., x (n) (0 = (23) 


To show that these solutions form a fundamental set, we evaluate their Wronskian: 


W[x (1) ,...,x ( ' !) ](0 


£ fV 1 ' 


I fW 




_ e (n+-+r„)t 


h (n) 

5 1 


(24) 


t (i) ... t («) 

n ? n 

First, we observe that the exponential function is never zero. Next, since the eigen- 
vectors § (1) , . . . ,(• are linearly independent, the determinant in the last term of 
Eq. (24) is nonzero. As a consequence, the Wronskian W[x (1) , . . . ,x (n) ](t) is never 
zero; hence x (1) , . . . , x (n) form a fundamental set of solutions. Thus the general solu- 
tion of Eq. (1) is 


x = ci$ m e rit + • • • + c n % (n V” r . 


(25) 
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If A is real and symmetric (a special case of Hermitian matrices), recall from 
Section 7.3 that all the eigenvalues r \ , . . . , r n must be real. Further, even if some of the 
eigenvalues are repeated, there is always a full set of n eigenvectors § (1) , . . . , £ (n) that 
are linearly independent (in fact, orthogonal). Flence the corresponding solutions of 
the differential system (1) given by Eq. (23) again form a fundamental set of solutions, 
and the general solution is again given by Eq. (25). The following example illustrates 
this case. 


EXAMPLE 

3 


Find the general solution of 




(26) 


Observe that the coefficient matrix is real and symmetric. The eigenvalues and eigenvectors 
of this matrix were found in Example 5 of Section 7.3: 


n =2, S«=(l|; 


(27) 








r 2 = -1, 

r 3 = -1; 

S (2, = 

u) 

, s (3, = 

Li) 


(28) 


Hence a fundamental set of solutions of Eq. (26) is 



m 


( A 


f °\ 

x (1 >(f) = 

1 e 2 ', 

x®(f) = 

0 e-‘. 

x <3) (f) = 



W 






and the general solution is 



A \ 

( 

f 0) 

X = Cl 

1 e 2 ' + c 2 

Ole ' + c 3 



w 


-1/ 


(29) 


(30) 


This example illustrates the fact that even though an eigenvalue (r = —1) has algebraic mul- 
tiplicity 2, it may still be possible to find two linearly independent eigenvectors £ (2) and § (3) 
and, as a consequence, to construct the general solution (30). 

The behavior of the solution (30) depends critically on the initial conditions. For large t the 
first term on the right side of Eq. (30) is the dominant one; therefore, if c\ ^ 0, all components 
of x become unbounded as t — > oo. On the other hand, for certain initial points c\ will be 
zero. In this case, the solution involves only the negative exponential terms, and x-> 0 as 
t — > oo. The initial points that cause c\ to be zero are precisely those that lie in the plane 
determined by the eigenvectors £ (2) and ^ (3) corresponding to the two negative eigenvalues. 
Thus solutions that start in this plane approach the origin as t — > oo, while all other solutions 
become unbounded. 


If some of the eigenvalues occur in complex conjugate pairs, then there are still n 
linearly independent solutions of the form (23), provided that all the eigenvalues are 
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different. Of course, the solutions arising from complex eigenvalues are complex- 
valued. However, as in Section 3.3, it is possible to obtain a full set of real-valued 
solutions. This is discussed in Section 7.6. 

More serious difficulties can occur if an eigenvalue is repeated. In this event the 
number of corresponding linearly independent eigenvectors may be smaller than the 
algebraic multiplicity of the eigenvalue. If so, the number of linearly independent 
solutions of the form will be smaller than n. To construct a fundamental set 
of solutions, it is then necessary to seek additional solutions of another form. The 
situation is somewhat analogous to that for an nth order linear equation with constant 
coefficients; a repeated root of the characteristic equation gave rise to solutions of 
the form e rt , te rt , t 2 e rt , The case of repeated eigenvalues is treated in Section 7.8. 

Finally, if A is complex, then complex eigenvalues need not occur in conjugate 
pairs, and the eigenvectors are normally complex-valued even though the associated 
eigenvalue may be real. The solutions of the differential equation (1) are still of the 
form (23), provided that the eigenvalues are distinct, but in general all the solutions 
are complex-valued. 


PROBLEMS In each of Problems 1 through 6: 

(a) Find the general solution of the given system of equations and describe the behavior of 
the solution as t — > oo. 

(b) Draw a direction field and plot a few trajectories of the system. 


41 

41 

41 



& : ') x 

* <•*-(: 

41 &*-(j |). 


In each of Problems 7 and 8: 

(a) Find the general solution of the given system of equations. 

(b) Draw a direction field and a few of the trajectories. In each of these problems the coef- 
ficient matrix has a zero eigenvalue. As a result, the pattern of trajectories is different from 
those in the examples in the text. 


41 





X 


In each of Problems 9 through 14 find the general solution of the given system of equations. 


9. x' = 


1 


10. x' = 


-1 


2 + / 
-1 — i 



(1 

1 

2 \ 


^3 

2 

4 \ 

11 . x' = 

1 

2 

! X 

12 . x' = 

2 

0 

2 


l 2 

1 



l 4 

2 

3 J 
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( 1 

1 

A 


r \ 

-1 

4 \ 

13. x' = 

2 

1 

- 1 X 

14. x' = 

3 

2 

— 1 ] * 


l - 8 

-5 

-3 ) 


l 2 

1 



In each of Problems 15 through 18 solve the given initial value problem. Describe the behavior 
of the solution as t — >■ oo. 


15. 






17. x' = 

f l 1 2\ 

0 2 2 

X 

O 

II 

(0) 

18. x' = 

/ 0 0 -1\ 

2 0 0 

* 

3 

II 

P) 


l- 1 1 V 


Iv 


l- 1 2 V 


V 5 / 


19. The system tx' = Ax is analogous to the second order Euler equation (Section 5.4). Assum- 
ing that x = where £ is a constant vector, show that £ and r must satisfy (A — rl)% = 0 
in order to obtain nontrivial solutions of the given differential equation. 


Referring to Problem 19, solve the given system of equations in each of Problems 20 through 
23. Assume that t > 0. 


20 . 



22 . 




21. tx' = 


23. tx' = 



x 



X 


In each of Problems 24 through 27 the eigenvalues and eigenvectors of a matrix A are given. 
Consider the corresponding system x' = Ax. 

(a) Sketch a phase portrait of the system. 

(b) Sketch the trajectory passing through the initial point (2, 3). 

(c) For the trajectory in part (b) sketch the graphs of x\ versus t and of x 2 versus t on the same 
set of axes. 


24. 

25. 

26. 

27. 


,, = i. !»> = (-'); , = T> = (‘) 

,-i. »»-(-;); , = 2. !»-(;) 

<-, = 1 . !<» = (*); -2 = 2 , !•» = (_*) 


28. Consider a 2 x 2 system x' = Ax. If we assume that r 2 , the general solution is 
x = ci§ (1) e ri ' + C 2 I ( 2 V 2 ', provided that § (1) and £ (2) are linearly independent. In this 
problem we establish the linear independence of £ (1) and £ (2) by assuming that they are 
linearly dependent, and then showing that this leads to a contradiction. 

(a) Note that § (1) satisfies the matrix equation (A — (1) = 0; similarly, note that 

(A - r 2 1)$ (2) = 0. 


400 


Chapter 7. Systems of First Order Linear Equations 


(b) Show that (A - r 2 1)£ (1) = (n - r 2 )$ (1) . 

(c) Suppose that § (1) and £ (2) are linearly dependent. Then c (1) + c 2 £ (2) = 0 and at least 
one of ci and c 2 is not zero; suppose that ci ^ 0. Show that (A — r 2 I)(ci£ (1) + c 2 £ <2) ) = 0 , 
and also show that (A — r 2 I)(ci£ (1) + c 2 £ (2) ) = citri — r 2 )£ (1 )_ Hence ci = 0, which is a 
contradiction. Therefore £ (1) and § (2) are linearly independent. 

(d) Modify the argument of part (c) if we assume that c 2 ^ 0. 

(e) Carry out a similar argument for the case in which the order n is equal to 3; note that 
the procedure can be extended to an arbitrary value of n. 

29. Consider the equation 

ay" + by' + cy = 0, (i) 

where a, b , and c are constants with a ^ 0. In Chapter 3 it was shown that the general 
solution depended on the roots of the characteristic equation 

ar 2 + br + c = 0. (ii) 

(a) Transform Eq. (i) into a system of first order equations by letting x 1 — y, x 2 = y' . Find 
the system of equations x' = Ax satisfied by x = 

(b) Find the equation that determines the eigenvalues of the coefficient matrix A in part 

(a). Note that this equation is just the characteristic equation (ii) of Eq. (i). 

30. The two-tank system of Problem 22 in Section 7.1 leads to the initial value problem 



where Xi and x 2 are the deviations of the salt levels <2i and Q 2 from their respective 
equilibria. 

(a) Find the solution of the given initial value problem. 

(b) Plot x\ versus t and x 2 versus t on the same set of axes. 

(c) Find the smallest time T such that |xi(f)| < 0.5 and \x 2 (t)\ < 0.5 for all t > T. 

31. Consider the system 



(a) Solve the system for a = 0.5. What are the eigenvalues of the coefficient matrix? 
Classify the equilibrium point at the origin as to type. 

(b) Solve the system for a = 2. What are the eigenvalues of the coefficient matrix? Clas- 
sify the equilibrium point at the origin as to type. 

(c) In parts (a) and (b) solutions of the system exhibit two quite different types of behav- 
ior. Find the eigenvalues of the coefficient matrix in terms of a and determine the value 
of a between 0.5 and 2 where the transition from one type of behavior to the other occurs. 

Electric Circuits. Problems 32 and 33 are concerned with the electric circuit described by the 
system of differential equations in Problem 21 of Section 7.1: 

/Ri _I \ 

L L 

1 1 

V c ~~cr 2 ) 



- / 
dt \ V 
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32. (a) Find the general solution of Eq. (i) if 7?i = 1 £2, R 2 = | £2, L = 2 H, and C = | F. 

(b) Show that 7 (r J — >• 0 and F (f) — ^ 0 as t — >■ 00 regardless of the initial values 7(0) and 

V(0). 

33. Consider the preceding system of differential equations (i). 

(a) Find a condition on 7?i, R 2 , C, and L that must be satisfied if the eigenvalues of the 
coefficient matrix are to be real and different. 

(b) If the condition found in part (a) is satisfied, show that both eigenvalues are negative. 
Then show that 7(f) — > 0 and V (7) -> 0 as t — > 00 regardless of the initial conditions. 

(c) If the condition found in part (a) is not satisfied, then the eigenvalues are either 
complex or repeated. Do you think that 7(f) -> 0 and V(t) — > 0 as t — > 00 in these cases 
as well? 

Hint: In part (c) one approach is to change the system (i) into a single second order 
equation. We also discuss complex and repeated eigenvalues in Sections 7.6 and 7.8. 


7.6 Complex Eigenvalues 


In this section we consider again a system of n linear homogeneous equations with 
constant coefficients 

x' = Ax, (1) 

where the coefficient matrix A is real-valued. If we seek solutions of the form x = £e rt , 
then it follows, as in Section 7.5, that r must be an eigenvalue and § a corresponding 
eigenvector of the coefficient matrix A. Recall that the eigenvalues ri, . . . ,r„ of A 
are the roots of the equation 

det(A - rl) = 0, (2) 

and that the corresponding eigenvectors satisfy 

(A - rl)$ = 0 . (3) 

If A is real, then the coefficients in the polynomial equation (2) for r are real, and 
any complex eigenvalues must occur in conjugate pairs. For example, if r\ = X + ip, 
where X and p are real, is an eigenvalue of A, then so is ri_ = X — ip. To explore the 
effect of complex eigenvalues, we begin with an example. 


EXAMPLE 

1 


Find a fundamental set of real-valued solutions of the system 



and display them graphically. 

A direction field for the system (4) is shown in Figure 7.6.1. This plot suggests that the 
trajectories in the phase plane spiral clockwise toward the origin. 

To find a fundamental set of solutions, we assume that 


x = £e" 


(5) 
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FIGURE 7.6.1 A direction field for the system (4). 


and obtain the set of linear algebraic equations 


1 


-1 -\-r)\h 


A 


( 6 ) 


for the eigenvalues and eigenvectors of A. The characteristic equation is 


~\~ T 1 
-1 -\-r 


= r + r + I = 0 ; 


(7) 


therefore the eigenvalues are r\ = — \ + i and r 2 = — \ — i. From Eq. (6) a straightforward 
calculation shows that the corresponding eigenvectors are 


$ (1) = 


| (2) = 


( 8 ) 


Observe that the eigenvectors § (1) and £ G) are also complex conjugates. Hence a fundamental 
set of solutions of the system (4) is 


x w (t) = | . | e 


(-1/2+0* 


X IZ) « = I I e 


(- 1 / 2 - 0 * 


(9) 


To obtain a set of real-valued solutions, we can proceed as in Section 3.3 and find the real 
and imaginary parts of either x (1) or x (2) . In fact, 


... /1\ _ / e ,/2 cos A (e ,/2 s i n A 

x (?) = I i e (C0Sf + /sinf) = ( — e _( / 2 sin r ) \ e~ ,/2 cos t ) ' (10) 


Hence 


u(?) = e 


- on- 


cost 
— sin? 


v(?) = e 


- 


sin? 

cos? 


( 11 ) 
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is a set of real-valued solutions. To verify that u (f) and v(f) are linearly independent, we 
compute their Wronskian: 


W(u, v)(f) 


e ,/2 cos t 
—e~' 12 sint 


e“' /2 sin t 
e _,/2 cos t 


= e '(cos 2 t + sin 2 1) = e ‘ . 

Since the Wronskian is never zero, it follows that u (t) and v(/) constitute a fundamental set of 
(real-valued) solutions of the system (4). 

The graphs of the solutions u(f) and v(f) are shown in Figure 1.6.2a. Since 



the graphs of u(f) and v(t) pass through the points (1,0) and (0, 1), respectively. Other solutions 
of the system (4) are linear combinations of u(f) and v(f), and graphs of a few of these solutions 
are also shown in Figure 7 ,6.2a. Each trajectory approaches the origin along a spiral path as 
t — oo, making infinitely many circuits about the origin; this is due to the fact that the solutions 
(11) are products of decaying exponential and sine or cosine factors. Some typical graphs of 
X] versus t are shown in Figure 1.6.2b ; each one represents a decaying oscillation in time. 

Figure 1.6.2a is typical of all 2 x 2 systems x' = Ax whose eigenvalues are complex with 
negative real part. The origin is called a spiral point and is asymptotically stable because all 
trajectories approach it as t increases. For a system whose eigenvalues have a positive real 
part, the trajectories are similar to those in Figure 1.6.2a, but the direction of motion is away 
from the origin and the trajectories become unbounded. In this case, the origin is unstable. If 
the real part of the eigenvalues is zero, then the trajectories neither approach the origin nor 
become unbounded but instead traverse repeatedly a closed curve about the origin. Examples 
of this behavior can be seen in Figures 1.63b and 1.6Ab below. In this case the origin is called 
a center and is said to be stable, but not asymptotically stable. In all three cases the direction 
of motion may be either clockwise, as in this example, or counterclockwise, depending on the 
elements of the coefficient matrix A. 



FIGURE 7.6.2 (a) Trajectories of the system (4); the origin is a spiral point. 
(b) Plots of x\ versus t for the system (4). 
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Returning to the general equation (1) 

x' = Ax, 

we can proceed just as in the example. Suppose that there is a pair of complex conju- 
gate eigenvalues, n = k + ip and r 2 = \ — ip. Then the corresponding eigenvectors 
(• (1) and !• (2) are also complex conjugates. To see that this is so, recall that r\ and § (1) 
satisfy 

(A - (1) = 0. (12) 

On taking the complex conjugate of this equation and noting that A and I are real- 
valued, we obtain 

(A-rjI)^ =0, (13) 

where r\ and £ (1) are the complex conjugates of zq and § (1) , respectively. In other 
words, r 2 — r\ is also an eigenvalue, and § (2) = £ (1) is a corresponding eigenvector. 
The corresponding solutions 

x rl) (f) = § m e nt , x (2) (0 = J a) e Tlt (14) 

of the differential equation (1) are then complex conjugates of each other. Therefore, 
as in Example 1, we can find two real-valued solutions of Eq. (1) corresponding to 
the eigenvalues r\ and r 2 by taking the real and imaginary parts of x (1) (f) or x (2) (f) 
given by Eq. (14). 

Let us write § (1) = a + zb, where a and b are real; then we have 
x (1) (t) = (a + z'b)e (A - +!M); 

= (a + zb)e ;, (cos pt + i sin pt ). (15) 

Upon separating x (1) (t) into its real and imaginary parts, we obtain 

x (1) (t) = e^(acos pt — b sin pt) + ie Xt (asm pt + b cos pt). (16) 

If we write x (1) (f) = u(f) + i\(t), then the vectors 

u (t) = e u (a cos pt — b sin pt), 

. (17) 

\(t) = e xt (a sin pt + b cos pt) 

are real-valued solutions of Eq. (1). It is possible to show that u and v are linearly 
independent solutions (see Problem 27). 

For example, suppose that the matrix A has two complex eigenvalues r\ = X + ip, 
r 2 = A — ip, and that r^, . . . , r n are all real and distinct. Let the corresponding eigen- 
vectors be § (1) = a + zb, § <2) = a — zb, § (3 \ . . . , % (n) . Then the general solution of 
Eq. (1) is 

x = Cl u(t) + c 2 v(t) + c 3 § ( V 3 ' + • • • + c„§ (18) 

where u(t) and \(t) are given by Eqs. (17). We emphasize that this analysis applies 
only if the coefficient matrix A in Eq. (1) is real, for it is only then that complex 
eigenvalues and eigenvectors must occur in conjugate pairs. 

For 2x2 systems with real coefficients, we have now completed our description 
of the three main cases that can occur. 

1. Eigenvalues are real and have opposite signs; x = 0 is a saddle point. 

2. Eigenvalues are real and have the same sign but are unequal; x = 0 is a node. 

3. Eigenvalues are complex with nonzero real part; x = 0 is a spiral point. 
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Other possibilities are of less importance and occur as transitions between two 
of the cases just listed. For example, a zero eigenvalue occurs during the transition 
between a saddle point and a node. Purely imaginary eigenvalues occur during a 
transition between asymptotically stable and unstable spiral points. Finally, real and 
equal eigenvalues appear during the transition between nodes and spiral points. 


The system 

EXAMPLE 

2 



(19) 


contains a parameter a. Describe how the solutions depend qualitatively on a ; in particular, 
find the critical values of a at which the qualitative behavior of the trajectories in the phase 
plane changes markedly. 

The behavior of the trajectories is controlled by the eigenvalues of the coefficient matrix. 
The characteristic equation is 

r 2 -ar + 4 = 0, (20) 


so the eigenvalues are 

a ± \J a 2 — 16 


(21) 


From Eq. (21) it follows that the eigenvalues are complex conjugates for — 4 < a < 4 and are 
real otherwise. Thus two critical values are a = — 4 and a = 4 , where the eigenvalues change 
from real to complex, or vice versa. Fora < —4 both eigenvalues are negative.so all trajectories 
approach the origin, which is an asymptotically stable node. For a > 4 both eigenvalues are 
positive, so the origin is again a node, this time unstable; all trajectories (except x = 0) become 
unbounded. In the intermediate range, — 4 < a < 4 , the eigenvalues are complex and the 
trajectories are spirals. However, for —4 < a < 0 the real part of the eigenvalues is negative, 
the spirals are directed inward, and the origin is asymptotically stable, whereas for 0 < a < 4 
the real part of the eigenvalues is positive and the origin is unstable. Thus a = 0 is also a critical 
value where the direction of the spirals changes from inward to outward. For this value of a, 
the origin is a center and the trajectories are closed curves about the origin, corresponding to 
solutions that are periodic in time. The other critical values, a = ± 4 , yield eigenvalues that are 
real and equal. In this case the origin is again a node, but the phase portrait differs somewhat 
from those in Section 7.5. We take up this case in Section 7.8. 


A Multiple Spring-Mass System. Consider the system of two masses and three springs 
shown in Figure 7.1.1, whose equations of motion are given by Eqs. (1) in Section 
7.1. If we assume that there are no external forces, then Fi(t) = 0, F 2 (t) = 0, and the 
resulting equations are 


m\ 


d 2 x i 
dt 2 


-(ki + k 2 )x i + k 2 x 2 , 


m 2 


d 2 x 2 

dt 2 


k 2 x\ - ik 2 + k 3 )x 2 . 


( 22 ) 


These equations can be solved as a system of two second order equations (see Prob- 
lem 29), but, consistent with our approach in this chapter, we will transform them 
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into a system of four first order equations. Let y \ = x\,y 2 = x 2 ,y 3 = x[, and y 4 = x' 2 . 
Then 

y'i = )> 3 , y '2 = V4, (23) 

and, from Eqs. (22), 

miy ' 3 = -(ki + k 2 )yi + k 2 y 2 , ra 2 / 4 = k 2 y\ - (k 2 + h)y 2 . (24) 

The following example deals with a particular case of this two-mass, three-spring 
system. 


EXAMPLE 

3 


Suppose that m\ = 2, m 2 = 9/4, ki = 1, k 2 = 3, and k 3 = 15/4 in Eqs. (23) and (24) so that 
these equations become 

y[ = ys, y' 2 = y*, y’ 3 = -2yi + \yi, y\ = \yi - 3y2- (25) 


Analyze the possible motions described by Eqs. (25), and draw graphs showing typical behav- 
ior. 

We can write the system (25) in matrix form as 


/ 0 0 1 

0 0 0 

-2 3/2 0 

v 4/3 -3 0 


°\ 

1 

0 

0 / 


(26) 


Keep in mind that y\ and y 2 are the positions of the two masses, relative to their equilibrium 
positions, and that y 3 and y 4 are their velocities. We assume, as usual, that y = £e", where 
r must be an eigenvalue of the matrix A and £ a corresponding eigenvector. It is possible, 
though a bit tedious, to find the eigenvalues and eigenvectors of A by hand, but it is easy with 
appropriate computer software. The characteristic polynomial of A is 

r 4 + 5r 2 + 4= (r 2 + l)(r 2 + 4) (27) 


so the eigenvalues are r\ = i, r 2 = —i, r 3 = 2 i, and r 4 = —2 i. The corresponding eigenvectors 
are 



/ 3\ 


( ^ 


f ^ 


( 3 \ 


2 

3 i 

W 

§ (2) = 

1 1 

I (3) _ 

-4 

6 i 

1-8*7 

I <4) = 

^ 

1 | 

00 ON 


The complex-valued solutions £ (1) e" and £ are complex conjugates, so two real-valued 
solutions can be found by finding the real and imaginary parts of either of them. For instance, 
we have 



/3\ 




^ m e a = 

2 

3 i 

(cos t + i sin t) 


W 





( 3 cos 


< 3 sin T 

= 

2 cos t 

— 3sinr 

+ i 

2sint 

3 cos t 


v — 2 sin t) 


k2 cos t ) 


= u m (t) + iy m (t). 


(29) 
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In a similar way, we obtain 


| < 3 V ft = 


/ 3\ 
-4 
6i 

V-8'7 


(cos 2 t + ; sin 2 t) 


( 3 008 2 ^ 


( 3 sin 2 

— 4cos2t 

—6 sin 2 1 

+ i 

—4 sin 2 1 

6 cos 2 1 

v 8 sin 2 fy 


8 cos 2 1) 


= u (2) (0 + iv ( 2 ) (t). 


(30) 


We leave it to you to verify that u (1) , v (1) , u (2 \ and v (2) are linearly independent and therefore 
form a fundamental set of solutions. Thus the general solution of Eq. (26) is 



( 3 cos T 


< 3 sin t'' 


f 3 cos 2 T* 


( 3 sin 2^ 


2 cost 


2 sinf 


—4 cos 2 1 


— 4sin2t 

y = ci 

— 3sint 

+ c 2 

3 cost 

+ c 3 

—6 sin 2 1 

+ C 4 

6 cos 2t 


^—2 sin ty 


^2 cos t ) 


v 8 sin 2 1 ) 


^—8 cos 2 1) 


where c 1; c 2 , c 3 , and c 4 are arbitrary constants. 

The phase space for this system is four-dimensional, and each solution, obtained by a par- 
ticular set of values for ci, . . . , c 4 in Eq. (31), corresponds to a trajectory in this space. Since 
each solution, given by Eq. (31), is periodic with period 2n, each trajectory is a closed curve. 
No matter where the trajectory starts at t = 0, it returns to that point at t = 2jt, f = 4n, and 
so forth, repeatedly traversing the same curve in each time interval of length 2jt. We do not 
attempt to show any of these four-dimensional trajectories here. Instead, in the figures below 
we show projections of certain trajectories in the yiy 3 - or y 2 y 4 -plane, thereby showing the 
motion of each mass separately. 

The first two terms on the right side of Eq. (31) describe motions with frequency 1 and 
period 2 tt. Note that y 2 = (2/3)>’i in these terms and that y 4 = (2/3)y 3 . This means that the 
two masses move back and forth together, always going in the same direction, but with the 
second mass moving only two-thirds as far as the first mass. If we focus on the solution u ( 1 ) (t) 
and plot vi versus t and y 2 versus t on the same axes, we obtain the cosine graphs of amplitude 
3 and 2, respectively, shown in Figure 7.6.3 a. The trajectory of the first mass in the yi^-plane 
lies on the circle of radius 3 shown in Figure 7.6.3 b, traversed clockwise starting at the point 
(3,0) and completing a circuit in time 2 n . Also shown in this figure is the trajectory of the 
second mass in the y 2 .y 4 -plane, which lies on the circle of radius 2 , also traversed clockwise 
starting at (2, 0) and also completing a circuit in time 2n. The origin is a center in the respective 
y\y$- and y^W-planes. Similar graphs (with an appropriate shift in time) are obtained from 
v (1) or from a linear combination of u (1) and v (1) . 

The remaining terms on the right side of Eq. (31) describe motions with frequency 2 and 
periods. Observe that in this case y 2 = — (4/3Yyi andy 4 = — (4/3)y 3 . This means that the two 
masses are always moving in opposite directions and that the second mass moves four-thirds 
as far as the first mass. If we look only at u (2) (0 and plot y\ versus t and y 2 versus t on the 
same axes, we obtain Figure 7.6.4n. There is a phase difference of n, and the amplitude of y 2 
is four-thirds that of yi, confirming the preceding statements about the motions of the masses. 
Figure 7.6.46 shows a superposition of the trajectories for the two masses in their respective 
phase planes. Both graphs are ellipses, the inner one corresponding to the first mass and the 
outer one to the second. The trajectory on the inner ellipse starts at (3,0), and the trajectory 
on the outer ellipse starts at (—4, 0). Both are traversed clockwise, and a circuit is completed 
in time n . The origin is a center in the respective y 3 y 3 - and V 2 y 4 -planes. Once again, similar 
graphs are obtained from v (2) or from a linear combination of u (2) and v (2) . 
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FIGURE 7.6.3 (a) A plot of y\ versus t and y 2 versus t for the solution u ll) (0- (b) Superpo- 
sition of projections of trajectories in the yiy 3 - and y 2 y 4 -planes for the solution u (1) (/). 




FIGURE 7.6.4 (a) A plot of y\ versus t and y 2 versus t for the solution u (2) (0- (b) Superpo- 
sition of projections of trajectories in the yiy 3 - and y 2 .y 4 -planes for the solution u (2) (f). 

The types of motion described in the two preceding paragraphs are called fundamental 
modes of vibration for the two-mass system. Each of them results from fairly special initial 
conditions. For example, to obtain the fundamental mode of frequency 1, both of the con- 
stants c 3 and c 4 in Eq. (31) must be zero. This occurs only for initial conditions in which 
3y 2 (0) = 2yi(0) and 3y 4 (0) = 2y 3 (0). Similarly, the mode of frequency 2 is obtained only when 
both of the constants cq and c 2 in Eq. (31) are zero — that is, when the initial conditions are 
such that 3y 2 (0) = — 4yi(0) and 3y 4 (0) = — 4y 3 (0). 

For more general initial conditions the solution is a combination of the two fundamental 
modes. A plot of yi versus t for a typical case is shown in Figure 7.6.5 a, and the projection 
of the corresponding trajectory in the yi_y 3 -plane is shown in Figure 7.6.5 b. Observe that this 
latter figure may be a bit misleading in that it shows the projection of the trajectory crossing 
itself. This cannot be the case for the actual trajectory in four dimensions, because it would 
violate the general uniqueness theorem: there cannot be two different solutions issuing from 
the same initial point. 
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( b ) 

FIGURE 7.6.5 A solution of the system (22) satisfying the initial condition 
y (0) = (—1,4, 1,1). (a) A plot of y\ versus t. (b) The projection of the trajectory in 

the yiy 3 -plane. As stated in the text, the actual trajectory in four dimensions does not intersect 
itself. 


PROBLEMS In each of Problems 1 through 6: 

(a) Express the general solution of the given system of equations in terms of real-valued 
functions. 

(b) Also draw a direction field, sketch a few of the trajectories, and describe the behavior of 
the solutions as t oo. 


' *' = () 

41 5 r = (j 



41 2. *■=(-; 

41 4.r-(» :;)■ 

& & »■ = ( J J) 


X 
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In each of Problems 7 and 8 express the general solution of the given system of equations in 
terms of real-valued functions. 



(1 

0 

°\ 



0 

2 \ 

7. x' = 

2 

1 

-2 x 

8. x' = 

1 

-1 

° 


l 3 

2 



l - 2 

-1 



In each of Problems 9 and 10 find the solution of the given initial value problem. Describe the 
behavior of the solution as t — > oo. 



In each of Problems 11 and 12: 

(a) Find the eigenvalues of the given system. 

(b) Choose an initial point (other than the origin) and draw the corresponding trajectory in 
the x\x 2 -plane. 

(c) For your trajectory in part (b) draw the graphs of x\ versus t and of x 2 versus t. 

(d) For your trajectory in part (b) draw the corresponding graph in three-dimensional 
£riX2-space. 



In each of Problems 13 through 20 the coefficient matrix contains a parameter a. In each of 
these problems: 

(a) Determine the eigenvalues in terms of a. 

(b) Find the critical value or values of cr where the qualitative nature of the phase portrait for 
the system changes. 

(c) Draw a phase portrait for a value of a slightly below, and for another value slightly above, 
each critical value. 


41 

41 

41 

41 


13. x' = 


15. x' = 


17. x' = 


19. x' = 


a 1 
— 1 a 

2 -5 

a —2 

-1 
-1 -1 

a 10 
-1 -4 




16. x' = 




In each of Problems 21 and 22 solve the given system of equations by the method of Problem 
19 of Section 7.5. Assume that t > 0. 


21'*'= 1 1 :!)* 


22. lx' = (2 J,„ 


In each of Problems 23 and 24: 

(a) Find the eigenvalues of the given system. 

(b) Choose an initial point (other than the origin) and draw the corresponding trajectory in 
the ^i^-plane. Also draw the trajectories in the x\x 3 - and .v 2 x 3 -planes. 
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(c) For the initial point in part (b) draw the corresponding trajectory in X 1 X 2 X 3 -space. 




1 

°\ 



1 

°\ 

23. x' = 

-1 

1 

4 

0 x 

24. x' = 

-1 

1 

4 

0 


l 0 

0 



l 0 

0 

y 


25. Consider the electric circuit shown in Figure 7.6.6. Suppose that = R 2 = 4 £2, 
C = \ F, and L = 8 H. 

(a) Show that this circuit is described by the system of differential equations 


d 

dt 




(i) 


where 7 is the current through the inductor and V is the voltage drop across the capacitor. 
Hint: See Problem 19 of Section 7.1. 

(b) Find the general solution of Eqs. (i) in terms of real-valued functions. 

(c) Find 7(f) and V (f) if 7(0) = 2 A and 1/(0) = 3 V. 

(d) Determine the limiting values of 7(f) and V (f) as f — >■ 00 . Do these limiting values 
depend on the initial conditions? 



FIGURE 7.6.6 The circuit in Problem 25. 


26. The electric circuit shown in Figure 7.6.7 is described by the system of differential equations 


1 1 

\~C ~RC / 

C 

— K 

R 

-w\ <■ 

L 

FIGURE 7.6.7 The circuit in Problem 26. 

where 7 is the current through the inductor and V is the voltage drop across the capacitor. 
These differential equations were derived in Problem 19 of Section 7.1. 

(a) Show that the eigenvalues of the coefficient matrix are real and different if L > 4R 2 C; 
show that they are complex conjugates if L < 4R 2 C. 



d_ 1 

dt l V 
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(b) Suppose that JJ = 1J2 , C = ^ F, and L = 1 H. Find the general solution of the 
system (i) in this case. 

(c) Find I{t) and V (f) if 7(0) = 2 A and V (0) = 1 V. 

(d) For the circuit of part (b) determine the limiting values of 1 ( t ) and V (t) as t -> oo. Do 
these limiting values depend on the initial conditions? 

27. In this problem we indicate how to show that u(f) and v(t), as given by Eqs. (17), are linearly 
independent. Let n = X + i/i and 7\ = X — i\i be a pair of conjugate eigenvalues of the 
coefficient matrix A of Eq. (1); let | (1) = a + ib and £ (1) = a — ib be the corresponding 
eigenvectors. Recall that it was stated in Section 7.3 that if r\ j=- fi, then £ (1) and § (1 ) are 
linearly independent. 

(a) First we show that a and b are linearly independent. Consider the equation 
Cia + C 2 b = 0. Express a and b in terms of £ (1) and § (1) , and then show that 
(ci - ic 2 )$ (1) + (ci + ic 2 )f (1) = 0. 

(b) Show that ci — ic 2 = Oandci + ic 2 = 0 and then that ci = 0andc 2 = 0. Consequently, 
a and b are linearly independent. 

(c) To show that u(f) and v(f) are linearly independent, consider the equation 
Ciu(fo) + c 2 v(fo) = 0, where to is an arbitrary point. Rewrite this equation in terms of 
a and b, and then proceed as in part (b) to show that ci = 0 and c 2 = 0. Hence u(/) 
and \(t) are linearly independent at the arbitrary point t 0 . Therefore they are linearly 
independent at every point and on every interval. 

28. A mass m on a spring with constant k satisfies the differential equation (see Section 3.7) 


mu" + ku = 0, 


where u(t) is the displacement at time t of the mass from its equilibrium position, 

(a) Let jci = u, x 2 = u', and show that the resulting system is 



(b) Find the eigenvalues of the matrix for the system in part (a). 

(c) Sketch several trajectories of the system. Choose one of your trajectories, and sketch 
the corresponding graphs of jci versus t and of x 2 versus t. Sketch both graphs on one set 
of axes. 

(d) What is the relation between the eigenvalues of the coefficient matrix and the natural 
frequency of the spring-mass system? 

29. Consider the two-mass, three-spring system of Example 3 in the text. Instead of converting 
the problem into a system of four first order equations, we indicate here how to proceed 
directly from Eqs. (22). 

(a) Show that Eqs. (22) can be written in the form 



(b) Assume that x = £ e n and show that 


(A - r 2 I)£ = 0. 

Note that r 2 (rather than r) is an eigenvalue of A corresponding to an eigenvector £. 
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(c) Find the eigenvalues and eigenvectors of A. 

(d) Write down expressions for x 1 and x 2 . There should be four arbitrary constants in 
these expressions. 

(e) By differentiating the results from part (d), write down expressions for x[ a.ndx' 2 . Your 
results from parts (d) and (e) should agree with Eq. (31) in the text. 

30. Consider the two-mass, three-spring system whose equations of motion are Eqs. (22) in 
the text. Let wi = 1, m 2 = 4/3, k\ = 1, k 2 = 3, and k 2 = 4/3. 

(a) As in the text, convert the system to four first order equations of the form y' = Ay. 
Determine the coefficient matrix A. 

(b) Find the eigenvalues and eigenvectors of A. 

(c) Write down the general solution of the system. 

(d) Describe the fundamental modes of vibration. For each fundamental mode draw 
graphs of y\ versus t and y 2 versus t. Also draw the corresponding trajectories in the yiy 2 - 
and y 2 y 4 -planes. 

(e) Consider the initial conditions y(0) = (2, 1, 0, 0) r . Evaluate the arbitrary constants in 
the general solution in part (c). What is the period of the motion in this case? Plot graphs 
of y i versus t and y 2 versus t. Also plot the corresponding trajectories in the yiy 2 - and 
y 2 .V 4 -planes. Be sure you understand how the trajectories are traversed for a full period. 

(f) Consider other initial conditions of your own choice, and plot graphs similar to those 
requested in part (e). 

31. Consider the two-mass, three-spring system whose equations of motion are Eqs. (22) in 
the text. Let mi = m 2 = 1 and k\ = k 2 = k 2 = 1. 

(a) As in the text, convert the system to four first order equations of the form y' = Ay. 
Determine the coefficient matrix A. 

(b) Find the eigenvalues and eigenvectors of A. 

(c) Write down the general solution of the system. 

(d) Describe the fundamental modes of vibration. For each fundamental mode draw 
graphs of y\ versus t and y 2 versus t. Also draw the corresponding trajectories in the yiy 2 - 
and y 2 y 4 -planes. 

(e) Consider the initial conditions y (0) = (— 1,3,0, 0) r . Evaluate the arbitrary constants 
in the general solution in part (c). Plot yi versus t and y 2 versus t. Do you think the 
solution is periodic? Also draw the trajectories in the yiy 2 - and y 2 y^ -planes. 

(f) Consider other initial conditions of your own choice, and plot graphs similar to those 
requested in part (e). 
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The structure of the solutions of systems of linear differential equations can be fur- 
ther illuminated by introducing the idea of a fundamental matrix. Suppose that 
x (1) (t), . . . ,x (,,) (f) form a fundamental set of solutions for the equation 


x' = P (t)x 


( 1 ) 
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on some interval a < t < fl. Then the matrix 


fx[ n (t) 


*(0 = 


V4 1} (0 


4 n> (t)\ 

4 n) (0/ 


( 2 ) 


whose columns are the vectors x (1) (f), . . . , x (,n (f), is said to be a fundamental matrix 
for the system (1). Note that a fundamental matrix is nonsingular since its columns 
are linearly independent vectors. 


EXAMPLE 

1 


Find a fundamental matrix for the system 



In Example 1 of Section 7.5 we found that 


x ll) (f) 



x (2) (t) = 



(3) 


are linearly independent solutions of Eq. (3). Thus a fundamental matrix for the system (3) is 



(4) 


The solution of an initial value problem can be written very compactly in terms of 
a fundamental matrix. The general solution of Eq. (1) is 

X = Cix (1) (0 H b c„x (n, (t) (5) 

or, in terms of 

x = ¥(f)c, (6) 


where c is a constant vector with arbitrary components ci, . . . , c n . For an initial value 
problem consisting of the differential equation (1) and the initial condition 

x(t 0 ) = x°, (7) 

where to is a given point in a < t < and x° is a given initial vector, it is only necessary 
to choose the vector c in Eq. (6) so as to satisfy the initial condition (7). Hence c 
must satisfy 

*(to)c = x°. (8) 

Therefore, since ^(to) is nonsingular, 

c = ^(to)* 0 (9) 


x = 


and 


( 10 ) 
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is the solution of the initial value problem (1), (7). We emphasize, however, that 
to solve a given initial value problem, we would ordinarily solve Eq. (8) by row 
reduction and then substitute for c in Eq. (6), rather than compute Tfo) and use 
Eq. (10). 

Recall that each column of the fundamental matrix ^ is a solution of Eq. (1). It 
follows that satisfies the matrix differential equation 

= P(0*. (11) 


This relation is readily confirmed by comparing the two sides of Eq. (11) column by 
column. 

Sometimes it is convenient to make use of the special fundamental matrix, denoted 
by <h(f), whose columns are the vectors x (1) (f), . . . ,x (,,) (f) designated in Theorem 7.4.4. 
Besides the differential equation (1), these vectors satisfy the initial conditions 

x (i \t 0 ) = e« (12) 


where e (y) is the unit vector, defined in Theorem 7.4.4, with a one in the yth position 
and zeros elsewhere. Thus <!>(/) has the property that 


*('o) = 


A 

0 


0 • ■ ■ 0 \ 
1 0 


\o 0 ... 1/ 


(13) 


We will always reserve the symbol $ to denote the fundamental matrix satisfying the 
initial condition (13) and use when an arbitrary fundamental matrix is intended. 
In terms of <h(f) the solution of the initial value problem (1), (7) is even simpler in 
appearance; since <I> 1 (to) = I, it follows from Eq. (10) that 

x = Q(t)x°. (14) 


Although the fundamental matrix <I>(r) is often more complicated than 'P(t), it is es- 
pecially helpful if the same system of differential equations is to be solved repeatedly 
subject to many different initial conditions. This corresponds to a given physical sys- 
tem that can be started from many different initial states. If the fundamental matrix 
<S>(t) has been determined, then the solution for each set of initial conditions can be 
found simply by matrix multiplication, as indicated by Eq. (14). The matrix <!>(/) thus 
represents a transformation of the initial conditions x° into the solution x(f) at an ar- 
bitrary time t. Comparing Eqs. (10) and (14) makes it clear that <i>(f) = 'k(f)'k^ 1 (to). 


For the system (3) 

EXAMPLE 


in Example 1, find the fundamental matrix $ such that 4>(0) = I. 

The columns of $ are solutions of Eq. (3) that satisfy the initial conditions 




x (1) (0) = (J 


(15) 
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Since the general solution of Eq. (3) is 


x = Cl 



e 3 ' + c 2 



we can find the solution satisfying the first set of these initial conditions by choosing 
ci = C 2 = \\ similarly, we obtain the solution satisfying the second set of initial conditions 
by choosing Cj = ^ and c 2 = — \ . Hence 


<t>(t) = 


\e 3 ' + ie~‘ 


~ \e~‘\ 

\e 3 ‘ + \e-<) 


(16) 


Note that the elements of <h(f) are more complicated than those of the fundamental matrix 
ty(t) given by Eq. (4); however, it is now easy to determine the solution corresponding to any 
set of initial conditions. 


The Matrix exp(At). Recall that the solution of the scalar initial value problem 


where a is a constant, is 


x' — ax, x(0) = Xq, 


(17) 


v = xo exp(at). (18) 

Now consider the corresponding initial value problem for an n x n system, namely, 

x' = Ax, x(0) = x°, (19) 


where A is a constant matrix. Applying the results of this section to the problem 
(19), we can write its solution as 

x = $(0x°, (20) 

where <I>(0) = I. Comparing the problems (17) and (19), and their solutions, suggests 
that the matrix <l>(f) might have an exponential character. We now explore this 
possibility. 

The scalar exponential function exp(nf) can be represented by the power series 

°° d n t n 

exp(nf) = 1 + V — (21) 
L — ' n\ 


which converges for all t . Let us now replace the scalar a by the n x n constant matrix 
A and consider the corresponding series 




A n t n 
n\ 


I + Af + 



A"f" 


nl 


( 22 ) 


Each term in the series (22) is an n x n matrix, it is possible to show that each 
element of this matrix sum converges for all / as n — ► oo. Thus the series (22) defines 
as its sum a new matrix, which we denote by exp(At); that is, 

°° 

exp (Af) = 1 + V — — , (23) 

n\ 


analogous to the expansion (21) of the scalar function exp(^). 
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By differentiating the series (23) term by term, we obtain 


d 

dt 


[exp(Af)] = V — = A I + V — — = A exp (At). 

“ ( n — 1)! " n\ 



(24) 


n= 1 ' L n= 1 


Thus exp (At) satisfies the differential equation 

d 

— exp(At) = A exp(Af). 


dt 


(25) 


Further, when t = 0, exp(Af) satisfies the initial condition 


exp(Af) = I. 


(26) 


The fundamental matrix $ satisfies the same initial value problem as exp(Af) , namely. 


<J>' =A<f>, $(0)=I. 


(27) 


Thus, by the uniqueness part of Theorem 7.1.2 (extended to matrix differential equa- 
tions), we conclude that exp(A?) and the fundamental matrix 4>(f) are the same. Thus 
we can write the solution of the initial value problem (19) in the form 


x = exp(A/)x°, 


(28) 


which is analogous to the solution (18) of the initial value problem (17). 

In order to justify more conclusively the use of exp(Af) for the sum of the series (22), 
we should demonstrate that this matrix function does indeed have the properties we 
associate with the exponential function. One way to do this is outlined in Problem 15. 

Diagonalizable Matrices. The basic reason why a system of linear (algebraic or differ- 
ential) equations presents some difficulty is that the equations are usually coupled. 
In other words, some or all of the equations involve more than one — typically all — of 
the unknown variables. Hence the equations in the system must be solved simul- 
taneously. In contrast, if each equation involves only a single variable, then each 
equation can be solved independently of all the others, which is a much easier task. 
This observation suggests that one way to solve a system of equations might be by 
transforming it into an equivalent uncoupled system in which each equation contains 
only one unknown variable. This corresponds to transforming the coefficient matrix 
A into a diagonal matrix. 

Eigenvectors are useful in accomplishing such a transformation. Suppose that 
the n x n matrix A has a full set of n linearly independent eigenvectors. Recall 
that this will certainly be the case if the eigenvalues of A are all different, or if 
A is Hermitian. Letting § (1) , . . . ,% (n) denote these eigenvectors and X \, . . . , k n the 
corresponding eigenvalues, form the matrix T whose columns are the eigenvectors — 
that is. 


Ar (1) ••• ir\ 


T = 


ve ••• w 


(29) 
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Since the columns of T are linearly independent vectors, det T ^ 0; hence T is non- 
singular and T 1 exists. A straightforward calculation shows that the columns of the 
matrix AT are just the vectors A £ 11 1 , . . . , Ai- (n> . Since A§ <4) = , it follows that 


where 


At?! 


(i) 


AT = 



W/A ■■■ *„$<»>/ 


D = 


At 

o 


0 0\ 
Xi • • • o 


\o 0 ... kj 


(30) 


(31) 


is a diagonal matrix whose diagonal elements are the eigenvalues of A. From Eq. (30) 
it follows that 


T _1 AT = D. 


(32) 


Thus, if the eigenvalues and eigenvectors of A are known, A can be transformed 
into a diagonal matrix by the process shown in Eq. (32). This process is known as 
a similarity transformation, and Eq. (32) is summed up in words by saying that A is 
similar to the diagonal matrix D. Alternatively, we may say that A is diagonalizable. 
Observe that a similarity transformation leaves the eigenvalues of A unchanged and 
transforms its eigenvectors into the coordinate vectors e ( 1 V . . . ,e (H) . 

If A is Flermitian, then the determination of T 1 is very simple. The eigenvectors 
§ (1 \ . . . , % (n) of A are known to be mutually orthogonal, so let us choose them so that 
they are also normalized by (£ W ,I W ) = 1 for each i. Then it is easy to verify that 
T = T*; in other words, the inverse of T is the same as its adjoint (the transpose of 
its complex conjugate). 

Finally, we note that if A has fewer than n linearly independent eigenvectors, then 
there is no matrix T such that T _1 AT = D. In this case, A is not similar to a diagonal 
matrix and is not diagonalizable. 


EXAMPLE 

3 


Consider the matrix 



Find the similarity transformation matrix T and show that A can be diagonalized. 

In Example 1 of Section 7.5 we found that the eigenvalues and eigenvectors of A are 

ri -3, l“ = Q ; -i— 1, «“-(_))- < 34 > 

Thus the transformation matrix T and its inverse T _1 are 
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Consequently, you can check that 


T AT 



(36) 


Now let us turn again to the system 

x' = Ax, 


(37) 


where A is a constant matrix. In Sections 7.5 and 7.6 we described how to solve such 
a system by starting from the assumption that x = S-e n . Now we provide another 
viewpoint, one based on diagonalizing the coefficient matrix A. 

According to the results stated just above, it is possible to diagonalize A whenever A 
has a full set of n linearly independent eigenvectors. Let £ (1) ,...,§ (n) be eigenvectors 
of A corresponding to the eigenvalues r \ , . . . , r n and form the transformation matrix 
T whose columns are £ (1) , . . . , fj (n \ Then, defining a new dependent variable y by 
the relation 

x = Ty, (38) 

we have from Eq. (37) that 

Ty' = ATy. (39) 

Multiplying by T^ 1 , we then obtain 

y' = (T _1 AT)y, (40) 


or, using Eq. (32), 


y = Dy 


(41) 


Recall that D is the diagonal matrix with the eigenvalues r\, . . . ,r n of A along the 
diagonal. A fundamental matrix for the system (41) is the diagonal matrix (see 
Problem 16) 


Q(0 = exp(Df) = 


/e n ‘ 0 ... 0 \ 

0 e rit ... 0 

V 0 0 • • • e r " t J 


(42) 


A fundamental matrix 'll for the system (37) is then found from Q by the transfor- 
mation (38) 

* = TQ; (43) 


that is. 


*(t) = 






%n n) e rnt ) 


(44) 


Equation (44) is the same result that was obtained in Section 7.5. This diago- 
nalization procedure does not offer any computational advantage over the method 
of Section 7.5, since in either case it is necessary to calculate the eigenvalues and 
eigenvectors of the coefficient matrix in the system of differential equations. 


420 


Chapter 7. Systems of First Order Linear Equations 


EXAMPLE 

4 


PROBLEMS 


Consider again the system of differential equations 

x' = Ax, (45) 

where A is given by Eq. (33). Using the transformation x = Ty, where T is given by Eq. (35), 
you can reduce the system (45) to the diagonal system 

y' = ^ y = Dy (46) 


Obtain a fundamental matrix for the system (46) and then transform it to obtain a fundamental 
matrix for the original system (45). 

By multiplying D repeatedly with itself, we find that 


D 2 = 



(47) 


Therefore it follows from Eq. (23) that exp(Df) is a diagonal matrix with the entries e 3 ' and 
e~‘ on the diagonal; that is, 


e 


Dr 




(48) 


Finally, we obtain the required fundamental matrix 4'(r) by multiplying T and exp(Dr): 



Observe that this fundamental matrix is the same as the one found in Example 1. 


In each of Problems 1 through 10: 

(a) Find a fundamental matrix for the given system of equations. 

(b) Also find the fundamental matrix $(?) satisfying 4>(0) = I. 




( 1 

1 

A 


r \ 

-1 

4 

9. x' = 

2 

1 

-l x 

10 . x' = 

3 

2 

-1 


l - 8 

-5 

-3 ) 


l 2 

1 

- 1 / 
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11. Solve the initial value problem 



by using the fundamental matrix found in Problem 3. 

12. Solve the initial value problem 



by using the fundamental matrix <t>{t) found in Problem 6. 

13. Show that 4>(Y) = »P(f)*I'~ 1 (ro). where $(t) and <P(f) are as defined in this section. 

14. The fundamental matrix $(r) for the system (3) was found in Example 2. Show that 
4>(r)$(^) = $(r + s) by multiplying 4>(f) and $(j). 

15. Let <h(r) denote the fundamental matrix satisfying 4>' = A<1>. <I>(0) = I. In the text we also 
denoted this matrix by exp(Af). In this problem we show that does indeed have the 
principal algebraic properties associated with the exponential function. 

(a) Show that $(f)$(s) = $(t + s); that is, show that exp(Af) exp(As) = exp[A(r + s)]. 
Hint: Show that if s is fixed and t is variable, then both 4>(r)<I>(.s) and 4>(t + s) satisfy the 
initial value problem Z' = AZ, Z(0) = <J>(s). 

(b) Show that 4>(r)<I>(— t) = I; that is, exp(Af) exp[A(— t)] = I. Then show that 
$(-r) = 

(c) Show that $(t — s) = $(f)4> _1 (i). 

16. Show that if A is a diagonal matrix with diagonal elements ai,a 2 , . . . ,a n , then exp(At) is 
also a diagonal matrix with diagonal elements exp (ait), exp(a 2 t), . . . ,exp (a n t). 

17. Consider an oscillator satisfying the initial value problem 

u" + a> 2 u = 0, u( 0) = uq, u'( 0) = vq. (i) 


(a) Let x\ = u,X 2 = it' , and transform Eqs. (i) into the form 

x' = Ax, x(0) = x°. (ii) 

(b) By using the series (23), show that 


exp At = I cos t ot + 


A 


sin cot 


CD 


(hi) 


(c) Lind the solution of the initial value problem (ii). 

18. The method of successive approximations (see Section 2.8) can also be applied to systems 
of equations. Lor example, consider the initial value problem 


x' = Ax, x(0) = x°, 


(i) 


where A is a constant matrix and x° is a prescribed vector. 

(a) Assuming that a solution x = <t>{t) exists, show that it must satisfy the integral equation 


<!>{t) = x° + 


f 


A ds. 


(ii) 
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(b) Start with the initial approximation 0 (O) ( t ) = x°. Substitute this expression for 0(s) in 
the right side of Eq. (ii) and obtain a new approximation 0 (1) (/). Show that 

0 (1) (O = (I + Af)x°. (hi) 

(c) Repeat this process and thereby obtain a sequence of approximations 0 (O \ <j> a \ 
0 (2) , . . . , <j) >n \ . . . . Use an inductive argument to show that 

0 < "*(O — (l + Af + A 2 ^- + --- + A"Mx°. (iv) 

V 2! n\ ) 

(d) Let n — > oo and show that the solution of the initial value problem (i) is 

0(f) = exp(Af)x°. (v) 


7.8 Repeated Eigenvalues 

We conclude our consideration of the linear homogeneous system with constant 
coefficients 

x' = Ax (1) 

with a discussion of the case in which the matrix A has a repeated eigenvalue. Recall 
that in Section 7.3 we stated that a repeated eigenvalue with algebraic multiplicity 
k >2 may have a geometric multiplicity less than k. In other words, there may be 
fewer than k linearly independent eigenvectors associated with this eigenvalue. The 
following example illustrates this possibility. 


EXAMPLE 


Find the eigenvalues and eigenvectors of the matrix 

H ~V 

The eigenvalues r and eigenvectors § satisfy the equation (A — rl)£ = 0, or 

The eigenvalues are the roots of the equation 


det(A — rl) = 


1 -r -1 
1 3 - r 


= r — 4r + 4 = 0. 


( 2 ) 

(3) 

(4) 


Thus the two eigenvalues are r\ = 2 and r 2 = 2; that is, the eigenvalue 2 has algebraic multi- 
plicity 2. 

To determine the eigenvectors, we must return to Eq. (3) and use for r the value 2. This 
gives 


: 1 1 




(5) 
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Hence we obtain the single condition £1 + £2 = 0, which determines f 2 in terms of £ 1 , or vice 
versa. Thus the eigenvector corresponding to the eigenvalue r = 2 is 

= (J) • «■> 

or any nonzero multiple of this vector. Observe that there is only one linearly independent 
eigenvector associated with the double eigenvalue. 


Returning to the system (1), suppose that r = p is a /r-fold root of the determinantal 
equation 

det(A - r\) = 0. (7) 

Then p is an eigenvalue of algebraic multiplicity k of the matrix A. In this event, 
there are two possibilities: either there are k linearly independent eigenvectors cor- 
responding to the eigenvalue p, or else there are fewer than k such eigenvectors. 

In the first case, let § (1) ,. . . ,§ <k> be k linearly independent eigenvectors associ- 
ated with the eigenvalue p of algebraic multiplicity k. Then x f! 1 ( r ) = £ 1 r V", . . . , 
x (k) {t) = § (k ">e pt are k linearly independent solutions of Eq. (1). Thus in this case it 
makes no difference that the eigenvalue r — pis repeated; there is still a fundamental 
set of solutions of Eq. (1) of the form $e rt . This case always occurs if the coefficient 
matrix A is Hermitian. 

However, if the coefficient matrix is not Hermitian, then there may be fewer than k 
independent eigenvectors corresponding to an eigenvalue p of algebraic multiplicity 
k , and if so, there will be fewer than k solutions of Eq. (1) of the form %e pt associated 
with this eigenvalue. Therefore, to construct the general solution of Eq. (1), it is 
necessary to find other solutions of a different form. By analogy with previous results 
for linear equations of order n, it is natural to seek additional solutions involving 
products of polynomials and exponential functions. We first consider an example. 


Find a fundamental set of solutions of 

EXAMPLE / \ 

2 x' — Ax = R Jx (8) 

and draw a phase portrait for this system. 

A direction field for the system (8) is shown in Figure 7.8.1. From this figure it appears that 
all nonzero solutions depart from the origin. 

To solve the system, observe that the coefficient matrix A is the same as the matrix in 
Example 1. Thus we know that r = 2 is a double eigenvalue and that it has only a single 
corresponding eigenvector, which we may take as % T = (1,-1). Thus one solution of the 
system (8) is 

x (1) « = ( e 2 ', (9) 

but there is no second solution of the form x = £e". 

Based on the procedure used for second order linear equations in Section 3.4, it may be 
natural to attempt to find a second solution of the system (8) of the form 

x = £te 2 ', 


(10) 
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FIGURE 7.8.1 A direction field for the system (8). 


where ? is a constant vector to be determined. Substituting for x in Eq. (8) gives 

2i-te 2 ‘ + ?e 2f - A?te 2 ' = 0. (11) 

For Eq. (11) to be satisfied for all t, it is necessary for the coefficients of te 2 ' and e 2 ' both to be 
zero. From the term in e 2 ' we find that 

« = 0- (12) 

Hence there is no nonzero solution of the system (8) of the form (10). 

Since Eq. (11) contains terms in both te 21 and e 2t , it appears that in addition to ?te 2 ', the 
second solution must contain a term of the form ije 2f ; in other words, we need to assume that 

x = |fe 2 ' + tie 2 ', (13) 

where ? and t] are constant vectors to be determined. Upon substituting this expression for x 
in Eq. (8), we obtain 

2 ?te 2f + ($ + 2ri)e 2 ' = A(?re 2 ' + r,e 2 '). (14) 

Equating coefficients of te 2 ‘ and e 2t on each side of Eq. (14) gives the conditions 

(A - 21)? = 0 (15) 

and 

(A -21)* = $ (16) 

for the determination of ? and rj. Equation (15) is satisfied if ? is an eigenvector of A cor- 
responding to the eigenvalue r — 2, that is, ? r = (1, —1). Since det(A — 21) is zero, we might 
expect that Eq. (16) cannot be solved. However, this is not necessarily true, since for some 
vectors ? it is possible to solve Eq. (16). In fact, the augmented matrix for Eq. (16) is 

-1 -1 | 1 

1 1 | -1 

The second row of this matrix is proportional to the first, so the system is solvable. We have 

-7i -'72 = 1, 

so if )?! = k , where k is arbitrary, then rj 2 = —k — 1. If we write 
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FIGURE 7.8.2 (a) Trajectories of the system (8); the origin is an improper node, (b) Plots 
of xi versus t for the system (8). 


then by substituting for £ and jj in Eq. (13), we obtain 

* = (_!) te2 ' + (_J) g2 ' + k (_}) ( i§ ) 

The last term in Eq. (18) is merely a multiple of the first solution x (1) (t) and may be ignored, 
but the first two terms constitute a new solution: 

x<2) «=(_l) fe2 '+(_j) e2 '- < 19 ) 


An elementary calculation shows that lT[x (1) , x (2) ](t) = — e 4 ', and therefore x (1) and x (2) form 
a fundamental set of solutions of the system (8). The general solution is 

x = cix (1) (f) + c 2 x (2) (r) 


= c l 



e 2 ' + c 2 




(20) 


The graph of the solution (20) is a little more difficult to analyze than in some of the previous 
examples. Since every term in Eq. (20) contains the exponential factor e 2t , it follows that x 
becomes unbounded as t — >■ oo and that x — *■ 0 as t — > —oo. It is possible to show that as 
t — > —oo, all solutions approach the origin tangent to the line x 2 = — x\ determined by the 
eigenvector. Similarly, as t — > oo, each trajectory is asymptotic to a line of slope —1. The 
trajectories of the system (8) are shown in Figure 7.8.2 a, and some typical plots of X\ versus t 
are shown in Figure 7.8.2 b. The pattern of trajectories in this figure is typical of 2 x 2 systems 
x' = Ax with equal eigenvalues and only one independent eigenvector. The origin is called an 
improper node in this case. If the eigenvalues are negative, then the trajectories are similar but 
are traversed in the inward direction. An improper node is asymptotically stable or unstable, 
depending on whether the eigenvalues are negative or positive. 


One difference between a system of two first order equations and a single second 
order equation is evident from the preceding example. Recall that for a second order 
linear equation with a repeated root ri of the characteristic equation, a term ce nt in 
the second solution is not required since it is a multiple of the first solution. On the 
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other hand, for a system of two first order equations, the term r]e nt of Eq. (13) with 
r\ — 2 is not a multiple of the first solution t-e r, ‘ , so the term rje r, ‘ must be retained. 

Example 2 is entirely typical of the general case when there is a double eigenvalue 
and a single associated eigenvector. Consider again the system (1), and suppose 
that r — p is a double eigenvalue of A, but that there is only one corresponding 
eigenvector §. Then one solution [similar to Eq. (9)] is 

x (1) (0 = %e pt , (21) 


where £ satisfies 


(A - pl)$ = 0. (22) 

By proceeding as in Example 2, we find that a second solution [similar to Eq. (19)] is 

x (2) (0 = i-te pt + rie pt , (23) 

where § satisfies Eq. (22) and rj is determined from 

(A — pi) ri = £. (24) 


Even though det(A — pi) = 0, it can be shown that it is always possible to solve 
Eq. (24) for t). The vector ri is called a generalized eigenvector corresponding to the 
eigenvalue p. 


Fundamental Matrices. As explained in Section 7.7, fundamental matrices are formed 
by arranging linearly independent solutions in columns. Thus, for example, a funda- 
mental matrix for the system (8) can be formed from the solutions x (1> (t) and x (2 \t) 
from Eqs. (9) and (19), respectively: 


*(*) = 





(25) 


The matrix $ that satisfies 4>(0) = I can also be readily found from the relation 
4>(t) = «r(0¥ -1 (0). For Eq. (8) we have 


* <o >=u I -?)■ _?)• 


(26) 


and then 


$(f) = *(f)'I'- 1 (0) = e 2 ' 


1 t 


1 0 


-1 -1 - t) \-l -l 

1 — t —t 


t 1 + tj 

The latter matrix is also the exponential matrix exp (At). 


(27) 


Jordan Forms. An n x n matrix A can be diagonalized as discussed in Section 7.7 only if 
it has a full complement of n linearly independent eigenvectors. If there is a shortage 
of eigenvectors (because of repeated eigenvalues), then A can always be transformed 
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into a nearly diagonal matrix called its Jordan 6 form, which has the eigenvalues of 
A on the main diagonal, ones in certain positions on the diagonal above the main 
diagonal, and zeros elsewhere. 

Consider again the matrix A given by Eq. (2). Form the transformation matrix 
T with the single eigenvector (j from Eq. (6) in its first column and the generalized 
eigenvector r) from Eq. (17) with k = 0 in the second column. Then T and its inverse 
are given by 



As you can verify, it follows that 

T " AT= (o 2) = ^ (29) 

The matrix J in Eq. (29) is the Jordan form of A. It is typical of all Jordan forms in that 
it has a 1 above the main diagonal in the column corresponding to the eigenvector 
that is lacking (and is replaced in T by the generalized eigenvector). 

If we start again from Eq. (1) 

x' = Ax, 

the transformation x = Ty, where T is given by Eq. (28), produces the system 

y' = Jy. (30) 

where J is given by Eq. (29). In scalar form the system (30) is 

y[ = 2y 1 +y 2 , y 2 = 2y 2 . (31) 

These equations can be solved readily in reverse order. In this way we obtain 

y 2 = c\e 2t , yi = c x te 2t + c 2 e 2t . (32) 

Thus two independent solutions of the system (30) are 

y (1) (0 = Qe* y®(0 = ( 1 V, (33) 

and the corresponding fundamental matrix is 

* ( e 2t te 2t \ 

*<'> = ( 0 *)■ (34) 

Since ^(O) = I, we can also identify the matrix in Eq. (34) as exp(Jf)- The same result 
can be reached by calculating powers of J and substituting them into the exponential 


6 Camille Jordan (1838-1921), professor at the Ecole Polytechnique and the College de France, made 
important contributions to analysis, topology, and especially to algebra. The Jordan form of a matrix 
appeared in his influential book Traite des substitutions et des equations algebriques , published in 1870. 
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series (see Problems 19 through 21). To obtain a fundamental matrix for the original 
system, we now form the product 


»P(f) = T exp(J?) = 




(35) 


which is the same as the fundamental matrix given in Eq. (25). 

We will not discuss the Jordan forms ofnxn matrices in more detail here. For 
large n it is possible that there may be eigenvalues of high algebraic multiplicity, 
perhaps with much lower geometric multiplicity. A full discussion 7 of the Jordan 
form of a general n x n matrix requires a greater background in linear algebra than 
we assume for most readers of this book. Problems 17 through 21 ask you to explore 
the use of Jordan forms for systems of three equations. 


PROBLEMS In each of Problems 1 through 4: 

(a) Draw a direction field and sketch a few trajectories. 

(b) Describe how the solutions behave as t — > oo. 

(c) Find the general solution of the system of equations. 


41 

41 



41 

41 




X 


X 


In each of Problems 5 and 6 find the general solution of the given system of equations. 



/ 1 

1 

A 


7 0 

1 

A 

5. x' = 

2 

1 

- 1 X 

6. x' = 

1 

0 

1 


1° 

-1 



l 1 

1 

0/ 


In each of Problems 7 through 10: 

(a) Find the solution of the given initial value problem. 

(b) Draw the trajectory of the solution in the xiX 2 -plane and also draw the graph of x\ 
versus t. 



7 For example, see the books listed in the References at the end of this chapter. 
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In each of Problems 11 and 12: 

(a) Find the solution of the given initial value problem. 

(b) Draw the corresponding trajectory in xiX 2 X 3 -space and also draw the graph of x\ versus t. 


: li. x' = 

12. x' = 


1 1 
-4 

V 3 

(-1 

1 

V i 



x(0) = 


/ -1> 
\-30j 



In each of Problems 13 and 14 solve the given system of equations by the method of Problem 
19 of Section 7.5. Assume that t > 0. 


l3 *=C‘ -t 

15. Show that all solutions of the system 


14. fx' = 


1 -4 


approach zero as t — > oo if and only if a + d <0 and ad — be > 0. Compare this result 
with that of Problem 38 in Section 3.4. 

16. Consider again the electric circuit in Problem 26 of Section 7.6. This circuit is described 
by the system of differential equations 


1 1 
\C ~RC / 

(a) Show that the eigenvalues are real and equal if L = 4 R 2 C. 

(b) Suppose that = 1 12, C = 1 F, and L = 4H. Suppose also that 7(0) = 1 A and 
V(0) = 2 V. Find/(f) and V(t). 



- 1 
dt If 


Eigenvalues of Multiplicity 3. If the matrix A has an eigenvalue of algebraic multiplicity 3, 
then there may be either one, two, or three corresponding linearly independent eigenvectors. 
The general solution of the system x' = Ax is different, depending on the number of eigenvec- 
tors associated with the triple eigenvalue. As noted in the text, there is no difficulty if there 
are three eigenvectors, since then there are three independent solutions of the form x = §e". 
The following two problems illustrate the solution procedure for a triple eigenvalue with one 
or two eigenvectors, respectively. 

17. Consider the system 


x' = Ax = 




(i) 
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(a) Show that r = 2 is an eigenvalue of algebraic multiplicity 3 of the coefficient matrix 
A and that there is only one corresponding eigenvector, namely. 


v-v 

(b) Using the information in part (a), write down one solution x ( 1 ) (t) of the system (i). 
There is no other solution of the purely exponential form x = §e". 

(c) To find a second solution, assume that x = £fe 2 ' + r/e 2 '. Show that £ and ij satisfy the 
equations 

(A -21)5 = 0, (A — 2T)r/ = £■ 


Since £ has already been found in part (a), solve the second equation for »/. Neglect the 
multiple of § ll) that appears in )j, since it leads only to a multiple of the first solution x (1) . 
Then write down a second solution x (2, (0 of the system (i). 

(d) To find a third solution, assume that x = §(t 2 /2)e 2 ' + rite 2 ' + ?e 2t . Show that 5, »/, and 
? satisfy the equations 


(A - 2I)| = 0, (A - 21)17 = 5, (A - 21)? = i /. 


The first two equations are the same as in part (c), so solve the third equation for ?, again 
neglecting the multiple of £ (1) that appears. Then write down a third solution x (3) (r) of the 
system (i). 

(e) Write down a fundamental matrix 'l'(f) for the system (i). 

(f) Form a matrix T with the eigenvector £ (1) in the first column and the generalized 
eigenvectors ri and ? in the second and third columns. Then find T _1 and form the product 
J = T AT. The matrix J is the Jordan form of A. 


18. Consider the system 



(i) 


(a) Show that r = 1 is a triple eigenvalue of the coefficient matrix A and that there are 
only two linearly independent eigenvectors, which we may take as 




( 0\ 

s (1) = 

0 , ? (2) = 

2 


w 

\- 3 / 


Find two linearly independent solutions x ( 1 ) (t) and x ( 2 , (f) of Eq. (i). 

(b) To find a third solution, assume that x = £fe' + tie'; then show that § and t] must satisfy 


(A - D$ = 0, (in) 

(A — I) 17 = 5 • (iv) 

(c) Show that £ = Ci£ (1) + C 2 % <2) , where ci and C 2 are arbitrary constants, is the most 
general solution of Eq. (iii). Show that in order for us to solve Eq. (iv), it is necessary that 

Cl = c 2 . 
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(d) It is convenient to choose c\ — C 2 = 2. For this choice, show that 



( 2 \ 

( 


4 , r] = 

0 


\~ 2 ) 



where we have dropped the multiples of £ (1) and § t2) that appear in t]. Use the results 
given in Eqs. (v) to find a third linearly independent solution x (3) (t) of Eq. (i). 

(e) Write down a fundamental matrix (f) for the system (i). 

(f) Form a matrix T with the eigenvector § (1) in the first column and with the eigenvector 
£ and the generalized eigenvector ij from Eqs. (v) in the other two columns. Find T _1 and 
form the product J = T AT. The matrix J is the Jordan form of A. 


19. Let J = 


^ , where X is an arbitrary real number. 


(a) Find J 2 ,J 3 , and J 4 . 

(b) Use an inductive argument to show that J" = 

(c) Determine exp(Jf). 

(d) Use exp(Jr) to solve the initial value problem x' = Jx, x(0) = x°. 
20. Let 




where X is an arbitrary real number. 

(a) Find J 2 ,J 3 , and J 4 . 

(b) Use an inductive argument to show that 


r 


( X" 


0 

V° 


o 

x n 

0 



(c) Determine exp(Jf). 

(d) Observe that if you choose X = 1, then the matrix J in this problem is the same as 
the matrix J in Problem 18(f). Using the matrix T from Problem 18(f), form the product 
T exp(Jf) with X = 1. Is the resulting matrix the same as the fundamental matrix <P(r) in 
Problem 18(e)? If not, explain the discrepancy. 

21. Let 



where X is an arbitrary real number. 
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(a) Find J 2 ,J 3 , and J 4 . 

(b) Use an inductive argument to show that 



!X n 

nX n ~ l 

\n(n - 1)/2]V , " 2 \ 

j 

r = 

b 

X n 

nX n ~ l 

I 

(c) Determine exp(Jf). 

u 

0 

^ ) 


(d) Observe that if you choose X = 2, then the matrix J in this problem is the same as 
the matrix J in Problem 17(f). Using the matrix T from Problem 17(f), form the product 
T exp(Jf) with X = 2. Observe that the resulting matrix is the same as the fundamental 
matrix *P(t) in Problem 17(e). 


7.9 Nonhomogeneous Linear Systems 

In this section we turn to the nonhomogeneous system 

x' = P(t)x + g(t), (1) 

where the n x n matrix P(f) and n x 1 vector g (t) are continuous for a < t < /3. By 
the same argument as in Section 3.5 (see also Problem 16 in this section), the general 
solution of Eq. (1) can be expressed as 

X = cix (1 V) H f c n x (n \t) + v(f), (2) 

where cix (1) (f) + b c n x (n) (t) is the general solution of the homogeneous system 

x' = P(f)x, and v(f) is a particular solution of the nonhomogeneous system (1). We 
will briefly describe several methods for determining v(f). 

Diagonalization. We begin with systems of the form 

x' = Ax + g(0, (3) 

where A is an n x n diagonalizable constant matrix. By diagonalizing the coefficient 
matrix A, as indicated in Section 7.7, we can transform Eq. (3) into a system of 
equations that is readily solvable. 

Let T be the matrix whose columns are the eigenvectors § (1) , . . . <n) of A, and 

define a new dependent variable y by 

x = Ty. (4) 

Then, substituting for x in Eq. (3), we obtain 

Ty' = ATy + g (t). 

When we multiply by T , it follows that 

y' = (T- 1 AT)y+T- 1 g(0 = Dy + h(0, (5) 

where h(f) = T x g(?) and where D is the diagonal matrix whose diagonal entries 
are the eigenvalues r\,...,r n of A, arranged in the same order as the corresponding 
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eigenvectors § (1) , . . . , § (w) that appear as columns of T. Equation (5) is a system of 
n uncoupled equations for y\(t), . . . ,y„(f); as a consequence, the equations can be 
solved separately. In scalar form Eq. (5) has the form 

y'j(t) = rjyjit) + hj(t ), / = 1, (6) 

where hj(t) is a certain linear combination of gi(t), . . . ,g n (t). Equation (6) is a first 
order linear equation and can be solved by the methods of Section 2.1. In fact, we 
have 

yj(t)=e r ’‘[ e~ r i s hj(s)ds + Cje r i\ j= 1, n , (7) 

Jto 

where the c ; are arbitrary constants. Finally, the solution x of Eq. (3) is obtained 
from Eq. (4). When multiplied by the transformation matrix T, the second term on 
the right side of Eq. (7) produces the general solution of the homogeneous equation 
x' = Ax, while the first term on the right side of Eq. (7) yields a particular solution 
of the nonhomogeneous system (3). 


EXAMPLE 


Find the general solution of the system 

-2 1 


x = ‘ f - 2 ) X+ U )= Ax + s«- 


2e“ 


( 8 ) 


Proceeding as in Section 7.5, we find that the eigenvalues of the coefficient matrix are 
r\ = —3 and r 2 = —1 and that the corresponding eigenvectors are 


S (1) = 


1 


| (2) 


! / ’ 11 

Thus the general solution of the homogeneous system is 


x = c i| J e 3, + c 2 (| | e '. 


(9) 


( 10 ) 


Before writing down the matrix T of eigenvectors, we recall that eventually we must find T _1 . 
The coefficient matrix A is real and symmetric, so we can use the result stated at the end of 
Section 7.3: is simply the adjoint or (since T is real) the transpose of T, provided that the 

eigenvectors of A are normalized so that (£,£) = 1. Hence, upon normalizing £ (1 > and £ <2) , 
we have 


T = 


- 1 

V2\-l 



T -i 




( 11 ) 


Letting x = Ty and substituting for x in Eq. (8), we obtain the following system of equations 
for the new dependent variable y: 


0 


I y + — ( 2e ~‘ ~ 3t 

- 1 / V2 \2e-‘ + 3t 


y' = Dy + T ’g(0 = 


-3 


( 12 ) 
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Thus 


y[ + 3yi = V2e ' - —t, 


y' 2 +y2 = V2e ' + -j=t. 


(13) 


Each of Eqs. (13) is a first order linear equation and so can be solved by the methods of Section 
2.1. In this way we obtain 


V2„_ t 


Vi = — e — . 

2 V2LV3 


+ C\e 


y 2 = sllte ' + —= (t - 1) + c 2 e '. 

v 2 

Finally, we write the solution in terms of the original variables: 
x = Ty=^( 

V2 \-yi + yi ) 

( c i /\/2)e 31 + (C 2 / V2) + | J e ( + t— | + te 1 

^-(ci/V 2)e -3 ' + [(c 2 /V2) - 1 j e“' + 2r - § + re - ^ 


= k 


1 1 -l) £ " 3 ' + * 2 (1) e "' + 5 (-1) + (1) + (2) f ~ \ (5 


(14) 


(15) 


where k\ = ci/^/2 and k 2 = c 2 /\fl. The first two terms on the right side of Eq. (15) form the 
general solution of the homogeneous system corresponding to Eq. (8). The remaining terms 
are a particular solution of the nonhomogeneous system. 


If the coefficient matrix A in Eq. (3) is not diagonalizable (because of repeated 
eigenvalues and a shortage of eigenvectors), it can nevertheless be reduced to a 
Jordan form J by a suitable transformation matrix T involving both eigenvectors and 
generalized eigenvectors. In this case the differential equations for jq, . . . , y„ are not 
totally uncoupled since some rows of J have two nonzero elements: an eigenvalue 
in the diagonal position and a 1 in the adjacent position to the right. However, the 
equations for y\,...,y n can still be solved consecutively, starting with y n . Then the 
solution of the original system (3) can be found by the relation x = Ty. 

Undetermined Coefficients. A second way to find a particular solution of the nonhomo- 
geneous system (1) is the method of undetermined coefficients. To make use of this 
method, we assume the form of the solution with some or all of the coefficients un- 
specified, and then seek to determine these coefficients so as to satisfy the differential 
equation. As a practical matter, this method is applicable only if the coefficient ma- 
trix P is a constant matrix, and if the components of g are polynomial, exponential, or 
sinusoidal functions, or sums or products of these. In these cases the correct form of 
the solution can be predicted in a simple and systematic manner. The procedure for 
choosing the form of the solution is substantially the same as that given in Section 3.5 
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for linear second order equations. The main difference is illustrated by the case of a 
nonhomogeneous term of the form ue^, where X is a simple root of the characteristic 
equation. In this situation, rather than assuming a solution of the form a te kt , it is 
necessary to use ate kt + be kt , where a and b are determined by substituting into the 
differential equation. 


EXAMPLE 

2 


Use the method of undetermined coefficients to find a particular solution of 




= Ax + g(t). 


(16) 


This is the same system of equations as in Example 1. To use the method of undetermined 
coefficients, we write g (t) in the form 


Then we assume that 



x = v(t) = are ' + be ' + cr + d. 


(17) 


(18) 


where a, b, c, and d are vectors to be determined. Observe that r = —1 is an eigenvalue of the 
coefficient matrix, and therefore we must include both are - ' and be - ' in the assumed solution. 
By substituting Eq. (18) into Eq. (16) and collecting terms, we obtain the following algebraic 
equations for a, b, c, and d: 



Ad = c. 


(19) 


From the first of Eqs. (19) we see that a is an eigenvector of A corresponding to the eigenvalue 
r = —1. Thus a r = (a, a), where a is any nonzero constant. Then we find that the second of 
Eqs. (19) can be solved only if a = 1 and that in this case 


b — k 




( 20 ) 


for any constant k. The simplest choice is k = 0, from which b 7 = (0, —1). Then the third and 
fourth of Eqs. (19) yield c r = (1,2) andd r = (—|, — |), respectively. Finally, from Eq. (18) we 
obtain the particular solution 


v(0 = 



( 21 ) 


The particular solution (21) is not identical to the one contained in Eq. (15) of Example 1 
because the term in e~‘ is different. However, if we choose k = | inEq. (20), then b 7 = (|, — 1) 
and the two particular solutions agree. 
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Variation of Parameters. Now let us turn to more general problems in which the coef- 
ficient matrix is not constant or not diagonalizable. Let 

x' = P(f)x + g (t), (22) 

where P(t) and g(f) are continuous on a < t < ft. Assume that a fundamental matrix 
ty(t) for the corresponding homogeneous system 

x' = P(f)x (23) 

has been found. We use the method of variation of parameters to construct a partic- 
ular solution, and hence the general solution, of the nonhomogeneous system (22). 

Since the general solution of the homogeneous system (23) is ^(f)c, it is natural to 
proceed as in Section 3.6 and to seek a solution of the nonhomogeneous system (22) 
by replacing the constant vector c by a vector function u(f). Thus we assume that 

x = ¥(f)u(0, (24) 

where u(f) is a vector to be found. Upon differentiating x as given by Eq. (24) and 
requiring that Eq. (22) be satisfied, we obtain 

*'(f)u(0 + *(0u'(0 = P(0*(0u(0 + g(0. (25) 

Since 'k(f) is a fundamental matrix, \U(f) = P (t)'k(Y); hence Eq. (25) reduces to 

*(0u'(0 = g(0- (26) 

Recall that 'P(r) is nonsingular on any interval where P is continuous. Hence 1 (f) 
exists, and therefore 

u'(f) = *-*(0g(0. (27) 

Thus for u(t) we can select any vector from the class of vectors that satisfy Eq. (27); 
these vectors are determined only up to an arbitrary additive constant vector; there- 
fore we denote u(f) by 

u (t) = J + c, (28) 

where the constant vector c is arbitrary. If the integrals in Eq. (28) can be evaluated, 
then the general solution of the system (22) is found by substituting for u(t) from 
Eq. (28) in Eq. (24). However, even if the integrals cannot be evaluated, we can still 
write the general solution of Eq. (22) in the form 

x = ty(t)c + 4f(f) f 4* 1 Cv)g(.v) ds, (29) 

'h 

where t\ is any point in the interval (a.P). Observe that the first term on the right 
side of Eq. (29) is the general solution of the corresponding homogeneous system 
(23), and the second term is a particular solution of Eq. (22). 

Now let us consider the initial value problem consisting of the differential equation 
(22) and the initial condition 


x(f 0 ) = x°. 


(30) 


7.9 Nonhomogeneous Linear Systems 


437 


We can find the solution of this problem most conveniently if we choose the lower 
limit of integration in Eq. (29) to be the initial point to- Then the general solution of 
the differential equation is 

x = (t)c + (f) f (s)g(s) ds. (31) 

Jto 

For t = to the integral in Eq. (31) is zero, so the initial condition (30) is also satisfied 
if we choose 

c = ^-\t 0 )x°. (32) 

Therefore 

X = ^(t)^»' 1 (to)x° + V(t) [ ^ _1 (5)g(5) ds (33) 

Jto 

is the solution of the given initial value problem. Again, although it is helpful to use 
1 to write the solutions (29) and (33), it is usually better in particular cases to solve 
the necessary equations by row reduction than to calculate IP -1 and substitute into 
Eqs. (29) and (33). 

The solution (33) takes a slightly simpler form if we use the fundamental matrix 
<I> (t) satisfying <I>(to) = I. In this case we have 

x = <&(f)x° + $(f) f <l> _1 (s)g(s) ds. (34) 

J to 

Equation (34) can be simplified further if the coefficient matrix P(f) is a constant 
matrix (see Problem 17). 


EXAMPLE 


Use the method of variation of parameters to find the general solution of the system 


1 -2 


x = \ 2 llx+ ( 2 ! I =Ax + g(f). 


3f 


(35) 


This is the same system of equations as in Examples 1 and 2. 

The general solution of the corresponding homogeneous system was given in Eq. (10). Thus 


♦(0 = 


(36) 


is a fundamental matrix. Then the solution x of Eq. (35) is given by x = (r)u(f), where u(t) 

satisfies >k(r)u'(r) = g(f),or 


m(\ / 2e 


3 1 


( 37 ) 


Solving Eq. (37) by row reduction, we obtain 


u\ = e 2 ‘ — | te 3 ', 
u' ? = 1 + he' . 
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Hence 

Mi (f) = \e 2t - \te 3 ' + ge 3t + ci, 
m 2 (0 = t + \te‘ - \ef + c 2 , 

and 

x = <i<(f)u(r) 



which is the same as the solution obtained previously. 

Laplace Transforms. We used the Laplace transform in Chapter 6 to solve linear equa- 
tions of arbitrary order. It can also be used in very much the same way to solve 
systems of equations. Since the transform is an integral, the transform of a vector is 
computed component by component. Thus £{x(t)} is the vector whose components 
are the transforms of the respective components of x(t), and similarly for £{x'(f)}. 
We will denote £{x(f)} by X(s). Then, by an extension of Theorem 6.2.1 to vectors, 
we also have 

£{x'(f)} = sX(s) — x(0). (39) 


EXAMPLE 

4 


Use the method of Laplace transforms to solve the system 

x '=(^ -2) X+ ( 2 3t) =Ax + 8(/) - (40) 

This is the same system of equations as in Examples 1 through 3. 

We take the Laplace transform of each term in Eq. (40), obtaining 

sX(s) — x(0) = AX(j) + G(s), (41) 

where G(s) is the transform of g(f). The transform G(s) is given by 


r ,, 2/(s + l) 

G(S)={ 3/s 2 


(42) 


We will simplify the remaining calculations by assuming that x(t) satisfies the initial condition 
x(0) = 0. Then Eq. (41) becomes 


(.vl — A)X(s) = G(i), 


(43) 


where, as usual, I is the identity matrix. Consequently, X(.v) is given by 

X(s) = (si — A) -1 G(s). (44) 

The matrix (si — A) -1 is called the transfer matrix because multiplying it by the transform of 
the input vector g (?) yields the transform of the output vector x(f). In this example we have 


si — A = 




(45) 
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and by a straightforward calculation we obtain 


(si - A)- 1 


1 fs + 2 
{s + !)(■£ + 3) y 1 



(46) 


Then, substituting from Eqs. (42) and (46) in Eq. (44) and carrying out the indicated multipli- 
cation, we find that 


X(s) = 


/ 2(5 + 2 ) 

(5 +1) 2 (5 + 3) 

2 

V(5+1) 2 (5 + 3) 


+ 

+ 


3 \ 

5 2 (5+ l)(5 + 3) 
3(5 + 2) 


5 2 (5+ 1)(5 + 3)/ 


(47) 


Finally, we need to obtain the solution \(t) from its transform X(5). This can be done by 
expanding the expressions in Eq. (47) in partial fractions and using Table 6.2.1, or (more 
efficiently) by using appropriate computer software. In any case, after some simplification the 
result is 



Equation (48) gives the particular solution of the system (40) that satisfies the initial condition 
x(0) = 0. As a result, it differs slightly from the particular solutions obtained in the preceding 
three examples. To obtain the general solution of Eq. (40), you must add to the expression in 
Eq. (48) the general solution (10) of the homogeneous system corresponding to Eq. (40). 


Each of the methods for solving nonhomogeneous equations has some advantages 
and disadvantages. The method of undetermined coefficients requires no integra- 
tion, but it is limited in scope and may entail the solution of several sets of algebraic 
equations. The method of diagonalization requires finding the inverse of the trans- 
formation matrix and the solution of a set of uncoupled first order linear equations, 
followed by a matrix multiplication. Its main advantage is that for Hermitian coeffi- 
cient matrices, the inverse of the transformation matrix can be written down without 
calculation — a feature that is more important for large systems. The method of 
Laplace transforms involves a matrix inversion to find the transfer matrix, followed 
by a multiplication, and finally by the determination of the inverse transform of each 
term in the resulting expression. It is particularly useful in problems with forcing 
functions that involve discontinuous or impulsive terms. Variation of parameters is 
the most general method. On the other hand, it involves the solution of a set of 
linear algebraic equations with variable coefficients, followed by an integration and 
a matrix multiplication, so it may also be the most complicated from a computational 
viewpoint. For many small systems with constant coefficients, such as the one in 
the examples in this section, all of these methods work well, and there may be little 
reason to select one over another. 


PROBLEMS 


In each of Problems 1 through 12 find the general solution of the given system of equations. 




x + 



2. x' = 
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1 : Ih rZ 


\ 1 \ ie 

4 - x = 1 4 _ 2 > x + 


,-2f 

— 2e ! 


5 - X ' = U I4 X+ U 


t > 0 


/-I 


6. x' = | 4 2 x + ( f , 

2 -l/ V 2f— 1 + 4/ 


7 *=\\ ! x+ u) e ‘ 


t > 0 


8. X ' = r x + j). 


5 3 

'4 4 


9 - x '" 1 ! -i) x+ u ; 

11. x' = l~ ~% + (° 

\ 1 -2/ Icos? 


;2 — 5\ /cscf 

12. x' = x + 

,1 -2/ l sec? 


10. x' = 


-3 V2 

V2 -2 


-1 


0 < t < n 


n 

— < t < it 
2 


13. The electric circuit shown in Figure 7.9. 1 is described by the system of differential equations 


dx 

dt 




(i) 


where x\ is the current through the inductor, X 2 is the voltage drop across the capacitor, 
and 7(f) is the current supplied by the external source. 


L = 8 henrys 



FIGURE 7.9.1 The circuit in Problem 13. 


(a) Determine a fundamental matrix (r) for the homogeneous system corresponding to 
Eq. (i). Refer to Problem 25 of Section 7.6. 

(b) If 7(f) = e -f/2 , determine the solution of the system (i) that also satisfies the initial 
conditions x(0) = 0. 


In each of Problems 14 and 15 verify that the given vector is the general solution of the 
corresponding homogeneous system, and then solve the nonhomogeneous system. Assume 
that f > 0. 
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16. Let x = <j>(t) be the general solution of x' = P(f)x + g (t), and let x = v(f) be some 
particular solution of the same system. By considering the difference 0(f) — v(f), show 
that 0(f) = u(f) + v(f), where u(f) is the general solution of the homogeneous system 
x' = P (f)x. 

17. Consider the initial value problem 

x' = Ax + g (t), x(0) = x°. 


(a) By referring to Problem 15(c) in Section 7.7, show that 


/' 


X = 4>(t)x u + $ (t - s)g(s) ds. 


(b) Show also that 


f 


x = exp(Ar)x u + / exp[A(r — s)]g(i) ds. 


Compare these results with those of Problem 27 in Section 3.6. 
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Up to this point we have discussed methods for solving differential equations by using 
analytical techniques such as integration or series expansions. Usually, the emphasis 
was on finding an exact expression for the solution. Unfortunately, there are many 
important problems in engineering and science, especially nonlinear ones, to which 
these methods either do not apply or are very complicated to use. In this chapter 
we discuss an alternative approach, the use of numerical approximation methods to 
obtain an accurate approximation to the solution of an initial value problem. We 
present these methods in the simplest possible context, namely, a single scalar first 
order equation. However, they can readily be extended to systems of first order 
equations, and this is outlined briefly in Section 8.6. The procedures described here 
can be executed easily on personal computers. 


8.1 The Euler or Tangent Line Method 


To discuss the development and use of numerical approximation procedures, we 
will concentrate mainly on the first order initial value problem consisting of the 
differential equation 



( 1 ) 


and the initial condition 


( 2 ) 


y(fo) = yo- 


We assume that the functions/ and/ v are continuous on some rectangle in the fy-plane 
containing the point (to,yo)- Then, by Theorem 2.4.2, there exists a unique solution 
y = (p(t) of the given problem in some interval about to. If Eq. (1) is nonlinear, then 
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the interval of existence of the solution may be difficult to determine and may have 
no simple relationship to the function/. However, in all our discussions we assume 
that there is a unique solution of the initial value problem (1), (2) in the interval of 
interest. 

In Section 2.7 we described the oldest and simplest numerical approximation 
method, namely, the Euler or tangent line method. This method is expressed by 
the equation 

y«+i = y« + f (fiu y n) i.tn +1 fit)> n = 0,1,2, .... (3) 

If the step size has a uniform value h and if we denote f(t„,y n ) by /„, then Eq. (3) 
simplifies to 

y n +\ =y n +fnh, n= 0,1,2,.... (4) 

Euler’s method consists of repeatedly evaluating Eq. (3) or (4), using the result of 
each step to execute the next step. In this way you obtain a sequence of values 
yo,yi,yi, ■ ■ ■ ,yn, ■ ■ ■ that approximate the values of the solution cp(t) at the points 
to rii . ri , . . . , t n , . . . . 

In Section 2.7 we pointed out that a computer program for Euler’s method has 
the structure given below. The specific instructions can be written in any convenient 
programming language. 


The Euler Method 

Step 1. 

define /(t,y) 

Step 2. 

input initial values fO and yO 

Step 3. 

input step size h and number of steps n 

Step 4. 

output fO and yO 

Step 5. 

for j from 1 to n do 

Step 6. 

?r 

h-* 

II 


y — y + h * k 1 


t — t + h 

Step 7. 

output t and y 

Step 8. 

end 


Some examples of Euler’s method appear in Section 2.7. As another example, 
consider the initial value problem 

y' = 1 — t + 4y, (5) 

y (0) = l. (6) 

Equation (5) is a first order linear equation, and it is easily verified that the solution 
satisfying the initial condition (6) is 

y = m =\t-l b + f 6 e*'. (7) 

Since the exact solution is known, we do not need numerical methods to approximate 
the solution of the initial value problem (5), (6). On the other hand, the availability of 
the exact solution makes it easy to determine the accuracy of any numerical procedure 
that we use on this problem. We will use this problem throughout the chapter to 
illustrate and compare different numerical methods. The solutions of Eq. (5) diverge 
rather rapidly from each other, so we should expect that it will be fairly difficult to 
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approximate the solution (7) well over any considerable interval. Indeed, this is the 
reason for choosing this particular problem; it will be relatively easy to observe the 
benefits of using more efficient methods. 


Using the Euler formula (4) and step sizes h = 0.05, 0.025, 0.01, and 0.001, determine approx- 
imate values of the solution y — <f>(t) of the problem (5), (6) on the interval 0 < t <2. 

The indicated calculations were carried out on a computer, and some of the results are shown 
in Table 8.1.1. Their accuracy is not particularly impressive. For /7 = 0.01 the percentage error 
is 3.85% at f = 0.5, 7.49% at t = 1.0, and 14.4% at r = 2.0. The corresponding percentage 
errors for h = 0.001 are 0.40%, 0.79%, and 1.58%, respectively. Observe that if h = 0.001, 
then it requires 2000 steps to traverse the interval from t = 0 to t = 2. Thus considerable 
computation is needed to obtain even reasonably good accuracy for this problem using the 
Euler method. When we discuss other numerical approximation methods later in this chapter, 
we will find that it is possible to obtain comparable or better accuracy with much larger step 
sizes and many fewer computational steps. 


TABLE 8.1.1 A Comparison of Results for the Numerical Approximation of the Solution 
of y = 1 — t + 4y,y(0) = 1 Using the Euler Method for Different Step Sizes h 


t 

h = 0.05 

h = 0.025 

h = 0.01 

h = 0.001 

Exact 

0.0 

1.0000000 

1.0000000 

1.0000000 

1.0000000 

1.0000000 

0.1 

1.5475000 

1.5761188 

1.5952901 

1.6076289 

1.6090418 

0.2 

2.3249000 

2.4080117 

2.4644587 

2.5011159 

2.5053299 

0.3 

3.4333560 

3.6143837 

3.7390345 

3.8207130 

3.8301388 

0.4 

5.0185326 

5.3690304 

5.6137120 

5.7754845 

5.7942260 

0.5 

7.2901870 

7.9264062 

8.3766865 

8.6770692 

8.7120041 

1.0 

45.588400 

53.807866 

60.037126 

64.382558 

64.897803 

1.5 

282.07187 

361.75945 

426.40818 

473.55979 

479.25919 

2.0 

1745.6662 

2432.7878 

3029.3279 

3484.1608 

3540.2001 


To begin to investigate the errors in using numerical approximations, and also to 
suggest ways to construct more accurate algorithms, it is helpful to mention some 
alternative ways to look at the Euler method. 

First, let us write the differential equation (1) at the point t = t n in the form 

^ ft.) =/[*», * ft.)]- (8) 


Then we approximate the derivative in Eq. (8) by the corresponding (forward) dif- 
ference quotient, obtaining 


0(/«+ 1 ) 0(f«) 

hi+l t n 


= f[tn,<P(t n )]. 


( 9 ) 


Finally, if we replace <p{t n+ \) and <j>(t n ) by their approximate values y n+ 1 and y n , 
respectively, and solve for y n+ \ , we obtain the Euler formula (3). 

Another way to proceed is to write the problem as an integral equation. Since 
y = (p(t) is a solution of the initial value problem (1), (2), by integrating from t n to 
/„+i, we obtain 
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or 



4>' (t) dt 





<P(t„+i) =<p(tn)+ l f[t,<p(t)]dt. (10) 

Jt n 

The integral in Eq. (10) is represented geometrically as the area under the curve in 
Figure 8.1.1 between t — t„ and t = t n+ \ . If we approximate the integral by replacing 
f[t,(p(t)] by its value f[t n . </>(?„)] at t = t n , then we are approximating the actual area 
by the area of the shaded rectangle. In this way we obtain 

0 (h; | 1 ) = 0(t/i) T" f\t n -,4 > {tn)\(tn+ 1 hi) 

= <j)(t n ) + hf[t n ,<p(t n )]. (11) 

Finally, to obtain an approximation y n+ \ for <p (t n+ \), we make a second approximation 
by replacing <p ( t n ) by its approximate value y n in Eq. (11). This gives the Euler formula 
y n+1 = y n + hf(t n , y„) . A more accurate algorithm can be obtained by approximating 
the integral more accurately. This is discussed in Section 8.2. 


f[t n , <j>(t n ) ] 


FIGURE 8.1.1 


y' = f[t, pit)] 


Integral derivation of the Euler method. 


A third approach is to assume that the solution y = 4>(t) has a Taylor series about 
the point t n . Then 

h 2 

<p(t n + h) — 4>{t n ) + <p'(t n )h + 4>" (t n )— + • ■ • , 
or 

h~ 

<P(tn+ 1) = 0(hi) +/[h;. 0(fn)]^ + + ■ ■ ■ . (12) 

If the series is terminated after the first two terms, and <p(t n+ \) and <p(r n ) are replaced 
by their approximate values y n+ \ and y n , we again obtain the Euler formula (4). If 
more terms in the series are retained, a more accurate formula is obtained. Further, 
by using a Taylor series with a remainder, it is possible to estimate the magnitude of 
the error in the formula. This is discussed later in this section. 

The Backward Euler Formula. A variation on the Euler formula can be obtained by 
approximating the derivative in Eq. (8) by the backward difference quotient 
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[</)(/„) — 0(f„_i)]//; instead of the forward difference quotient used in Eq. (9). In 
this way we obtain 

(p{tn) (fiifn— l) — hf (t n ,y n ). 


or 

y n = y n - 1 + hf(tn,y n ). 

Stepping the index up from n to n + 1, we obtain the backward Euler formula 

y n + 1 = y n + ¥(Wid»+i)' (13) 

Assuming that y n is known and y n+ 1 is to be calculated, observe that Eq. (13) does not 
provide an explicit formula for y n +i- Rather, it is an equation that implicitly defines 
y n+ i and must be solved to determine the value of y n +i- How difficult this is depends 
entirely on the nature of the function /. 


Use the backward Euler formula (13) and step sizes h = 0.05, 0.025, 0.01, and 0.001 to find 
approximate values of the solution of the initial value problem (5), (6) on the interval 0 < t < 2. 
For this problem the backward Euler formula (13) becomes 

y n +i — yn+h( l — f„ + i + 4v„ + i). 

We will show the first two steps in detail so that it will be clear how the method works. At the 
first step we have 

yi = yo + h{ 1 - h + 4yi) = 1 + (0.05)(1 - 0.05 + 4yi). 

Solving this equation for yi, we obtain 

yi = 1.0475/0.8 = 1.309375. 

Observe that because the differential equation is linear, the implicit equation for y\ is also 
linear and therefore easy to solve. Next, 

yi = yi + h( 1 -t 2 + 4 y 2 ) = 1.309375 + (0.05)(1 - 0.1 + 4y 2 ), 

which leads to 

y 2 = 1.354375/0.8 = 1.69296875. 

Continuing the computations on a computer, we obtain the results shown in Table 8.1.2. The 
values given by the backward Euler method are uniformly too large for this problem, whereas 


TABLE 8.1.2 A Comparison of Results for the Numerical Approximation of the Solution of 
y' = 1 — t + 4y,y(0) = 1 Using the Backward Euler Method for Different Step Sizes h 


t 

h = 0.05 

h = 0.025 

h = 0.01 

h = 0.001 

Exact 

0 

1.0000000 

1.0000000 

1.0000000 

1.0000000 

1.0000000 

0.1 

1.6929688 

1.6474375 

1.6236638 

1.6104634 

1.6090418 

0.2 

2.7616699 

2.6211306 

2.5491368 

2.5095731 

2.5053299 

0.3 

4.4174530 

4.0920886 

3.9285724 

3.8396379 

3.8301388 

0.4 

6.9905516 

6.3209569 

5.9908303 

5.8131282 

5.7942260 

0.5 

10.996956 

9.7050002 

9.0801473 

8.7472667 

8.7120041 

1.0 

103.06171 

80.402761 

70.452395 

65.419964 

64.897803 

1.5 

959.44236 

661.00731 

542.12432 

485.05825 

479.25919 

2.0 

8934.0696 

5435.7294 

4172.7228 

3597.4478 

3540.2001 
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the values obtained from the Euler method were too small. In this problem the errors are 
somewhat larger for the backward Euler method than for the Euler method, although for 
small values of h the differences are insignificant. Since the backward Euler method appears 
to be no more accurate than the Euler method, and is somewhat more complicated, a natural 
question is why it should even be mentioned. The answer is that it is the simplest example of a 
class of methods known as backward differentiation formulas that are very useful for certain 
types of differential equations. We will return to this issue later in this chapter. 


Errors in Numerical Approximations. The use of a numerical procedure, such as the Eu- 
ler formula, on an initial value problem raises a number of questions that must be 
answered before the approximate numerical solution can be accepted as satisfactory. 
One of these is the question of convergence. That is, as the step size h tends to zero, 
do the values of the numerical approximation y\,yi, ■ ■ ■ , ■ ■ ■ approach the corre- 
sponding values of the actual solution? If we assume that the answer is affirmative, 
there remains the important practical question of how rapidly the numerical approx- 
imation converges to the solution. In other words, how small a step size is needed 
in order to guarantee a given level of accuracy? We want to use a step size that is 
small enough to ensure the required accuracy, but not too small. An unnecessarily 
small step size slows down the calculations and in some cases may even cause a loss 
of accuracy. 

There are two fundamental sources of error in approximating the solution of an 
initial value problem numerically. Let us first assume that our computer is such that 
we can carry out all computations with complete accuracy; that is, we can retain an 
infinite number of decimal places. The difference E n between the solution y = (j)(t) 
of the initial value problem (1), (2) and its numerical approximation is given by 

E n = <p(t n ) — y„ (14) 

and is known as the global truncation error. It arises from two causes: first, at each 
step we use an approximate formula to determine y n +i; second, the input data at 
each step are only approximately correct since in general <p(t„) is not equal to y„. If 
we assume that y n — <p(t n ), then the only error in going one step is due to the use of 
an approximate formula. This error is known as the local truncation error e n . 

The second fundamental source of error is that we carry out the computations in 
arithmetic with only a finite number of digits. This leads to a round-off error R„ 
defined by 

R n =y„-Y n , (15) 

where Y„ is the value actually computed from the given numerical method. 

The absolute value of the total error in computing </>(/„) is given by 

\<t>(t n) - Y n | = | <j>(t n ) -y„ +y n - Y n \. (16) 

Making use of the triangle inequality, \a + b\ < \a\ + \b\, we obtain, from Eq. (16), 

1 4>{t n ) - Y n | < | </>(t n ) -y n | + | y n - Y n \ 

<\E n \ + \R n \. (17) 

Thus the total error is bounded by the sum of the absolute values of the truncation and 
round-off errors. For the numerical procedures discussed in this book, it is possible 
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to obtain useful estimates of the truncation error. However, we limit our discussion 
primarily to the local truncation error, which is somewhat simpler. The round-off 
error is clearly more random in nature. It depends on the type of computer used, 
the sequence in which the computations are carried out, the method of rounding off, 
and so forth. An analysis of round-off error is beyond the scope of this book, but it 
is possible to say more about it than one might at first expect (see, for example, the 
book by Henrici listed in the references). Some of the dangers from round-off error 
are discussed in Problems 25 through 27 and in Section 8.5. 

Local Truncation Error for the Euler Method. Let us assume that the solution y — 0(f) of 
the initial value problem (1), (2) has a continuous second derivative in the interval 
of interest. To ensure this, we can assume that/,/,, and/) are continuous. Observe 
that iff has these properties and if <p is a solution of the initial value problem (1), (2), 
then 

and, by the chain rule, 

0"(O=/r[O00] +/,[*, 0(*)]0'(O 

=f t [t,m] (is> 

Since the right side of this equation is continuous, 0" is also continuous. 

Then, making use of a Taylor polynomial with a remainder to expand 0 about t n , 
we obtain 

0 (In + h) = <t>{t n ) + <j>\t n )h + \(p"(i n )h 2 , (19) 

where t n is some point in the interval t n < t n < t n + h. Subtracting Eq. (4) from 
Eq. (19), and noting that 0(f„ + h) = 0(f„ + 1 ) and 0'(f„) — f[t n , 0(f„)], we find that 

0(W l) - y n + 1 = [0(0 - y n ] + h{f[t n ,<p(t n )] -f(t„,y„)} + \cp"(t n )h 2 . (20) 

To compute the local truncation error, we apply Eq. (20) to the true solution 0(f); 
that is, we take y n to be 0(f„). Then we immediately see from Eq. (20) that the local 
truncation error e„+i is 

e«+i = 0(f„+i) - y n + 1 = \<P"(i n )h 2 . (21) 

Thus the local truncation error for the Euler method is proportional to the square 
of the step size h, and the proportionality factor depends on the second derivative 
of the solution 0. The expression given by Eq. (21) depends on n and, in general, is 
different for each step. A uniform bound, valid on an interval [n, b], is given by 

\e n \<Mh 2 /2, (22) 

where M is the maximum of |0"(f)| on the interval [a.b]. Since Eq. (22) is based 
on a consideration of the worst possible case — that is, the largest possible value of 
|0" (01 — it may well be a considerable overestimate of the actual local truncation 
error in some parts of the interval [a, b]. One use of Eq. (22) is to choose a step 
size that will result in a local truncation error no greater than some given tolerance 
level. For example, if the local truncation error must be no greater than e, then from 
Eq. (22) we have 


h < V2 e/M. 


(23) 
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The primary difficulty in using Eq. (21), (22), or (23) lies in estimating \</>"(t) \ or M. 
However, the central fact expressed by these equations is that the local truncation 
error is proportional to h 2 . Thus, if h is reduced by a factor of (. then the error is 
reduced by 1, and so on. 

More important than the local truncation error is the global truncation error E n . 
The analysis for estimating E n is more difficult than that for e„ . However, knowing 
the local truncation error, we can make an intuitive estimate of the global truncation 
error at a fixed T > t 0 as follows. Suppose that we take n steps in going from t 0 to 
T — 1 0 + nh. In each step the error is at most Mh 2 / 2; thus the error in n steps is at 
most nMh 2 /2. Noting that n = (T — to) /h. we find that the global truncation error 
for the Euler method in going from to to T is bounded by 


Mh 2 

U — — 


(T — 1 0 ) 


Mh 

~Y 


(24) 


This argument is not complete since it does not take into account the effect that an 
error at one step will have in succeeding steps. Nevertheless, it can be shown that the 
global truncation error in using the Euler method on a finite interval is no greater 
than a constant times h\ see Problem 23 for more details. The Euler method is called 
a first order method because its global truncation error is proportional to the first 
power of the step size. 

Because it is more accessible, we will hereafter use the local truncation error as our 
principal measure of the accuracy of a numerical method and for comparing different 
methods. If we have a priori information about the solution of the given initial value 
problem, we can use the result (21) to obtain more precise information about how the 
local truncation error varies with t. As an example, consider the illustrative problem 

y ' = l-t + 4y, _y(0) = 1 (25) 


on the interval 0 < t < 2. Let y = <p(t) be the solution of the initial value problem 
(25). Then, as noted previously, 

0(0 = (At — 3 + 19e 4 0/16 


and therefore 

0"(O = 19e 4 '. 


Equation (21) then states that 

l9e Atn h 2 _ 

e n +i = ^ , t n < t n < t n A h. (26) 

The appearance of the factor 19 and the rapid growth of e At explain why the results 
in Table 8.1.1 are not very accurate. 

For instance, for h — 0.05 the error in the first step is 


e\ = <p(tx)-yi 


19e 4f ° (0.0025) 

2 


0 < to < 0.05. 


It is clear that e\ is positive, and since e 4 ' 0 < e 02 , we have 


19e 02 (0.0025) 


0.02901. 


e i < 


2 


(27) 
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Note also that e 4t ° > 1; hence e\ > 19(0.0025)/2 = 0.02375. The actual error is 
0.02542. It follows from Eq. (26) that the error becomes progressively worse with in- 
creasing t \ this is also clearly shown by the results in Table 8.1.1. Similar computations 
for bounds for the local truncation error give 


„ 19e 38 (0.0025) 19e 4 (0.0025) 

1.0617 = < e20 < ^ 

in going from 0.95 to 1.0 and 

19e 78 (0.0025) 19e 8 (0.0025) 

57.96 = < e 40 < 


1.2967 


70.80 


(28) 

(29) 


in going from 1.95 to 2.0. 

These results indicate that, for this problem, the local truncation error is about 2500 
times larger near t = 2 than near t — 0. Thus, to reduce the local truncation error to 
an acceptable level throughout 0 < t < 2, one must choose a step size h based on an 
analysis near t — 2. Of course, this step size will be much smaller than necessary near 
t = 0. For example, to achieve a local truncation error of 0.01 for this problem, we 
need a step size of about 0.00059 near t = 2 and a step size of about 0.032 near t = 0. 
The use of a uniform step size that is smaller than necessary over much of the interval 
results in more calculations than necessary, more time consumed, and possibly more 
danger of unacceptable round-off errors. 

Another approach is to keep the local truncation error approximately constant 
throughout the interval by gradually reducing the step size as t increases. In the 
example problem we would need to reduce h by a factor of about 50 in going from 
t = 0 to t — 2. A method that provides for variations in the step size is called adaptive. 
All modern computer codes for solving differential equations have the capability of 
adjusting the step size as needed. We will return to this question in the next section. 


PROBLEMS In each of Problems 1 through 6 find approximate values of the solution of the given initial 
value problem at t = 0.1, 0.2, 0.3, and 0.4. 

(a) Use the Euler method with h = 0.05. 

(b) Use the Euler method with h = 0.025. 

(c) Use the backward Euler method with h = 0.05. 

(d) Use the backward Euler method with h = 0.025. 


41 

1. yf = 3 + 1 - y. 

y(0) = l 

41 

2. 

y = 5t- 3 Jy, 

y(0) = 2 

41 

3. Y = 2y- 3 1, 

y(0) = l 

41 

4. 

y = 2t + e-®. 

y (0) = l 

41 

5 y'= y2 + 2ty 

3 + 1 2 ’ 

y(0) = 0.5 

41 

6. 

y = ( t 2 — y 2 ) siny. 

y(0) = -l 


In each of Problems 7 through 12 find approximate values of the solution of the given initial 
value problem at t = 0.5, 1.0, 1.5, and 2.0. 

(a) Use the Euler method with h = 0.025. 

(b) Use the Euler method with h = 0.0125. 

(c) Use the backward Euler method with h = 0.025. 

(d) Use the backward Euler method with h = 0.0125. 
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y = 0.5 - t + 2 y, y(0) = 1 4 Ifl 8. / = 5t — 3 Jy, y(0) = 2 

y = Jt + y, y(0) = 3 41 10. y' = 2t + e-‘y, y( 0) = 1 

y = (4-ty)/(l+y 2 ), y( 0) = -2 

y = (y 2 + 2fy)/0 + t 2 ), y(0) = 0.5 

Complete the calculations leading to the entries in columns three and four of Table 8.1.1. 

Complete the calculations leading to the entries in columns three and four of Table 8.1.2. 

Using three terms in the Taylor series given in Eq. (12) and taking h — 0.1, determine 
approximate values of the solution of the illustrative example / = !—( + 4y,y(0) = 1 at 
t = 0.1 and 0.2. Compare the results with those using the Euler method and with the exact 
values. 

Hint: If/ = /(t,y),what isy"? 

In each of Problems 16 and 17 estimate the local truncation error for the Euler method in 
terms of the solution y = Obtain a bound for e„ + i in terms of t and <p(t) that is valid on 
the interval 0 < t < 1. By using a formula for the solution, obtain a more accurate error bound 
for e n+ i. For h = 0.1 compute a bound for e\ and compare it with the actual error at t = 0.1. 
Also compute a bound for the error e 4 in the fourth step. 

16. / = 2y - 1, y(0) = 1 17. / = \ - t + 2 y, y(0) = 1 

In each of Problems 18 through 21 obtain a formula for the local truncation error for the Euler 
method in terms of t and the solution </>. 

18. / = t 2 + y 2 , y(0) = 1 19. / = 5 1- 3//, y(0) = 2 

20. / = v /f + y, y(l) = 3 21. / = 2f + e^, y( 0) = 1 

22. Consider the initial value problem 

y' = cos5jrf, y(0) = 1. 

(a) Determine the solution y = <f>(t) and draw a graph ofy = cp(t) for 0 < t < 1. 

(b) Determine approximate values of <j>{t) at t = 0.2, 0.4, and 0.6 using the Euler method 
with h = 0.2. Draw a broken-line graph for the approximate solution and compare it with 
the graph of the exact solution. 

(c) Repeat the computation of part (b) for 0 <t< 0.4, but take h = 0.1. 

(d) Show by computing the local truncation error that neither of these step sizes is suffi- 
ciently small. Determine a value of h to ensure that the local truncation error is less than 
0.05 throughout the interval 0 < t < 1. That such a small value of h is required results 
from the fact that max \<p"(t)\ is large. 

23. In this problem we discuss the global truncation error associated with the Euler method 
for the initial value problem y' = /(f,y), y(to) = yo- Assuming that the functions / and 
f y are continuous in a closed, bounded region R of the fy-plane that includes the point 
(fo,yo),it can be shown that there exists a constant L such that |/(t,y) — f(t,y \ < L\y — y|, 
where (t,y) and (t,y) are any two points in R with the same t coordinate (see Problem 15 
of Section 2.8). Further, we assume that f, is continuous, so the solution <p has a continuous 
second derivative. 


4fl 

7. 

41 

9. 

41 

11. 

41 

12. 

41 

13. 

41 

14. 


15. 
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(a) Using Eq. (20), show that 

l-En+il < \E„\ + h\f[t n ,4>(t n )] -f(t n ,y„)\ + \h 2 \<t>"{t„)\ < a\E n \ + ph 2 , (i) 


where a = 1 + hL and p = max \<j>"(t)\/2 on to < t < t n . 

(b) Accepting without proof that if Eq = 0, and if \E n \ satisfies Eq. (i), then 
\E n \ < ph 2 (ct n — 1 )/(a — 1) for a y 1, show that 


(1 + hL) n - 1 , 
\E n \ < ^ ph. 


(ii) 


Equation (ii) gives a bound for \E„ \ in terms of h,L,n, and ft. Notice that for a fixed/;, this 
error bound increases with increasing «; that is, the error bound increases with distance 
from the starting point t 0 . 

(c) Show that (1 + hL) n < e nhL ; hence 


g nhL _ ^ e (t n -t 0 )L _ j 

\E„\ < — j- — P h = — ph. 


For a fixed point T = t 0 + nh [that is, nh is constant and h = (T — t 0 )/n], this error 
bound is of the form of a constant times h and approaches zero as h — > 0. Also note 
that for nhL = {T — to)L small, the right side of the preceding equation is approximately 
nh 2 p = (T — to)Ph, which was obtained in Eq. (24) by an intuitive argument. 

24. Derive an expression analogous to Eq. (21 ) for the local truncation error for the backward 
Euler formula. 

Hint: Construct a suitable Taylor approximation to <p(t ) about t = t n+1 . 

25. Using a step size h = 0.05 and the Euler method, but retaining only three digits throughout 
the computations, determine approximate values of the solution at t = 0.1, 0.2, 0.3, and 
0.4 for each of the following initial value problems: 

(a) / = !-/ + 4 y, y(0) = 1 

(b) / = 3 + t - y, y(0) = 1 

(c) y = 2y- 3 1, y(0) = 1 

Compare the results with those obtained in Example 1 and in Problems 1 and 3. The 
small differences between some of those results rounded to three digits and the present 
results are due to round-off error. The round-off error would become important if the 
computation required many steps. 

26. The following problem illustrates a danger that occurs because of round-off error when 
nearly equal numbers are subtracted and the difference is then multiplied by a large 
number. Evaluate the quantity 


6.010 


2.004 


18.04 

6.000 


in the following ways: 

(a) First round each entry in the determinant to two digits. 

(b) First round each entry in the determinant to three digits. 

(c) Retain all four digits. Compare this value with the results in parts (a) and (b). 

27. The distributive law a(b — c) = ab — ac does not hold, in general, if the products are 
rounded off to a smaller number of digits. To show this in a specific case, take a = 0.22, 
b = 3.19, and c = 2.17. After each multiplication, round off the last digit. 
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8.2 Improvements on the Euler Method 


Since for many problems the Euler method requires a very small step size to produce 
sufficiently accurate results, much effort has been devoted to the development of 
more efficient methods. In the next three sections we will discuss some of these 
methods. Consider the initial value problem 

y'=f(t,y ), y(to)=yo (l) 


and let y — <p(t) denote its solution. Recall from Eq. (10) of Section 8.1 that by 
integrating the given differential equation from t n to t n+ \, we obtain 

+ [ f[t,4>(t)i dt. (2) 

Jtn 


The Euler formula 


y n + 1 = y n + hf(t n ,y„) 


( 3 ) 


is obtained by replacing f[t,<p(t)] in Eq. (2) by its approximate value f(t n ,y n ) at the 
left endpoint of the interval of integration. 


Improved Euler Formula. A better approximate formula can be obtained if the 
integrand in Eq. (2) is approximated more accurately. One way to do this is to 
replace the integrand by the average of its values at the two endpoints, namely, 
{f[t n ,<l>(t n )] + f[t n+ i,(p{t n+ i)]}/ 2 . This is equivalent to approximating the area un- 
der the curve in Figure 8.2.1 between t = t n and t = t n+ 1 by the area of the shaded 
trapezoid. Further, we replace <p(t n ) and <j>(t n+ 1 ) by their respective approximate 
values y n and y n+ \ . In this way we obtain, from Eq. (2), 

f (t n ,y n ) f (t n +i,y n +\) . 

y n + 1 =y n + 2 h • ( 4 ) 

Since the unknown y n+ i appears as one of the arguments of / on the right side of 
Eq. (4), this equation defines y n+ \ implicitly rather than explicitly. Depending on 
the nature of the function /, it may be fairly difficult to solve Eq. (4) for y n+ \ . This 
difficulty can be overcome by replacing y n+ \ on the right side of Eq. (4) by the value 
obtained using the Euler formula (3). Thus 

, f(t n ,y n ) +f[t n + h,y n + hf(t n ,y n )] 
y n + 1 = y n + ^ h 


, fn+f(t„ + h,y n + hf n ) 

= y» + 2 h ^ 

where t n+ 1 has been replaced by t n + h. 

Equation (5) gives an explicit formula for computing y n+ \, the approximate value 
of <j){t n+ \), in terms of the data at t n . This formula is known as the improved Euler 
formula or the Heun 1 formula. The improved Euler formula is an example of a 
two-stage method; that is, we first calculate y„ + hf n from the Euler formula and then 
use this result to calculate y n + i from Eq. (5). The improved Euler formula (5) does 


lr The formula is named for Karl Heun (1859-1929), a professor at the Technical University of Karlsruhe. 
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EXAMPLE 

1 


y 

y 

= f[t, 0(t)] 

ftt n +iMt n+ 1)] 



f[t n ,<t>{t n )] 

1 

! t 

\{f[t n Mt n )U 



1 , 


tn t n+ i t 


FIGURE 8.2.1 Derivation of the improved Euler method. 


represent an improvement over the Euler formula (3) because the local truncation 
error in using Eq. (5) is proportional to h 3 , while for the Euler method it is propor- 
tional to h 1 . This error estimate for the improved Euler formula is established in 
Problem 14. It can also be shown that for a finite interval the global truncation error 
for the improved Euler formula is bounded by a constant times h 2 , so this method is 
a second order method. Note that this greater accuracy is achieved at the expense of 
more computational work, since it is now necessary to evaluate f(t,y) twice in order 
to go from t n to t n+ \. 

depends only on t and not on y, then solving the differential equation 
y' — f(t,y) reduces to integrating /(f). In this case the improved Euler formula (5) 
becomes 

h 

y n + 1 -y n = +f(tn + h)], (6) 

which is just the trapezoid rule for numerical integration. 


Use the improved Euler formula (5) to calculate approximate values of the solution of the 
initial value problem 

y' — 1 — t + 4y, y(0) = 1. (7) 

To make clear exactly what computations are required, we show a couple of steps in detail. 
For this problem f(t,y) = 1 — t + 4y; hence 


fn — 1 T 4 y n 


and 

/ (t n + h,y„ + hfn ) = 1 — (t n + h) + 4 (y„ + hf n ). 

Further, to = 0, yo = 1, and/o = 1 — to + 4yo = 5. If h = 0.025, then 

/(to + h,y 0 + hfo) = 1 - 0.025 + 4[1 + (0.025)(5)] = 5.475. 

Then, from Eq. (5), 

>>i = l + (0.5) (5 + 5. 475) (0.025) = 1.1309375. (8) 

At the second step we must calculate 

/, = 1 - 0.025 + 4(1.1309375) = 5.49875, 
y x + hf\ = 1.1309375 + (0.025) (5.49875) = 1.26840625, 
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and 

f(t 2 ,yi + hf\) = 1 - 0.05 + 4(1.26840625) = 6.023625. 

Then, from Eq. (5), 

y 2 = 1.1309375 + (0.5) (5.49875 + 6.023625) (0.025) = 1.2749671875. (9) 

Further results for 0 < t < 2 obtained by using the improved Euler method with h — 0.025 
and h = 0.01 are given in Table 8.2.1. To compare the results of the improved Euler method 
with those of the Euler method, note that the improved Euler method requires two evaluations 
of / at each step, while the Euler method requires only one. This is significant because typically 
most of the computing time in each step is spent in evaluating/, so counting these evaluations 
is a reasonable way to estimate the total computing effort. Thus, for a given step size h, the 
improved Euler method requires twice as many evaluations of / as the Euler method. Alter- 
natively, the improved Euler method for step size h requires the same number of evaluations 
of / as the Euler method with step size h/2. 


TABLE 8.2.1 A Comparison of Results Using the Euler and Improved Euler Methods for 
the Initial Value Problem / = 1 — t + 4 y, y(0) = 1 


t 

Euler 

Improved Euler 


h = 0.01 

h = 0.001 

h = 0.025 

h = 0.01 

Exact 

0 

1.0000000 

1.0000000 

1.0000000 

1.0000000 

1.0000000 

0.1 

1.5952901 

1.6076289 

1.6079462 

1.6088585 

1.6090418 

0.2 

2.4644587 

2.5011159 

2.5020618 

2.5047827 

2.5053299 

0.3 

3.7390345 

3.8207130 

3.8228282 

3.8289146 

3.8301388 

0.4 

5.6137120 

5.7754845 

5.7796888 

5.7917911 

5.7942260 

0.5 

8.3766865 

8.6770692 

8.6849039 

8.7074637 

8.7120041 

1.0 

60.037126 

64.382558 

64.497931 

64.830722 

64.897803 

1.5 

426.40818 

473.55979 

474.83402 

478.51588 

479.25919 

2.0 

3029.3279 

3484.1608 

3496.6702 

3532.8789 

3540.2001 


By referring to Table 8.2.1, you can see that the improved Euler method with h = 0.025 gives 
much better results than the Euler method with h = 0.01. Note that to reach t = 2 with these 
step sizes, the improved Euler method requires 160 evaluations of/, while the Euler method 
requires 200. More noteworthy is that the improved Euler method with h = 0.025 is also 
slightly more accurate than the Euler method with h = 0.001 (2000 evaluations of/). In other 
words, with something like one-twelfth of the computing effort, the improved Euler method 
yields results for this problem that are comparable to, or a bit better than, those generated by 
the Euler method. This illustrates that, compared to the Euler method, the improved Euler 
method is clearly more efficient, yielding substantially better results or requiring much less 
total computing effort, or both. 

The percentage errors at t = 2 for the improved Euler method are 1.23% for h — 0.025 and 
0.21% for h = 0.01. 


A computer program for the Euler method can be readily modified to implement 
the improved Euler method instead. All that is required is to replace Step 6 in the 
algorithm in Section 8.1 by the following: 
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Step 6. kl—f(t,y ) 

k2 = f(t + h.y + h * kl ) 
y = y + (/i/2) * (kl + k2) 
t = t + h 

Variation of Step Size. In Section 8.1 we mentioned the possibility of adjusting the step 
size as a calculation proceeds so as to maintain the local truncation error at a more 
or less constant level. The goal is to use no more steps than necessary and, at the 
same time, to keep some control over the accuracy of the approximation. Here we 
will describe how this can be done. Suppose that after n steps we have reached the 
point ( t n ,y n ). We choose a step size h and calculate y n+ \. Next we need to estimate 
the error we have made in calculating y n+ \ . Not knowing the actual solution, the best 
that we can do is to use a more accurate method and repeat the calculation starting 
from ( t n ,y n )■ For example, if we used the Euler method for the original calculation, 
we might repeat it with the improved Euler method. Then the difference between the 
two calculated values is an estimate of the error in using the original method. 
If the estimated error is different from the error tolerance e, then we adjust the 
step size and repeat the calculation. The key to making this adjustment efficiently is 
knowing how the local truncation error e n+ \ depends on the step size h. For the Euler 
method the local truncation error is proportional to h 2 , so to bring the estimated error 
down (or up) to the tolerance level e, we must multiply the original step size by the 
factor s/e/e^l v 

To illustrate this procedure, consider the example problem (7) 


/ = 1 — t + 4y, y( 0) = 1. 


You can verify that after one step with h = 0.1 we obtain the values 1.5 and 1.595 
from the Euler method and the improved Euler method, respectively. Thus the 
estimated error in using the Euler method is 0.095. If we have chosen an error 
tolerance of 0.05, for instance, then we need to adjust the step size downward by the 
factor ^/0. 05/0. 095 = 0.73. Rounding downward to be conservative, let us choose 
the adjusted step size h = 0.07. Then, from the Euler formula, we obtain 


yi = l + (0.07)/ (0, 1) = 1.35 = 0(0.07). 


Using the improved Euler method, we obtain y i = 1.39655, so the estimated error in 
using the Euler formula is 0.04655, which is slightly less than the specified tolerance. 
The actual error, based on a comparison with the solution itself, is somewhat greater, 
namely, 0.05122. 

We can follow the same procedure at each step of the calculation, thereby keeping 
the local truncation error approximately constant throughout the entire numerical 
process. Modern adaptive codes for solving differential equations adjust the step 
size as they proceed in very much this way, although they usually use more accurate 
formulas than the Euler and improved Euler formulas. Consequently, they are able 
to achieve both efficiency and accuracy by using very small steps only where they are 
really needed. 
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PROBLEMS In each of Problems 1 through 6 find approximate values of the solution of the given initial 
value problem at t = 0.1, 0.2, 0.3, and 0.4. Compare the results with those obtained by the 
Euler method and the backward Euler method in Section 8.1 and with the exact solution (if 



available). 




(a) Use the improved Euler method 

with h = 0.05. 


(b) Use the improved Euler method 

with h = 0.025. 


(c) Use the improved Euler method 

with h = 0.0125. 

41 

1 

+ 

CO 

II 

X 

r— 1 

y(0) = 1 

2. / = 5t- 3 Jy, 

41 

3. y = 2y- 3 1 , 

y(0) = i 

4. y' = 2t + e~' y , 

41 

e y 2 + 2 ty 

3 4 -t 2 ’ 

y (0) = 0.5 

6. / = (t 2 — y 2 ) siny, 


y(0) = 2 
y(0) = 1 


v(0) = -1 


In each of Problems 7 through 12 find approximate values of the solution of the given initial 

value problem at t = 0.5, 1.0, 1.5, and 2.0. 

(a) Use the improved Euler method with h = 0.025. 

(b) Use the improved Euler method with h = 0.0125. 

4fl 7. / = 0.5 — t + 2y, y(0) = 1 8. / = 5t — 3 Jy, y(0) = 2 

9. / = Jt + y, y(0) = 3 (fl 10. ? = 2t + e"*, y(0) = 1 

4fl ii. y = (4 - O') /(I + y 2 ), y(0) = -2 

4fl 12. y = (y 2 + 200/(3 + t 2 ), y(0) = 0.5 

13. Complete the calculations leading to the entries in columns four and five of Table 8.2.1. 

14. In this problem we establish that the local truncation error for the improved Euler for- 
mula is proportional to h 3 . If we assume that the solution 0 of the initial value problem 
y — f(t,y), y(to) = yo has derivatives that are continuous through the third order (f has 
continuous second partial derivatives), then it follows that 


(f)(t n + h) — (f>(t n ) + (f> ( t„)h 4 — — h 


+ 


3! 


h 3 


where t n < t n < t n + h. Assume thaty„ = <j>(t„). 
(a) Show that, for y n+i as given by Eq. (5), 


&n+l — 0(t«+l) yn+ 1 

4>"(t n )h-{f[t n +h,y n +hf(t n ,y n )]-f(t n ,y n ) ) 4>"’(fn)h 3 

= 2! k + W 

(b) Making use of the facts that +f y [t,(t>(t)]<p'(t) and that the Taylor 

approximation with a remainder for a function F(t,y ) of two variables is 


F(a + h,b + k) = F(a , b) 4- F t (a, b)h 4- F y (a, b)k 


+ 


1 

2 ! 


( h 2 F„ + 2 hkF ty + k 2 F yy ) 


x=t,y=ri 


where § lies between a and a + h and rj lies between b and b + k, show that the first term 
on the right side of Eq. (i) is proportional to h 3 plus higher order terms. This is the desired 
result. 
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(c) Show that if f(t,y) is linear in t and y, then e„ + i = (p'"(t„)h 3 / 6, where t„ < t n < t n+ \ . 
Hint: What ar e/„,/ (v , and f yy l 

15. Consider the improved Euler method for solving the illustrative initial value problem 
y' = 1 — t + 4y,y(0) = 1. Using the result of Problem 14(c) and the exact solution of the 
initial value problem, determine e n+l and a bound for the error at any step on 0 < t < 2. 
Compare this error with the one obtained in Eq. (26) of Section 8.1 using the Euler method. 
Also obtain a bound for e\ for h = 0.05, and compare it with Eq. (27) of Section 8.1. 

In each of Problems 16 and 17 use the actual solution to determine e n+ i and a bound 
for e n+ i at any step on 0 < t < 1 for the improved Euler method for the given initial value 
problem. Also obtain a bound for e\ for h = 0.1, and compare it with the similar estimate for 
the Euler method and with the actual error using the improved Euler method. 

16. y' = 2y - 1, y(0) = 1 17. y' = 0.5 - t + 2y, y(0) = 1 

In each of Problems 18 through 21 carry out one step of the Euler method and of the improved 
Euler method, using the step size h = 0.1. Suppose that a local truncation error no greater 
than 0.0025 is required. Estimate the step size that is needed for the Euler method to satisfy 
this requirement at the first step. 

18. y = 0.5 - t + 2 y, y(0) = 1 19. y' = 5t - 3 Jy, y( 0) = 2 

20. y = yTTy, y(0) = 3 21. y' = (y 2 + 2ry)/(3 + t 2 ), y(0) = 0.5 

22. The modified Euler formula for the initial value problem y' = /(t, y),y(fo) = yo is given by 

yn+i — yn T hf[t n + 2 h,y n T 2 ^/(^<}’«)]. 

Following the procedure outlined in Problem 14, show that the local truncation error in 
the modified Euler formula is proportional to h 3 . 

In each of Problems 23 through 26 use the modified Euler formula of Problem 22 with h = 0.05 
to compute approximate values of the solution of the given initial value problem at t = 0.1, 
0.2, 0.3, and 0.4. Compare the results with those obtained in Problems 1 through 4. 

23. y' = 3 + t-y, y CO) = 1 gl 24. y' = 5t-3yy, y( 0) = 2 

jfl 25. y = 2y — 3 1, y(0) = 1 jfl 26. y' = 2t + e~*, y( 0) = 1 

27. Show that the modified Euler formula of Problem 22 is identical to the improved Euler 
formula of Eq. (5) for y' = f(t,y) iff is linear in both t and y. 


8.3 The Runge-Kutta Method 


In preceding sections we have introduced the Euler formula, the backward Euler 
formula, and the improved Euler formula as ways to approximate the solution of the 
initial value problem 

y'=f(t,y ), y(to)=yo (l) 


numerically. The local truncation errors for these methods are proportional to h 2 ,h 2 . 
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and h 3 , respectively. The Euler and improved Euler methods belong to what is now 
called the Runge-Kutta 2 class of methods. 

In this section we discuss the method originally developed by Runge and Kutta. 
This method is now called the classic fourth order four-stage Runge-Kutta method, 
but it is often referred to simply as the Runge-Kutta method, and we will follow this 
practice for brevity. This method has a local truncation error that is proportional 
to h 5 . Thus it is two orders of magnitude more accurate than the improved Euler 
method and three orders of magnitude better than the Euler method. It is relatively 
simple to use and is sufficiently accurate to handle many problems efficiently. This 
is especially true of adaptive Runge-Kutta methods, in which provision is made to 
vary the step size as needed. We return to this issue at the end of the section. 

The Runge-Kutta formula involves a weighted average of values of f(t,y) at dif- 
ferent points in the interval t n < t < t n+ \. It is given by 



where 


knl — f (hi . y n ) 

k„2 =f(tn + \h,y n + \hk n i), 

K-i =f(t n + \h,y n + \hk n2 ), 
kn 4 = f (hi T h, y n + hk n 3). 


( 3 ) 


The sum (k n 1 + 2k,a + 2k n j, + k n 4) /6 can be interpreted as an average slope. Note 
that k n \ is the slope at the left end of the interval, k n 2 is the slope at the midpoint 
using the Euler formula to go from t„ to t n + h/2, k n -$ is a second approximation to 
the slope at the midpoint, and k n 4 is the slope at t„ + h using the Euler formula and 
the slope k n3 to go from t n to t n + h. 

Although in principle it is not difficult to show that Eq. (2) differs from the Taylor 
expansion of the solution <f> by terms that are proportional to h 5 , the algebra is rather 
lengthy. 3 Thus we will simply accept the fact that the local truncation error in using 
Eq. (2) is proportional to h 5 and that for a finite interval the global truncation error is 
at most a constant times h 4 . The earlier description of this method as a fourth order 
four-stage method reflects the facts that the global truncation error is of fourth order 
in the step size h and that there are four intermediate stages in the calculation (the 
calculation of k n \ , . . . , k n 4). 

Clearly the Runge-Kutta formula, Eqs. (2) and (3), is more complicated than any 
of the formulas discussed previously. This is of relatively little significance, however, 
since it is not hard to write a computer program to implement this method. Such a 


2 Carl David Runge (1856-1927), German mathematician and physicist, worked for many years in spec- 
troscopy. The analysis of data led him to consider problems in numerical computation, and the Runge- 
Kutta method originated in his paper on the numerical solution of differential equations in 1895. The 
method was extended to systems of equations in 1901 by M. Wilhelm Kutta (1867-1944). Kutta was a 
German mathematician and aerodynamicist who is also well known for his important contributions to 
classical airfoil theory. 

3 See, for example, Chapter 3 of the book by Henrici listed in the references. 
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EXAMPLE 

1 


program has the same structure as the algorithm for the Euler method outlined in 
Section 8.1. To be specific, the lines in Step 6 in the Euler algorithm must be replaced 
by the following: 

Step 6. kl—f(t,y) 

k2 = f(t + 0.5 * h. y + 0.5 * h * kl) 
k3 — f(t + 0.5 * h.y + 0.5 * h * k2) 
k4 = f(t + h,y + h * k3) 
y — y + (h/6) * (kl + 2*k2 + 2*k3 + k4) 
t = t + h 

Note that if / does not depend on y, then 

Kl = f(t n ), Ki = Ki = f(t„ + h/2), k n 4 = f(t n + h), (4) 

and Eq. (2) reduces to 

y n + 1 -y n = + 4/(t„ + h/2) +f(t„ + h)]. (5) 

6 

Equation (5) can be identified as Simpson’s 4 rule for the approximate evaluation of 
the integral of y' = fit). The fact that Simpson’s rule has an error proportional to h 5 
is consistent with the local truncation error in the Runge-Kutta formula. 


Use the Runge-Kutta method to calculate approximate values of the solution y = <p(t) of the 
initial value problem 

/=!-< + 4y, y( 0) = 1. (6) 


Taking h = 0.2, we have 

k m = /( 0, 1) = 5; hk m = 1.0, 

Ki = /( 0 + 0.1,1 + 0.5) = 6.9; hk 02 = 1.38, 
k 03 = /( 0 + 0.1, 1 + 0.69) = 7.66; hk m = 1.532, 

A: 04 = /( 0 + 0.2, 1 + 1.532) = 10.928. 

Thus 

yi = 1 + °^[5 + 2(6.9) + 2(7.66) + 10.928] 

6 

= 1 + 1.5016 = 2.5016. 

Further results using the Runge-Kutta method with* = 0.2, /z = 0.1, and h = 0.05 are given 
in Table 8.3.1. Note that the Runge-Kutta method yields a value at t = 2 that differs from the 
exact solution by only 0.122% if the step size is h = 0.1, and by only 0.00903% if h = 0.05. 
In the latter case, the error is less than one part in 10,000, and the calculated value at t = 2 is 
correct to four digits. 


4 Simpson’s rule is named for Thomas Simpson (1710-1761), an English mathematician and textbook 

author, who published it in 1743. 
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For comparison, note that both the Runge-Kutta method with h = 0.05 and the improved 
Euler method with h — 0.025 require 160 evaluations of / to reach t = 2. The improved Euler 
method yields a result at t = 2 that is in error by 1.23%. Although this error may be acceptable 
for some purposes, it is more than 135 times the error yielded by the Runge-Kutta method 
with comparable computing effort. Note also that the Runge-Kutta method with h = 0.2, or 
40 evaluations of/, produces a value at t = 2 with an error of 1.40%, which is only slightly 
greater than the error in the improved Euler method with h = 0.025, or 160 evaluations of/. 
Thus we see again that a more accurate algorithm is more efficient; it produces better results 
with similar effort, or similar results with less effort. 


TABLE 8.3.1 A Comparison of Results for the Numerical Approximation of the Solution 
of the Initial Value Problem / = 1 — t + 4y, y( 0) = 1 



Improved 

Euler 


Runge-Kutta 


Exact 

t 

h = 0.025 

h = 0.2 

h = 0.1 

h = 0.05 


0 

1.0000000 

1.0000000 

1.0000000 

1.0000000 

1.0000000 

0.1 

1.6079462 


1.6089333 

1.6090338 

1.6090418 

0.2 

2.5020618 

2.5016000 

2.5050062 

2.5053060 

2.5053299 

0.3 

3.8228282 


3.8294145 

3.8300854 

3.8301388 

0.4 

5.7796888 

5.7776358 

5.7927853 

5.7941197 

5.7942260 

0.5 

8.6849039 


8.7093175 

8.7118060 

8.7120041 

1.0 

64.497931 

64.441579 

64.858107 

64.894875 

64.897803 

1.5 

474.83402 


478.81928 

479.22674 

479.25919 

2.0 

3496.6702 

3490.5574 

3535.8667 

3539.8804 

3540.2001 


The classic Runge-Kutta method suffers from the same shortcoming as other meth- 
ods with a fixed step size for problems in which the local truncation error varies 
widely over the interval of interest. That is, a step size that is small enough to achieve 
satisfactory accuracy in some parts of the interval may be much smaller than neces- 
sary in other parts of the interval. This has stimulated the development of adaptive 
Runge-Kutta methods that provide for modifying the step size automatically as the 
computation proceeds, so as to maintain the local truncation error near or below a 
specified tolerance level. As explained in Section 8.2, this requires the estimation 
of the local truncation error at each step. One way to do this is to repeat the com- 
putation with a fifth order method — which has a local truncation error proportional 
to h 6 — and then to use the difference between the two results as an estimate of the 
error. If this is done in a straightforward manner, then the use of the fifth order 
method requires at least five more evaluations of / at each step, in addition to those 
required originally by the fourth order method. However, if we make an appropriate 
choice of the intermediate points and the weighting coefficients in the expressions 
for k„ i,. . . in a certain fourth order Runge-Kutta method, then these expressions 
can be used again, together with one additional stage, in a corresponding fifth order 
method. This results in a substantial gain in efficiency. It turns out that this can be 
done in more than one way. 
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The first fourth and fifth order Runge-Kutta pair was developed by Erwin Fehlberg 5 
in the late 1960’s and is now called the Runge-Kutta-Fehlberg or RKF 6 method. The 
popularity of the RKF method was considerably enhanced by the appearance in 1977 
of its Fortran implementation RKF45 by Fawrence F. Shampine and H. A. Watts. The 
RKF method and other adaptive Runge-Kutta methods are very powerful and ef- 
ficient means of approximating numerically the solutions of an enormous class of 
initial value problems. Specific implementations of one or more of them are widely 
available in commercial software packages. 


PROBLEMS In each of Problems 1 through 6 find approximate values of the solution of the given initial 
value problem at t = 0.1, 0.2, 0.3, and 0.4. Compare the results with those obtained by using 
other methods and with the exact solution (if available). 

(a) Use the Runge-Kutta method with h = 0.1. 

(b) Use the Runge-Kutta method with h — 0.05. 


41 

1. y = 3 + t - y. 

y(0) = l 

41 

2 . y = 5t- 3 Vv, 

y(0) = 2 

41 

3. y = 2y - 3 1, 

y(0) = l 

41 

4. y = 2t + e-v, 

y(0) = l 

41 

, , .V 2 + 2 ty 

y 3 + t 2 

y(0) = 0.5 

41 

6. y = ( t 2 — y 2 ) sin y. 

v(0) = -1 


In each of Problems 7 through 12 find approximate values of the solution of the given initial 
value problem at t = 0.5, 1.0, 1.5, and 2.0. Compare the results with those obtained by other 
methods. 

(a) Use the Runge-Kutta method with h = 0.1. 

(b) Use the Runge-Kutta method with h = 0.05. 

4fl 7. / = 0.5 — f + 2y, y(0) = 1 4fl 8. y' = 5t-3Jy, y( 0) = 2 

4fl 9. y = s/TTy, y(0) = 3 10. y' = 2t + er», y(0) = 1 

11. y = (4- 09/(1 + y 2 ), y(0) = —2 

12. yt = (y 2 + 200/(3 + t 2 ), y(0) = 0.5 

13. Confirm the results in Table 8.3.1 by executing the indicated computations. 

14. Consider the initial value problem 

y' = t 2 +y 2 , y(0) = 1. 


(a) Draw a direction field for this equation. 

(b) Use the Runge-Kutta method or another method to find approximate values of the 
solution at t = 0.8, 0.9, and 0.95. Choose a small enough step size so that you believe your 
results are accurate to at least four digits. 

(c) Try to extend the calculations in part (b) to obtain an accurate approximation to the 
solution at t = 1. If you encounter difficulties in doing this, explain why you think this 
happens. The direction field in part (a) may be helpful. 


s Fehlberg (1911-1990) was born in Germany, received his doctorate from the Technical University of 
Berlin in 1942, emigrated to the United States after World War II, and was employed by NASA for many 
years. The Runge-Kutta-Fehlberg method was first published in a NASA Technical Report in 1969. 

6 The details of the RKF method may be found, for example, in the books by Ascher and Petzold and by 
Mattheij and Molenaar that are listed in the references. 


464 


Chapter 8. Numerical Methods 


15. Consider the initial value problem 

y = 3f 2 /(3y 2 - 4), y(0) = 0. 

(a) Draw a direction field for this equation. 

(b) Estimate how far the solution can be extended to the right. Let t M be the right 
endpoint of the interval of existence of this solution. What happens at t M to prevent the 
solution from continuing farther? 

(c) Use the Runge-Kutta method with various step sizes to determine an approximate 
value of t M . 

(d) If you continue the computation beyond t M , you can continue to generate values of 
y. What significance, if any, do these values have? 

(e) Suppose that the initial condition is changed to y(0) = 1. Repeat parts (b) and (c) for 
this problem. 


8.4 Multistep Methods 


In previous sections we have discussed numerical procedures for approximating the 
solution of the initial value problem 

y'=f(t,y), y(to) = yo, (1) 

in which data at the point t = t n are used to calculate an approximate value of the 
solution (j){t n+ \) at the next mesh point t = t n+ \ . In other words, the calculated value 
of </> at any mesh point depends only on the data at the preceding mesh point. Such 
methods are called one-step methods. However, once approximate values of the 
solution y — (f>{t) have been obtained at a few points beyond to, it is natural to ask 
whether we can make use of some of this information, rather than just the value at 
the last point, to calculate the value of 0(t) at the next point. Specifically, if y\ at t\, 
yj at t 2 , . ■ ■ ,y n at t„ are known, how can we use this information to determine y n+ \ at 
t n+ 1 ? Methods that use information at more than the last mesh point are referred to 
as multistep methods. In this section we will describe two types of multistep methods: 
Adams 7 methods and backward differentiation formulas. Within each type, we can 
achieve various levels of accuracy, depending on the number of preceding data points 
that are used. For simplicity, we will assume throughout our discussion that the step 
size h is constant. 

Adams Methods. Recall that 

r ^n + 1 

4>(t n+ i) - = / (p'(t) dt, (2) 


7 John Couch Adams (1819-1892), an English astronomer, is most famous as codiscoverer, with Joseph 
Leverrier, of the planet Neptune in 1846. Adams was also extremely skilled at computation; his procedure 
for numerical integration of differential equations appeared in 1883 in a book that he wrote with Francis 
Bashforth on capillary action. 
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where <p(t) is the solution of the initial value problem (1). The basic idea of an 
Adams method is to approximate (p'(t) by a polynomial Pk(t) of degree k and to use 
the polynomial to evaluate the integral on the right side of Eq. (2). The coefficients in 
Pk(t) are determined by using k + 1 previously calculated data points. For example, 
suppose that we wish to use a first degree polynomial Pi(t) = At + B. Then we need 
only the two data points ( t n ,y n ) and Since P\ is to be an approximation 

to </>', we require that P\{t n ) — f(t n ,y n ) and that Recall that 

we denote f(tj,yj) by fj for an integer j. Then A and B must satisfy the equations 


At n + B — f n , 
At n ~ i + B — fn—i . 

Solving for A and B , we obtain 

fn - fn - 1 


A = 


B = 


fn—Pn fn Ui — I 


(3) 


(4) 


h h 

Replacing (/>'(/) by P\(t) and evaluating the integral in Eq. (2), we find that 

< P(fn+ 1 ) <P(tn) = t„) + B(t n +i t n ). 

Finally, we replace r/j (f„+i) and <!> (r n ) by y n+ \ and y„ , respectively, and carry out some 
algebraic simplification. For a constant step size h we obtain 


y n + 1 = y,t + \hf n - \hf n - 1 . 


(5) 


Equation (5) is the second order Adams-Bashforth formula. It is an explicit formula 
for y n+ 1 in terms of y n and y„_i and has a local truncation error proportional to IP. 

We note in passing that the first order Adams-Bashforth formula, based on the 
polynomial Po{t) = f n of degree zero, is just the original Euler formula. 

More accurate Adams formulas can be obtained by following the procedure out- 
lined above, but using a higher degree polynomial and correspondingly more data 
points. For example, suppose that a polynomial / 3(f) of degree three is used. The 
coefficients are determined from the four points ( t n ,y n ), (7„_i ,y H .-i ), (t n - 2 ,y n -i)^ an d 
(t„_3,y„_3). Substituting this polynomial for <p’{t) in Eq. (2), evaluating the integral, 
and simplifying the result, we eventually obtain the fourth order Adams-Bashforth 
formula 

y n + 1 =y n + (h/24)(55f n - 59/„_i + 37 f n _ 2 ~ 9f n -3 )■ (6) 


The local truncation error of this fourth order formula is proportional to h 5 . 

A variation on the derivation of the Adams-Bashforth formulas gives another 
set of formulas called the Adams-Moulton 8 formulas. To see the difference, let 
us again consider the second order case. Again we use a first degree polynomial 


s Forest Ray Moulton (1872-1952) was an American astronomer and administrator of science. While 
calculating ballistics trajectories during World War I, he devised substantial improvements in the Adams 
formula. 
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Qi(t) = at + p, but we determine the coefficients by using the points ( t n ,y n ) and 
(t n+ \,y n+ \). Thus a and ft must satisfy 

°it n + ft = ft, 

®tn+l T ft — fn+\> 


and it follows that 

fn+l ~ fn Q fntn+1 ~ fn+ltn 

“ = f = h ' (8 > 

Substituting Q\ (t) for <p'(t) in Eq. (2) and simplifying, we obtain 

y n +\ = y n + \hf„ + \hf(t n+ i,y n+1 ), (9) 

which is the second order Adams-Moulton formula. We have written f(t n+ \,y n+ \) 
in the last term to emphasize that the Adams-Moulton formula is implicit, rather 
than explicit, since the unknown y n+ \ appears on both sides of the equation. The 
local truncation error for the second order Adams-Moulton formula is proportional 
to lift . 

The first order Adams-Moulton formula is just the backward Euler formula, as 
you might anticipate by analogy with the first order Adams-Bashforth formula. 

More accurate higher order formulas can be obtained by using an approximating 
polynomial of higher degree. The fourth order Adams-Moulton formula, with a local 
truncation error proportional to h 5 , is 

y «+ 1 = y« + (/j/24)(9/„ + i + i9/„ — 5/„_i + f n - 2 ). (10) 


Observe that this is also an implicit formula because y n+ \ appears in/„ + i. 

Although both the Adams-Bashforth and Adams-Moulton formulas of the same 
order have local truncation errors proportional to the same power of h, the Adams- 
Moulton formulas of moderate order are in fact considerably more accurate. For 
example, for the fourth order formulas (6) and (10), the proportionality constant 
for the Adams-Moulton formula is less than 1/10 of the proportionality constant 
for the Adams-Bashforth formula. Thus the question arises: is it better to use the 
explicit (and faster) Adams-Bashforth formula or the more accurate but implicit 
(and slower) Adams-Moulton formula? The answer depends on whether, by using 
the more accurate formula, you can increase the step size, and thereby reduce the 
number of steps enough to compensate for the additional computations required at 
each step. 

In fact, numerical analysts have attempted to achieve both simplicity and accuracy 
by combining the two formulas in what is called a predictor-corrector method. Once 
y n ~ 3 ,y n ~ 2 , y„- 1 , and y n are known, we can compute /„_3 ,/«_2 ,/«-t , and f, , and then use 
the Adams-Bashforth (predictor) formula (6) to obtain a first value for y n +x- Then 
we compute /„ + i and use the Adams-Moulton (corrector) formula (10), which is no 
longer implicit, to obtain an improved value of y n+ \ . We can, of course, continue to 
use the corrector formula (10) if the change in y n+ \ is too large. However, if it is 
necessary to use the corrector formula more than once or perhaps twice, it means 
that the step size h is too large and should be reduced. 

In order to use any of the multistep methods, it is necessary first to calculate a few 
yj by some other method. For example, the fourth order Adams-Moulton method 
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requires values for y\ and yy, while the fourth order Adams-Bashforth method also 
requires a value for y 3 . One way to proceed is to use a one-step method of comparable 
accuracy to calculate the necessary starting values. Thus, for a fourth order multistep 
method, one might use the fourth order Runge-Kutta method to calculate the starting 
values. This is the method used in the next example. 

Another approach is to use a low order method with a very small h to calculate y\, 
and then to increase gradually both the order and the step size until enough starting 
values have been determined. 


EXAMPLE 

1 


Consider again the initial value problem 


y' = l-t + 4y, y(0) = 1. 


( 11 ) 


With a step size of h = 0.1, determine an approximate value of the solution y = 0 (f) at t = 0.4 
using the fourth order Adams-Bashforth formula, the fourth order Adams-Moulton formula, 
and the predictor-corrector method. 

For starting data, we use the values of yi,y 2 , and y 3 found by the Runge-Kutta method. 
These are tabulated in Table 8.3.1. Next, calculating the corresponding values of /(f,y), we 
obtain 

yo = 1, fo = 5, 


yi = 1.6089333, 
y 2 = 2.5050062, 
y 3 = 3.8294145, 


/i = 7.3357332, 
f 2 = 10.820025, 
/ 3 = 16.017658. 


Then, from the Adams-Bashforth formula, Eq. (6), we find that y 4 = 5.7836305. The ex- 
act value of the solution at t = 0.4, correct through eight digits, is 5.7942260, so the error is 
-0.010595. 

The Adams-Moulton formula, Eq. (10), leads to the equation 


y 4 = 4.9251275 + 0.15y 4 , 


from which it follows that y 4 = 5.7942676 with an error of only 0.0000416. 

Finally, using the result from the Adams-Bashforth formula as a predicted value of 0(0.4), 
we can then use Eq. (10) as a corrector. Corresponding to the predicted value of y 4 , we find 
that/ 4 = 23.734522. Hence, from Eq. (10), the corrected value of y 4 is 5.7926721. This result 
is in error by —0.0015539. 

Observe that the Adams-Bashforth method is the simplest and fastest of these methods, 
since it involves only the evaluation of a single explicit formula, ft is also the least accurate. 
Using the Adams-Moulton formula as a corrector increases the amount of calculation that is 
required, but the method is still explicit. In this problem the error in the corrected value of y 4 
is reduced by approximately a factor of 7 when compared to the error in the predicted value. 
The Adams-Moulton method alone yields by far the best result, with an error that is about 
1/40 as large as the error from the predictor-corrector method. Remember, however, that 
the Adams-Moulton method is implicit, which means that an equation must be solved at each 
step. In the problem considered here this equation is linear, so the solution is quickly found, 
but in other problems this part of the procedure may be much more time-consuming. 

The Runge-Kutta method with h = 0.1 gives y 4 = 5.7927853 with an error of —0.0014407; 
see Table 8.3.1. Thus, for this problem, the Runge-Kutta method is comparable in accuracy to 
the predictor-corrector method. 
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EXAMPLE 

2 


Backward Differentiation Formulas. Another type ofmultistep method uses a polynomial 
Pk(t) to approximate the solution (j>(t) of the initial value problem ( 1 ) rather than 
its derivative cp'ft), as in the Adams methods. We then differentiate f\ (t) and set 
P' k (t n+ \ ) equal to/(f n+ i,y„ + i) to obtain an implicit formula foi' y n+ \ . These are called 
backward differentiation formulas. These methods became widely used in the 1970s 
because of the work of C. William Gear 9 on so-called stiff differential equations, 
whose solutions are very difficult to approximate by the methods discussed up to 
now; see Section 8.5. 

The simplest case uses a first degree polynomial P\(t) = At + B. The coefficients 
are chosen to match the computed values of the solution y n and y n +i- Thus A and B 
must satisfy 

At n 

At n . )_i 

Since P\ (t) = A , the requirement that 

P\{tn+l) 

is just 

A=f(t n+1 ,y n+ i). (13) 

Another expression for A comes from subtracting the first of Eqs. (12) from the 
second, which gives 

A = (y n+ 1 - y n )/h . 

Substituting this value of A into Eq. (13) and rearranging terms, we obtain the first 
order backward differentiation formula 

y„+i = y n + hf(t n+ ].y n+ \). (14) 

Note that Eq. (14) is just the backward Euler formula that we first saw in Section 8.1. 

By using higher order polynomials and correspondingly more data points, we can 
obtain backward differentiation formulas of any order. The second order formula is 

y n +\ = j[4 y n - y n - i + 2hf(t n+ i,y n +i)] , (15) 

and the fourth order formula is 

y n + 1 = 55 [48y„ - 36y„_i + 16y„_ 2 - 3y „_ 3 + I2hf(t n+1 ,y n+1 )] . (16) 

These formulas have local truncation errors proportional to h 3 and h 5 , respectively. 


+ B — y n , 

+ B = y n+1 . 


— / (fn+lAn+l) 


( 12 ) 


Use the fourth order backward differentiation formula with h — 0.1 and the data given in 
Example 1 to determine an approximate value of the solution y = (j> (t) at t = 0.4 for the initial 
value problem (11). 


9 C. William Gear (1935- ), born in London, England, received his undergraduate education at Cambridge 
University and his doctorate in 1960 from the University of Illinois. He was a member of the faculty at the 
University of Illinois for most of his career and made significant contributions both to computer design 
and numerical analysis. His influential book on numerical methods for differential equations is listed in 
the references. 
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Using Eq. (16) with n = 3, h = 0.1, and with yo, ■ ■ • ,>7 given in Example 1, we obtain the 
equation 


y 4 = 4.6837842 + 0.192y 4 . 


Thus 


y 4 = 5.7967626. 


Comparing the calculated value with the exact value 0 (0.4) = 5.7942260, we find that the error 
is 0.0025366. This is somewhat better than the result using the Adams-Bashforth method, but 
not as good as the result using the predictor-corrector method, and not nearly as good as the 
result using the Adams-Moulton method. 


A comparison between one-step and multistep methods must take several factors 
into consideration. The fourth order Runge-Kutta method requires four evalua- 
tions off at each step, while the fourth order Adams-Bashforth method (once past 
the starting values) requires only one, and the predictor-corrector method only two. 
Thus, for a given step size h, the latter two methods may well be considerably faster 
than Runge-Kutta. However, if Runge-Kutta is more accurate and therefore can 
use fewer steps, then the difference in speed will be reduced and perhaps eliminated. 
The Adams-Moulton and backward differentiation formulas also require that the 
difficulty in solving the implicit equation at each step be taken into account. All 
multistep methods have the possible disadvantage that errors in earlier steps can 
feed back into later calculations with unfavorable consequences. On the other hand, 
the underlying polynomial approximations in multistep methods make it easy to ap- 
proximate the solution at points between the mesh points, should this be desirable. 
Multistep methods have become popular largely because it is relatively easy to esti- 
mate the error at each step and to adjust the order or the step size to control it. For 
a further discussion of such questions as these, see the books listed at the end of this 
chapter; in particular, Shampine (1994) is an authoritative source. 


PROBLEMS 


4fl 

4fl 


In each of Problems 1 through 6 determine an approximate value of the solution at t = 0.4 
and t = 0.5 using the specified method. For starting values use the values given by the Runge- 
Kutta method; see Problems 1 through 6 of Section 8.3. Compare the results of the various 
methods with each other and with the actual solution (if available). 

(a) Use the fourth order predictor-corrector method with h = 0.1. Use the corrector formula 
once at each step. 

(b) Use the fourth order Adams-Moulton method with h = 0.1. 

(c) Use the fourth order backward differentiation method with h = 0.1. 

1. y = 3 + 1 - y, y(0) = 1 4fl 2. ? = 5f - IJy, y(0) = 2 

3. y' = 2y — 3 1, y(0) = 1 4. y' = 2t + e"*, y(0) = 1 


4fl 


5. 


y 2 + 

3 + r 2 ’ 


y(0) = 0.5 


6. / = (t 2 — y 2 ) sin y, y(0) = — 1 


In each of Problems 7 through 12 find approximate values of the solution of the given initial 
value problem at t = 0.5, 1.0, 1.5, and 2.0, using the specified method. For starting values 
use the values given by the Runge-Kutta method; see Problems 7 through 12 in Section 8.3. 
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Compare the results of the various methods with each other and with the actual solution (if 
available). 

(a) Use the fourth order predictor-corrector method with/; = 0.05. Use the corrector formula 
once at each step. 

(b) Use the fourth order Adams-Moulton method with h = 0.05. 

(c) Use the fourth order backward differentiation method with h = 0.05. 

41 7. / = 0.5 — t + 2y, y(0) = 1 41 8. y’ = 5t-3Jy, y( 0) = 2 

41 9. y = JT+y, y(0) = 3 41 10. y' = 2t + e~*, ytfl) = 1 

41 11. / = (4 - ty)/( 1 + y 2 ), y(0) = -2 

41 12. / = ( y 2 + 2fy)/(3 + t 2 ), y(0) = 0.5 

13. Show that the first order Adams-Bashforth method is the Euler method and that the first 
order Adams-Moulton method is the backward Euler method. 

14. Show that the third order Adams-Bashforth formula is 

J'n+i = y n + (/i/12)(23/„ - 16/„_i + 5 f„- 2 ). 

15. Show that the third order Adams-Moulton formula is 

y n + 1 = y n + (h/12)(5f n+ i + 8 f n i). 

16. Derive the second order backward differentiation formula given by Eq. ( 15 ) in this section. 


8.5 More on Errors; Stability 

In Section 8.1 we discussed some ideas related to the errors that can occur in a 
numerical approximation of the solution of the initial value problem 

y'=f(t,y), y(to)=yo- (i) 

In this section we continue that discussion and also point out some other difficulties 
that can arise. Some of the points that we wish to make are fairly difficult to treat in 
detail, so we will illustrate them by means of examples. 

Truncation and Round-off Errors. Recall that for the Euler method we showed that the 
local truncation error is proportional to h 2 and that for a finite interval the global 
truncation error is at most a constant times h. In general, for a method of order /;, the 
local truncation error is proportional to h p+1 and the global truncation error on a finite 
interval is bounded by a constant times h p . To achieve high accuracy, we normally use 
a numerical procedure for which p is fairly large, perhaps 4 or higher. As p increases, 
the formula used in computing y n+ \ normally becomes more complicated, and hence 
more calculations are required at each step; however, this is usually not a serious 
problem unless f(t,y) is very complicated or the calculation must be repeated very 
many times. If the step size h is decreased, the global truncation error is decreased by 
the same factor raised to the power p. However, as we mentioned in Section 8.1, if h is 
very small, a great many steps will be required to cover a fixed interval, and the global 
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round-off error may be larger than the global truncation error. The situation is shown 
schematically in Figure 8.5.1. We assume that the round-off error R n is proportional 
to the number of computations performed and therefore is inversely proportional 
to the step size h. On the other hand, the truncation error E„ is proportional to a 
positive power of h. From Eq. (17) of Section 8.1 we know that the total error is 
bounded by \E n \ + R n j ; hence we wish to choose h so as to minimize this quantity. 
The optimum value of h occurs when the rate of increase of the truncation error (as 
h increases) is balanced by the rate of decrease of the round-off error, as indicated 
in Figure 8.5.1. 



FIGURE 8.5.1 The dependence of truncation and round-off errors on the step size h. 


Consider the example problem 

EXAMPLE 

j y' — 1 — t + 4y, y( 0) = 1. (2) 

Using the Euler method with various step sizes, calculate approximate values for the solution 
0(f) at r = 0.5 and t = 1. Try to determine the optimum step size. 

Keeping only four digits in order to shorten the calculations, we obtain the data shown in 
Table 8.5.1. The first two columns are the step size h and the number of steps N required to 
traverse the interval 0 < t < 1. Then y N/2 and y N are approximations to 0(0.5) = 8.712 and 
0(1) = 64.90, respectively. These quantities appear in the third and fifth columns. The fourth 


TABLE 8.5.1 Approximations to the Solution of the Initial Value 
Problem yf = 1 — t + 4y, y(0) = 1 Using the Euler Method with 
Different Step Sizes 


h 

N 

yN/ 2 

Error 

yN 

Error 

0.01 

100 

8.390 

0.322 

60.12 

4.78 

0.005 

200 

8.551 

0.161 

62.51 

2.39 

0.002 

500 

8.633 

0.079 

63.75 

1.15 

0.001 

1000 

8.656 

0.056 

63.94 

0.96 

0.0008 

1250 

8.636 

0.076 

63.78 

1.12 

0.000625 

1600 

8.616 

0.096 

64.35 

0.55 

0.0005 

2000 

8.772 

0.060 

64.00 

0.90 

0.0004 

2500 

8.507 

0.205 

63.40 

1.50 

0.00025 

4000 

8.231 

0.481 

56.77 

8.13 
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and sixth columns display the differences between the calculated values and the actual value 
of the solution. 

For relatively large step sizes the round-off error is much less than the global truncation 
error. Consequently, the total error is approximately the same as the global truncation error, 
which for the Euler method is bounded by a constant times h. Thus, as the step size is reduced, 
the error is reduced proportionally. The first three lines in Table 8.5.1 show this type of 
behavior. For h = 0.001 the error has been further reduced, but much less than proportionally; 
this indicates that round-off error is becoming important. As h is reduced still more, the error 
begins to fluctuate, and further improvements in accuracy become problematical. For values 
of h less than 0.0005 the error is clearly increasing, which indicates that round-off error is now 
the dominant part of the total error. 

These results can also be expressed in terms of the number of steps N. For N less than 
about 1000 accuracy is improved by taking more steps, while for N greater than about 2000 
using more steps has an adverse effect. Thus for this problem it is best to use an N somewhere 
between 1000 and 2000. For the calculations shown in Table 8.5.1 the best result at t — 0.5 
occurs for N = 1000, while at t = 1.0 the best result is for N = 1600. 


You should be careful not to read too much into the results shown in Example 1. 
The optimum ranges for h and N depend on the differential equation, the numerical 
method that is used, and the number of digits that are retained in the calculation. 
Nevertheless, it is generally true that if too many steps are required in a calculation, 
then eventually round-off error is likely to accumulate to the point where it seriously 
degrades the accuracy of the procedure. For many problems this is not a concern: 
for them, any of the fourth order methods we have discussed in Sections 8.3 and 8.4 
will produce good results with a number of steps far less than the level at which 
round-off error becomes important. For some problems, however, round-off error 
does become vitally important. For such problems, the choice of method may be 
crucial. This is also one reason why modern codes provide a means of adjusting the 
step size as they go along, using a larger step size wherever possible and a very small 
step size only where necessary. 

Vertical Asymptotes. As a second example, consider the problem of determining the 
solution y = 0(f) of 

y' = t 2 +y 2 , y(0) = 1. (3) 

Since the differential equation is nonlinear, the existence and uniqueness theorem 
(Theorem 2.4.2) guarantees only that there is a solution in some interval about t = 0. 
Suppose that we try to compute a solution of the initial value problem on the interval 
0 < t < 1 using different numerical procedures. 

If we use the Euler method with h = 0.1, 0.05, and 0.01, we find the following 
approximate values at t = 1; 7.189548, 12.32093, and 90.75551, respectively. The large 
differences among the computed values are convincing evidence that we should use a 
more accurate numerical procedure — the Runge-Kutta method, for example. Using 
the Runge-Kutta method with h — 0.1, we find the approximate value 735.0991 at 
t = 1, which is quite different from those obtained using the Euler method. Repeating 
the calculations using step sizes of h = 0.05 and h = 0.01, we obtain the interesting 
information shown in Table 8.5.2. 

The values at t — 0.90 are reasonable and we might well believe that the solution has 
a value of about 14.305 at t = 0.90. However, it is not clear what is happening between 
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TABLE 8.5.2 Approximations to the Solution 
of the Initial Value Problem y' = t 2 + y 2 , 
y(0) = 1 Using the Runge-Kutta Method 


h 

t = 0.90 

t = 1.0 

0.1 

14.02182 

735.0991 

0.05 

14.27117 

1.75863 x 10 s 

0.01 

14.30478 

2.0913 x 10 2893 

0.001 

14.30486 



t = 0.9 and t — 1.0. To help clarify this, let us turn to some analytical approximations 
to the solution of the initial value problem (3). Note that, on 0 < t < 1, 

y 2 < t 2 + y 2 < 1 + y 2 . (4) 

This suggests that the solution y = tp\ (?) of 

/ = l + y 2 , y(0) = 1 (5) 

and the solution y = (fait) of 

/ = y 2 , y( 0 ) = 1 ( 6 ) 

are upper and lower bounds, respectively, for the solution y = 0 ( t ) of the original 
problem, since all these solutions pass through the same initial point. Indeed, it can be 
shown (for example, by the iteration method of Section 2.8) that 02 (f) <0(0 < 0i(O 
as long as these functions exist. The important thing to note is that we can solve 
Eqs. (5) and (6) for 0i and 02 by separation of variables. We find that 

0t(O = tan (f + , 02 (0 = ( 7 ) 

Thus 02(0 — > oo as t — > 1, and 0i(O — > oo as t — >■ 7t/4 = 0.785. These calculations 
show that the solution of the original initial value problem exists at least for 
0 < t < n/4 and at most for 0 < / < 1. The solution of the problem (3) has a vertical 
asymptote for some t in 7r/4 < t < 1 and thus does not exist on the entire interval 
0< t < 1. 

Our numerical calculations, however, suggest that we can go beyond t — n/4, and 
probably beyond t = 0.9. Assuming that the solution of the initial value problem 
exists at t = 0.9 and has the value 14.305, we can obtain a more accurate appraisal of 
what happens for larger t by considering the initial value problems (5) and (6) with 
_y(0) = 1 replaced by y(0.9) = 14.305. Then we obtain 

0i (0 = tan(f + 0.60100), 0 2 (f) = 1/(0.96991 - t ), (8) 

where only five decimal places have been kept. Thus 0i(f) ->oo as 
t — > n/2 — 0.60100 = 0.96980 and 02(0 — > oo as t — ► 0.96991. We conclude that the 
asymptote of the solution of the initial value problem (3) lies between these two 
values. This example illustrates the sort of information that can be obtained by a 
judicious combination of analytical and numerical work. 

Stability. The concept of stability is associated with the possibility that small errors 
that are introduced in the course of a mathematical procedure may die out as the 


474 


Chapter 8. Numerical Methods 


procedure continues. Conversely, instability occurs if small errors tend to increase, 
perhaps without bound. For example, in Section 2.5 we identified equilibrium so- 
lutions of a differential equation as (asymptotically) stable or unstable, depending 
on whether solutions that were initially near the equilibrium solution tended to ap- 
proach it or to depart from it as t increased. Somewhat more generally, the solution 
of an initial value problem is asymptotically stable if initially nearby solutions tend 
to approach the given solution, and unstable if they tend to depart from it. Visually, 
in an asymptotically stable problem the graphs of solutions will come together, while 
in an unstable problem they will separate. 

If we are investigating an initial value problem numerically, the best that we can 
hope for is that the numerical approximation will mimic the behavior of the actual 
solution. We cannot make an unstable problem into a stable one merely by ap- 
proximating its solution numerically. Flowever, it may well happen that a numerical 
procedure will introduce instabilities that were not part of the original problem, and 
this can cause trouble in approximating the solution. Avoidance of such instabilities 
may require us to place restrictions on the step size h. 

To illustrate what can happen in the simplest possible context, consider the differ- 
ential equation 

dy/dt = ry , (9) 

where r is a constant. Suppose that in solving this equation we have reached the 
point (t n ,y n ). Let us compare the exact solution of Eq. (9) that passes through this 
point, namely, 

y = y n exp[r(t - f„)], (10) 

with numerical approximations obtained from the Euler formula 

y n +i = y„ + hf(t n ,y n ) (ll) 

and from the backward Euler formula 

y n + 1 = y„ + hf(t n+ i,y n +i). (12) 

From the Euler formula (11) we obtain 

y n + 1 = y n + hry n = y n ( 1 + rh). (13) 


Similarly, from the backward Euler formula (12) we have 

y n + 1 = y n + hry n+ 1, 


or 


y n + 1 = 1 y " , = yA 1 + rh + {rh) 2 H ]. 

1 — rh 


Finally, evaluating the solution (10) at t„ + h, we find that 


y n + 1 = y n exp {rh) = y n 


1 + rh + 



(14) 


(15) 


Comparing Eqs. (13), (14), and (15), we see that the errors in both the Euler formula 
and the backward Euler formula are of order h 2 , as the theory predicts. 
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Now suppose that we change the value y n to y„ + S. Think, if you wish, of S as the 
error that has accumulated by the time we reach t = t n . The question is whether this 
error increases or decreases in going one more step to t n+ \ . 

For the exact solution (15), the change in y„+i due to the change S in y„ is just 
8 exp (rh). This quantity is less than 8 if exp (rh) < 1, that is, if r < 0. This confirms our 
conclusion in Chapter 2 that Eq. (9) is asymptotically stable if r < 0, and is unstable 
if r > 0. 

For the backward Euler method, the change iny„ + i in Eq. (14) due to <5 is <5/(1 — rh). 
For r < 0 the quantity 1/(1 — rh) is always nonnegative and never greater than 1. 
Thus, if the differential equation is stable, then so is the backward Euler method for 
an arbitrary step size h. 

On the other hand, for the Euler method, the change in y n+ \ in Eq. (13) due to 8 
is 5(1 + rh). If we recall that r < 0 and require that 1 1 + rh\ < 1, then we find that h 
must satisfy h < 2/\r\. Thus the Euler method is not stable for this problem unless h 
is sufficiently small. 

The restriction on the step size h in using the Euler method in the preceding 
example is rather mild unless \r\ is quite large. Nonetheless, the example illustrates 
that it may be necessary to restrict h in order to achieve stability in the numerical 
method, even though the initial value problem itself is stable for all values of h. 
Problems for which a much smaller step size is needed for stability than for accuracy 
are called stiff. The backward differentiation formulas described in Section 8.4 (of 
which the backward Euler formula is the lowest order example) are the most popular 
formulas for dealing with stiff problems. The following example illustrates the kind of 
instability that can occur when we try to approximate the solution of a stiff problem. 


EXAMPLE 

2 

A Stiff 
Problem 


Consider the initial value problem 

/ = -lOOy + lOOf + 1, y(0) = 1. (16) 

Find numerical approximations to the solution for 0 < t < 1 using the Euler, backward Euler, 
and Runge-Kutta methods. Compare the numerical results with the exact solution. 

Since the differential equation is linear, it is easy to solve, and the solution of the initial 
value problem (16) is 

y = (P(t) = e- 100, + t. (17) 

Some values of the solution <j> (t), correct to six decimal places, are given in the second column of 
Table 8.5.3, and a graph of the solution is shown in Figure 8.5.2. There is a thin layer (sometimes 
called a boundary layer) to the right of t = 0 in which the exponential term is significant and 
the solution varies rapidly. Once past this layer, however, <f>(t) = t and the graph of the solution 
is essentially a straight line. The width of the boundary layer is somewhat arbitrary, but it is 
certainly small. At t = 0.1, for example, exp(— lOOr) = 0.000045. 

If we plan to approximate the solution (17) numerically, we might intuitively expect that a 
small step size will be needed only in the boundary layer. To make this expectation a bit more 
precise, recall from Section 8.1 that the local truncation errors for the Euler and backward 
Euler methods are proportional to cp"(t). For this problem </"(f) = 10 4 e~ 100 ', which varies 
from a value of 10 4 at t = 0 to nearly zero for t > 0.2. Thus a very small step size is needed for 
accuracy near t = 0, but a much larger step size is adequate once t is a little larger. 

On the other hand, the stability analysis in Eqs. (9) through (15) also applies to this problem. 
Since r = —100 for Eq. (16), it follows that for stability we need h < 0.02 for the Euler method, 
but there is no corresponding restriction for the backward Euler method. 
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TABLE 8.5.3 Numerical Approximations to the Solution of the Initial Value Problem 
y' = -lOOy + lOOt + 1, y(0) = 1 


t 

Exact 

Euler 

0.025 

Euler 

0.0166... 

Runge-Kutta 
0.0333 . . . 

Runge-Kutta 

0.025 

Backward Euler 
0.1 

0.0 

1.000000 

1.000000 

1.000000 

1.000000 

1.000000 

1.000000 

0.05 

0.056738 

2.300000 

-0.246296 


0.470471 


0.1 

0.100045 

5.162500 

0.187792 

10.6527 

0.276796 

0.190909 

0.2 

0.200000 

25.8289 

0.207707 

111.559 

0.231257 

0.208264 

0.4 

0.400000 

657.241 

0.400059 

1.24 x 10 4 

0.400977 

0.400068 

0.6 

0.600000 

1.68 x 10 4 

0.600000 

1.38 x 10 6 

0.600031 

0.600001 

0.8 

0.800000 

4.31 x 10 s 

0.800000 

1.54 x 10 8 

0.800001 

0.800000 

1.0 

1.000000 

1.11 x 10 7 

1.000000 

1.71 x 10 10 

1.000000 

1.000000 


Some results obtained from the Euler method are shown in columns 3 and 4 of Table 8.5.3. 
The values for h = 0.025 are worthless because of instability, while those for h = 0.01666 . . . are 
reasonably accurate for t > 0.2. However, comparable accuracy for this range of t is obtained 
for h — 0.1 by using the backward Euler method, as shown by the results in column 7 of the 
table. 

The situation is not improved by using, instead of the Euler method, a more accurate one, 
such as Runge-Kutta. For this problem the Runge-Kutta method is unstable for h = 0.033 . . . 
but stable for h = 0.025, as shown by the results in columns 5 and 6 of Table 8.5.3. 

The results given in the table for t = 0.05 and for t = 0.1 show that in the boundary layer a 
smaller step size is needed to obtain an accurate approximation. You are invited to explore 
this matter further in Problem 3. 


y 

1 

0.8 

0.6 

0.4 

0.2 


FIGURE 8.5.2 The solution of the initial value problem (16). 


t 


0.2 0.4 0.6 0.8 1 t 


As a final example, consider the problem of determining two linearly independent 
solutions of the second order linear equation 

y" - 10;r 2 y = 0 (18) 

for t > 0. The generalization of numerical techniques for the first order equations 
to higher order equations or to systems of equations is discussed in Section 8.6, 
but that is not needed for the present discussion. Two linearly independent solu- 
tions of Eq. (18) are </>i(f) = cosh VltErf and 02(0 = sinh VltErf. The first solution, 
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0i(t) = cosh yiO jrt, is generated by the initial conditions 0i(O) = 1, 0(( 0) = 0; the 
second solution, 02 (t) = sinh VTO nt, is generated by the initial conditions 02(0 ) = 0, 
0'(O) = vTOtt. Although analytically we can tell the difference between 
cosh VTfbrf and sinh vTfbrf, for large t we have cosh VTOtt/ ~ e v/|(l7r '/2 and 
sinh vTdjr/ 1 ~ e^ nt /2\ numerically these two functions look exactly the same if only 
a fixed number of digits are retained. For example, correct to eight significant figures, 
we find that for t = 1, 


sinh\/l07r = cosh VTOtt = 10,315.894. 

If the calculations are performed on a machine that carries only eight digits, the 
two solutions 0i and 02 are identical at t — 1 and indeed for all t > 1. Thus, even 
though the solutions are linearly independent, their numerical tabulation would show 
that they are the same because we can retain only a finite number of digits. This 
phenomenon is called numerical dependence. 

For the present problem we can partially circumvent this difficulty by comput- 
ing, instead of sinh VTO.t / 1 and cosh VTfbrt, the linearly independent solutions 
03 (f) = e •Su >7Tt and 04(f) = g-viOjo corresponding to the initial conditions 03(0) = 1, 
03(0) = VIOjt and 04(0) = 1, 0^(0) = — VT0 tt, respectively. The solution 03 grows 
exponentially while 0 4 decays exponentially. Even so, we encounter difficulty in 
calculating 04 correctly on a large interval. The reason is that at each step of the 
calculation for 04 we introduce truncation and round-off errors. Thus, at any point 
t„, the data to be used in going to the next point are not precisely the values of 04 (f„) 
and 04 (t n ). The solution of the initial value problem with these data at t„ involves 
not only e~^ nt but also e '^ 7rt . Because the error in the data at t n is small, the latter 
function appears with a very small coefficient. Nevertheless, since e~'^ nt tends to 
zero and g'dojrr g rows ver y ra pidly, the latter eventually dominates, and the calculated 
solution is simply a multiple of e v l(l - T ' = 03(f). 

To be specific, suppose that we use the Runge-Kutta method to calculate the so- 
lution y = 04(f) = g-'/iOjrf 0 f value problem 

y" - lCbr 2 y = 0, y(0) = 1, /( 0) = — VTOj r. 

(The Runge-Kutta method for second order systems is described in Section 8.6.) 
Using single-precision (eight-digit) arithmetic with a step size h = 0.01, we obtain 
the results in Table 8.5.4. It is clear from these results that the numerical approxi- 
mation begins to deviate significantly from the exact solution for / > 0.5, and soon 
differs from it by many orders of magnitude. The reason is the presence, in the nu- 
merical approximation, of a small component of the exponentially growing solution 
03 it) — e vl(LT '. With eight-digit arithmetic we can expect a round-off error of the 
order of 10~ 8 at each step. Since e^ nt grows by a factor of 3.7 x 10 21 from t = 0 to 
t = 5, an error of order 10 -8 near t = 0 can produce an error of order 10 13 at t = 5 
even if no further errors are introduced in the intervening calculations. The results 
given in Table 8.5.4 demonstrate that this is exactly what happens. 

You should bear in mind that the numerical values of the entries in the second 
column of Table 8.5.4 are extremely sensitive to slight variations in how the calcula- 
tions are executed. Regardless of such details, however, the exponential growth of 
the approximation will be clearly evident. 
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TABLE 8.5.4 Exact Solution of y" — 10n 2 y — 0, 
y(0) = 1, y'(0) = — yiCbr and Numerical 
Approximation Using the Runge-Kutta Method 
with h = 0.01 


y 

t 

Numerical 

Exact 

0.0 

1.0 

1.0 

0.25 

8.3439 x 10~ 2 

8.3438 x 10- 2 

0.5 

6.9623 x 10- 3 

6.9620 x 10- 3 

0.75 

5.8409 x 10- 4 

5.8089 x 10- 4 

1.0 

8.6688 x 10- 5 

4.8469 x 10- 5 

1.5 

5.4900 x 10- 3 

3.3744 x 10- 7 

2.0 

7.8852 x 10- 1 

2.3492 x 10- 9 

2.5 

1.1326 x 10 2 

1.6355 x 10- 11 

3.0 

1.6268 x 10 4 

1.1386 x 10- 13 

3.5 

2.3368 x 10 6 

7.9272 x 10- 16 

4.0 

3.3565 x 10 8 

5.5189 x 10- 18 

4.5 

4.8211 x 10 10 

3.8422 x 10- 20 

5.0 

6.9249 x 10 12 

2.6749 x 10- 22 


Equation (18) is highly unstable, and the behavior shown in this example is typical 
of unstable problems. One can track a solution accurately for a while, and the interval 
can be extended by using smaller step sizes or more accurate methods, but eventually 
the instability in the problem itself takes over and leads to large errors. 

Some Comments on Numerical Methods. In this chapter we have introduced several nu- 
merical methods for approximating the solution of an initial value problem. We 
have tried to emphasize some important ideas while maintaining a reasonable level 
of complexity. For one thing, we have always used a uniform step size, whereas 
production codes that are currently in use provide for varying the step size as the 
calculation proceeds. 

There are several considerations that must be taken into account in choosing step 
sizes. Of course, one is accuracy; too large a step size leads to an inaccurate result. 
Normally, an error tolerance is prescribed in advance, and the step size at each step 
must be consistent with this requirement. As we have seen, the step size must also be 
chosen so that the method is stable. Otherwise, small errors will grow and soon render 
the results worthless. Finally, for implicit methods an equation must be solved at each 
step, and the method used to solve the equation may impose additional restrictions 
on the step size. 

In choosing a method, one must also balance the considerations of accuracy and sta- 
bility against the amount of time required to execute each step. An implicit method, 
such as the Adams-Moulton method, requires more calculations for each step, but 
if its accuracy and stability permit a larger step size (and consequently fewer steps), 
then this may more than compensate for the additional calculations. The backward 
differentiation formulas of moderate order, say, four, are highly stable and are there- 
fore indicated for stiff problems, for which stability is the controlling factor. 
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Some current production codes also permit the order of the method to be varied, 
as well as the step size, as the calculation proceeds. The error is estimated at each step, 
and the order and step size are chosen to satisfy the prescribed error tolerance. In 
practice, Adams methods up to order twelve and backward differentiation formulas 
up to order five are in use. Higher order backward differentiation formulas are 
unsuitable because of a lack of stability. 

Finally, we note that the smoothness of the function / — that is, the number of 
continuous derivatives that it possesses — is a factor in choosing the order of the 
method to be used. High order methods lose some of their accuracy if / is not 
smooth to a corresponding order. 


1. To obtain some idea of the possible dangers of small errors in the initial conditions, such 
as those due to round-off, consider the initial value problem 

y' = t + y- 3, y(0)=2. 

(a) Show that the solution is y = 4>i(t) = 2 — t. 

(b) Suppose that in the initial condition a mistake is made, and 2.001 is used instead of 2. 
Determine the solution y = 02(f) in this case, and compare the difference 02(0 — 0i(O at 
t = 1 and as t -> oo. 

2. Consider the initial value problem 

y = t 2 + e y , y(0) = 0. (i) 

Using the Runge-Kutta method with step size h, we obtain the results in Table 8.5.5. These 
results suggest that the solution has a vertical asymptote between t = 0.9 and t = 1.0. 


TABLE 8.5.5 Approximations to the 
Solution of the Initial Value Problem 
y' = t 2 + e y , y( 0) = 0 Using the 
Runge-Kutta Method 


h 

t = 0.90 

t = 1.0 

0.02 

3.42985 

> 10 38 

0.01 

3.42982 

> 10 38 


(a) Let y = 0(f) be the solution of problem (i). Further, let y = 0i(O be the solution of 


y' = 1 + e y , y( 0) = 0, 

(ii) 

and let y = 0 2 (t) be the solution of 


y' = y(0) = 0. 

(hi) 

Show that 


VI 

¥ 

VI 

<N 

(iv) 


on some interval, contained in 0 < t < 1, where all three solutions exist. 
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(b) Determine (j>\ (t) and 02(0- Then show that 0(f) —>■ oo for some t between 
t — In 2 = 0.69315 and t = 1. 

(c) Solve the differential equations y' = e y and y' = 1 + e y , respectively, with the initial 
condition y(0.9) = 3.4298. Use the results to show that 0(f) — > oo when t = 0.932. 

3. Consider again the initial value problem (16) from Example 2. Investigate how small a 
step size h must be chosen to ensure that the error at f = 0.05 and at f = 0.1 is less than 
0.0005. 

(a) Use the Euler method. 

(b) Use the backward Euler method. 

(c) Use the Runge-Kutta method. 

4. Consider the initial value problem 

y' = -10y + 2.5f 2 + 0.5f, y(0) = 4. 

(a) Find the solution y = 0(f) and draw its graph for 0 < t < 5. 

(b) The stability analysis in the text suggests that, for this problem, the Euler method is 
stable only for h < 0.2. Confirm that this is true by applying the Euler method to this 
problem for 0 < t < 5 with step sizes near 0.2. 

(c) Apply the Runge-Kutta method to this problem for 0 < t < 5 with various step sizes. 
What can you conclude about the stability of this method? 

(d) Apply the backward Euler method to this problem for 0 < f < 5 with various step 
sizes. What step size is needed to ensure that the error at f = 5 is less than 0.01? 

In each of Problems 5 and 6: 

(a) Find a formula for the solution of the initial value problem, and note that it is independent 
of X. 

(b) Use the Runge-Kutta method with h = 0.01 to compute approximate values of the solu- 
tion for 0 < t < 1 for various values of X such as X = 1, 10, 20, and 50. 

(c) Explain the differences, if any, between the exact solution and the numerical approxima- 
tions. 

5. y' — Xy — 1 — Xt, y(0) =0 6. y' — Xy = 2t — Xt 2 , y(0) = 0 


8.6 Systems of First Order Equations 


In the preceding sections we discussed numerical methods for approximating the 
solution of an initial value problem associated with a first order differential equation. 
These methods can also be applied to a system of first order equations. Since a 
higher order equation can always be reduced to a system of first order equations, it 
is sufficient to deal with systems of first order equations alone. For simplicity, we 
consider a system of two first order equations 

x' = f(t,x,y), y' = g(t,x,y), (1) 


with the initial conditions 


x(t 0 ) = x 0 , 


y(t 0 ) = y 0 . 


( 2 ) 
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The functions / and g are assumed to satisfy the conditions of Theorem 7.1.1 so that 
the initial value problem (f), (2) has a unique solution in some interval of the /-axis 
containing the point to. We wish to determine approximate values *i,x 2 , . . . ,x n , . . . 
and vi,y 2) . . . ,y n , ■ ■ ■ of the solution* = <p(t),y = t/f(t) at the points t n — t 0 + nh with 
n — 1,2, ... . 

In vector notation the initial value problem (I), (2) can be written as 

x' = f(/,x), x(/o)=x 0 , (3) 

where x is the vector with components * and y, f is the vector function with com- 
ponents / and g, and x 0 is the vector with components * 0 and yo- The methods of 
the previous sections can be readily generalized to handle systems of two (or more) 
equations. All that is needed (formally) is to replace the scalar variable * by the vec- 
tor x and the scalar function / by the vector function f in the appropriate equations. 
For example, the Euler formula becomes 


X/z+ ! — X tt + lit n , 


( 4 ) 


or, in component form, 

( X "+A = f'M + h (f^ x n,y n )\ ( 5 ) 

\y n +ij \y n ) \g(tn,x„,y n )J 

The initial conditions are used to determine fo, which is the vector tangent to the 
graph of the solution x = 0(/) in the xy-plane. We move in the direction of this 
tangent vector for a time step h in order to find the next point xj . Then we calculate 
a new tangent vector fi, move along it for a time step h to find x 2 , and so forth. 

In a similar way, the Runge-Kutta method can be extended to a system. For the 
step from /„ to t n+ \ we have 


x„+i — x„ -f- (/z/6)(k„i 4- 2k, l2 + 2k„3 T k ,,4 ) , (6) 

where 

= - X„), 

k„2 = f [tn + (h/ 2),x„ + (h/ 2)k„i], 

(7) 

k„3 = f[tn + (h/ 2),x„ + (/z/2)k„ 2 ], 
k/;4 — f (tn T h.X n T /Zk, j3 ). 

The formulas for the Adams-Moulton predictor-corrector method as it applies to 
the initial value problem (1), (2) are given in Problem 9. 

The vector equations (3), (4), (6), and (7) are, in fact, valid in any number of 
dimensions. All that is needed is to interpret the vectors as having n components 
rather than two. 


EXAMPLE 

1 


Determine approximate values of the solution x = <p(t),y = \[r(t) of the initial value problem 

x! = x - 4 y, y' - -x + y, (8) 

*(0) = 1, y(0) = 0, (9) 
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at the point t = 0.2. Use the Euler method with h = 0.1 and the Runge-Kutta method with 
h = 0.2. Compare the results with the values of the exact solution: 


m = 


fit) = 


(10) 


2 4 

Let us first use the Euler method. For this problem f n — x n — 4 y n and g n — —x n + y„; hence 

fo = 1 - (4)(0) = 1, go = — 1 + 0 = — 1. 

Then, from the Euler formulas (4) and (5), we obtain 

x\ = 1 T (0.1) (1) = 1.1, yi = 0 + (0.1)(-1) = -0.1. 

At the next step 


fi = 1.1 - (4)( — 0.1) = 1.5, gl = -1.1 + (-0.1) = -1.2. 


Consequently, 

x 2 = 1.1 + (0. 1) (1 .5) = 1.25, y 2 = -0.1 + (0.1)(— 1.2) = -0.22. 


The values of the exact solution, correct to eight digits, are 0(0.2) = 1.3204248 and 
0(0.2) = —0.25084701. Thus the values calculated from the Euler method are in error by 
about 0.0704 and 0.0308, respectively, corresponding to percentage errors of about 5.3% and 
12.3%. 

Now let us use the Runge-Kutta method to approximate 0(0.2) and 0(0.2). With h = 0.2 
we obtain the following values from Eqs. (7): 


km = 


kin = 


/( 1 . 0 ) 
g(!,0) 

/( 1 . 1 ,- 0 . 1 ) 


km = 


g(l.l,-0.1) 

/(1.15, —0.12) 
g(l .15, —0.12) 

/( 1.326,-0.254) 


k ° 4 \g( 1.326. 0.254) i 

Then, substituting these values in Eq. (6), we obtain 



/l\ 02 ( 9.602\ _ / 1.3200667 \ 

1^0/ + 6 \-7.52 ) ~ y— 0.25066667 ) ' 


These values of x\ and y\ are in error by about 0.000358 and 0.000180, respectively, with 
percentage errors much less than one-tenth of 1%. 

This example again illustrates the great gains in accuracy that are obtainable by using a more 
accurate approximation method, such as the Runge-Kutta method. In the calculations we have 
just outlined, the Runge-Kutta method requires only twice as many function evaluations as 
the Euler method, but the error in the Runge-Kutta method is about 200 times less than in 
the Euler method. 
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PROBLEMS 


41 

41 

41 

41 

41 

41 

41 


41 


41 


In each of Problems 1 through 6 determine approximate values of the solution x = 
y = y(t) of the given initial value problem at t = 0.2, 0.4, 0.6, 0.8, and 1.0. Compare the 
results obtained by different methods and different step sizes. 

(a) Use the Euler method with h = 0.1. 

(b) Use the Runge-Kutta method with h = 0.2. 

(c) Use the Runge-Kutta method with h = 0.1. 

1. x' = x + y + t, y' = Ax — 2y; x(0) = 1, y(0) = 0 

2. x! = 2x + ty, y = xy; x(0) = 1, y(0) = 1 

3. x' = -tx-y- 1, / = x; x(0) = 1, y(0) = 1 

4. x' = x - y + xy, y' = 3x - 2y - xy; x(0) = 0, y(0) = 1 

5. xf = x(l - 0.5x - 0.5y), y' = y(- 0.25 + 0.5x); jc( 0) = 4, y(0) = 1 

6. x! = exp(— x + y) — cosx, y' = sin(x — 3y); x(0) = 1, y(0) = 2 

7. Consider the example problem x! = x — 4y, y' = — x + y with the initial conditions 
x(0) = 1 and y(0) = 0. Use the Runge-Kutta method to find approximate values of the 
solution of this problem on the interval 0 < t < 1. Start with h = 0.2 and then repeat the 
calculation with step sizes h — 0.1,0.05, . . . , each half as long as in the preceding case. 
Continue the process until the first five digits of the solution at t = 1 are unchanged for 
successive step sizes. Determine whether these digits are accurate by comparing them 
with the exact solution given in Eqs. (10) in the text. 

8. Consider the initial value problem 

x" + t 2 x' + 3 x = t, x(0) = 1, x'(0) = 2. 

Convert this problem to a system of two first order equations, and determine approximate 
values of the solution at t = 0.5 and t = 1.0 using the Runge-Kutta method with h = 0.1. 

9. Consider the initial value problem x' =f(t,x,y) and y' = g(f,x,y) with x(t 0 ) = x 0 and 
y(t 0 ) — y 0 . The generalization of the Adams-Moulton predictor-corrector method of Sec- 
tion 8.4 is 


x n +i = x n + ya h(55f n - 59/„_i + 37/„_2 - 9/„_ 3 ), 
y«+i = y n + iih(55g n - 59 g„_! + 37g„_ 2 - 9 g n _ 3 ) 

and 

%n + 1 = “1“ 24 ^(^/n+l “1“ 19 f n 5 fn—\ “1“ fn—l)^ 

y n + 1 = y« + ^^(9g„+i + I9g„ - 5g„_i + g„_ 2 ). 

Determine an approximate value of the solution at t = 0.4 for the example initial value 
problem x' = x — 4y, y' = — x + y with x(0) = 1, y (0) = 0. Take h = 0.1. Correct the pre- 
dicted value once. For the values of x 1 , . . . ,y 3 use the values of the exact solution rounded 
to six digits: xj = 1.12735, x 2 = 1.32042, x 3 = 1.60021, yi = -0.111255, y 2 = -0.250847, 
and y 3 = —0.429696. 
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CHAPTER 


Nonlinear 
Differential 
Equations and 
Stability 


There are many differential equations, especially nonlinear ones, that are not suscepti- 
ble to analytical solution in any reasonably convenient manner. Numerical methods, 
such as those discussed in the preceding chapter, provide one means of dealing with 
these equations. Another approach, presented in this chapter, is geometrical in char- 
acter and leads to a qualitative understanding of the behavior of solutions rather 
than to detailed quantitative information. 


9.1 The Phase Plane: Linear Systems 


Since many differential equations cannot be solved conveniently by analytical meth- 
ods, it is important to consider what qualitative 1 information can be obtained about 
their solutions without actually solving the equations. The questions that we consider 
in this chapter are associated with the idea of stability of a solution, and the methods 


lr The qualitative theory of differential equations was created by Henri Poincare (1854—1912) in several 
major papers between 1880 and 1886. Poincare was a professor at the University of Paris and is generally 
considered the leading mathematician of his time. He made fundamental discoveries in several differ- 
ent areas of mathematics, including complex function theory, partial differential equations, and celestial 
mechanics. In a series of papers beginning in 1894 he initiated the use of modern methods in topology. 
In differential equations he was a pioneer in the use of asymptotic series, one of the most powerful tools 
of contemporary applied mathematics. Among other things, he used asymptotic expansions to obtain 
solutions about irregular singular points, thereby extending the work of Fuchs and Frobenius discussed in 
Chapter 5. 
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that we employ are basically geometrical. Both the concept of stability and the use 
of geometrical analysis were introduced in Chapter 1 and used in Section 2.5 for first 
order autonomous equations 

dy/dt — f(y). (1) 

In this chapter we refine the ideas and extend the discussion to systems of equations. 

We start with a consideration of the simplest system, namely, a second order linear 
homogeneous system with constant coefficients. Such a system has the form 

dx/dt = Ax, (2) 

where A is a 2 x 2 constant matrix and x is a 2 x 1 vector. Systems of this kind 
were solved in Sections 7.5 through 7.8. Recall that if we seek solutions of the form 
x = fje rt , then by substitution for x in Eq. (2) we find that 

(A - rl)$ = 0 . (3) 

Thus r must be an eigenvalue and § a corresponding eigenvector of the coefficient 
matrix A. The eigenvalues are the roots of the polynomial equation 

det(A — /-I) = 0, (4) 

and the eigenvectors are determined from Eq. (3) up to an arbitrary multiplicative 
constant. 

In Section 2.5 we found that points where the right side of Eq. (1) is zero are of 
special importance. Such points correspond to constant solutions, or equilibrium 
solutions, of Eq. (1) and are often called critical points. Similarly, for the system (2), 
points where Ax = 0 correspond to equilibrium (constant) solutions, and again they 
are called critical points. We will assume that A is nonsingular, or that det A ^ 0. It 
follows that x = 0 is the only critical point of the system (2). 

Recall that a solution of Eq. (2) is a vector function x = (f»(t) that satisfies the differ- 
ential equation. Such a function can be viewed as a parametric representation for a 
curve in the xiX 2 -plane. It is often useful to regard this curve as the path, or trajectory, 
traversed by a moving particle whose velocity dx/dt is specified by the differential 
equation. The xiX 2 -plane itself is called the phase plane, and a representative set of 
trajectories is referred to as a phase portrait. 

In analyzing the system (2), we must consider several different cases, depending 
on the nature of the eigenvalues of A. These cases also occurred in Sections 7.5 
through 7.8, where we were primarily interested in finding a convenient formula for 
the general solution. Now our main goal is to characterize the differential equation 
according to the geometric pattern formed by its trajectories. In each case we discuss 
the behavior of the trajectories in general and illustrate it with an example. It is 
important that you become familiar with the types of behavior that the trajectories 
have for each case, because these are the basic ingredients of the qualitative theory 
of differential equations. 

CASE 1 Real Unequal Eigenvalues of the Same Sign. The general solution of Eq. (2) is 

x = c^ a) e nt + c 2 ^ 2) e nt , (5) 

where r\ and rj are either both positive or both negative. Suppose first that 
r\ < r 2 < 0, and that the eigenvectors § (1) and fj (2) are as shown in Figure 9.1.1a. 
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It follows from Eq. (5) that x — »■ 0 as t — »■ oo regardless of the values of c\ and c 2 ; in 
other words, all solutions approach the critical point at the origin as t —*■ oo. If the 
solution starts at an initial point on the line through § (1) , then C 2 = 0. Consequently, 
the solution remains on the line through £ (l1 for all t and approaches the origin as 
t -» oo. Similarly, if the initial point is on the line through § <2) , then the solution 
approaches the origin along that line. In the general situation, it is helpful to rewrite 
Eq. (5) in the form 

x = e r2t [c^ {1) e (n - r2)t + c 2 $ (2) ], (6) 

Observe that r\ — r 2 < 0. Therefore, as long as c 2 / 0, the term ci£ (1) exp[(ri — r 2 )t] 
is negligible compared to c 2 § (2) for t sufficiently large. Thus, as t -» oo, the trajectory 
not only approaches the origin but also tends toward the line through £ (2) . Hence all 
solutions are tangent to £ (2 ' at the critical point except for those solutions that start 
exactly on the line through £ (1) . Several trajectories are sketched in Figure 9.1.1a. 
Some typical graphs of jq versus t are shown in Figure 9.1.1h, illustrating that all 
solutions exhibit exponential decay in time. The behavior of X2 versus f is similar. 
This type of critical point is called a node or a nodal sink. 

Let us now look backward in time and inquire what happens as t — »■ — oo. Still 
assuming that q < r 2 < 0, we note that if c\ ^ 0, then the dominant term in Eq. (5) 
as t —*■ — oo is the term involving e n ‘ . Thus, except for the trajectories lying along the 
line through i;' 2) , for large negative t the trajectories are very nearly parallel to the 
eigenvector £ (1) . This is also indicated in Figure 9.1.1a. 


x v 



t 



FIGURE 9.1.1 A node; q < r 2 < 0. (a) The phase plane, (b) x 1 versus t. 


If r\ and r 2 are both positive, and 0 <r 2 < q, then the trajectories have the same 
pattern as in Figure 9.1.1a, but the direction of motion is away from, rather than 
toward, the critical point at the origin. In this case ;q and x 2 grow exponentially as 
functions of t. Again the critical point is called a node or a nodal source. 

An example of a node occurs in Example 2 of Section 7.5, and its trajectories are 
shown in Figure 7.5.4. 

CASE 2 Real Eigenvalues of Opposite Sign. The general solution of Eq. (2) is 

x = Cl § ( V l( + c 2 $ ( V 2 ', (7) 

where q > 0 and r 2 < 0. Suppose that the eigenvectors § (1) and § (2> are as shown 
in Figure 9.1.2a. If the solution starts at an initial point on the line through \ 
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(a) ( b ) 

FIGURE 9.1.2 A saddle point; r\ > 0, r-i < 0. (a) The phase plane, (b) x\ versus t. 


then it follows that c 2 = 0. Consequently, the solution remains on the line through 
§ (1) for all t, and since /q > 0, ||x|| — »■ oo as t -» oo. If the solution starts at an initial 
point on the line through § (2) , then the situation is similar except that ||x|| -> 0 as 
t -> oo because r 2 < 0. Solutions starting at other initial points follow trajectories 
such as those shown in Figure 9.1.2a. The positive exponential is the dominant term 
in Eq. (7) for large t, so eventually all these solutions approach infinity asymptotic to 
the line determined by the eigenvector 1 ' corresponding to the positive eigenvalue 
r\. The only solutions that approach the critical point at the origin are those that start 
precisely on the line determined by £ (2> . Figure 9.1.2h shows some typical graphs of 
x\ versus t. For certain initial conditions the positive exponential term is absent 
from the solution, so xi — »■ 0 as / — »■ oo. For all other initial conditions the positive 
exponential term eventually dominates and causes xi to be unbounded. The behavior 
of x 2 is similar. The origin is called a saddle point in this case. 

An example of a saddle point occurs in Example 1 of Section 7.5, and its trajectories 
are shown in Figure 7.5.2. 

CASE 3 Equal Eigenvalues. We now suppose that r\ = r 2 = r. We consider the case in which 
the eigenvalues are negative; if they are positive, the trajectories are similar but the 
direction of motion is reversed. There are two subcases, depending on whether the 
repeated eigenvalue has two independent eigenvectors or only one. 

(a) Two independent eigenvectors. The general solution of Eq. (2) is 

x = Cl $ (1) e rt + c 2 $ (2) e rt , (8) 

where § (1) and £ (2) are the independent eigenvectors. The ratio x 2 /x\ is independent 
of t, but depends on the components of £ (l1 and £ (2) and on the arbitrary constants 
ci and c 2 . Thus every trajectory lies on a straight line through the origin, as shown 
in Figure 9.1.3a. Typical graphs of x\ or x 2 versus t are shown in Figure 9.1.3h. The 
critical point is called a proper node, or sometimes a star point. 

(b) One independent eigenvector. As shown in Section 7.8, the general solution 
of Eq. (2) in this case is 


x = Ci$e rt + c 2 (%te rt + ije rt ), 


( 9 ) 
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x v 






FIGURE 9.1.3 A proper node, two independent eigenvectors; r\ = r 2 < 0. (a) The phase 
plane. ( b ) x\ versus t. 

where § is the eigenvector and r] is the generalized eigenvector associated with the 
repeated eigenvalue. For large t the dominant term in Eq. (9) is C 2 $te rt . Thus, as 
t -> oo, every trajectory approaches the origin tangent to the line through the eigen- 
vector. This is true even if C 2 = 0, for then the solution x = c\%e rt lies on this line. 
Similarly, for large negative t the term C 2 %te rt is again the dominant one, so as t -» — oo, 
each trajectory is asymptotic to a line parallel to £. 

The orientation of the trajectories depends on the relative positions of § and t). 
One possible situation is shown in Figure 9.1.4a. To locate the trajectories, it is helpful 
to write the solution (9) in the form 

x = [(ci$ + c 2 ri) + c 2 i;t]e rt = ye rt , (10) 

where y = (ci§ + C 2 V) + c 2 ^t. Observe that the vector y determines the direction of 
x, whereas the scalar quantity e rt affects only the magnitude of x. Also note that, for 
fixed values of c\ and C 2 , the expression for y is a vector equation of the straight line 
through the point ci§ + c 2 r) and parallel to £. 

To sketch the trajectory corresponding to a given pair of values of c\ and c 2 , you 
can proceed in the following way. First, draw the line given by (ci§ + C 2 ij) + c 2 %t 
and note the direction of increasing t on this line. Two such lines are shown in 
Figure 9.1.4a, one for C 2 > 0 and the other for C 2 < 0. Next, note that the given 
trajectory passes through the point c\ | + c 2 j/ when t = 0. Further, as t increases, the 
direction of the vector x given by Eq. (10) follows the direction of increasing t on the 
line, but the magnitude of x rapidly decreases and approaches zero because of the 
decaying exponential factor e rt . Finally, as t decreases toward — oo, the direction of x 
is determined by points on the corresponding part of the line, and the magnitude of 
x approaches infinity. In this way we obtain the heavy trajectories in Figure 9.1.4a. 
A few other trajectories are lightly sketched as well to help complete the diagram. 
Typical graphs of x\ versus t are shown in Figure 9.1Ab. 

The other possible situation is shown in Figure 9.1.4c, where the relative orientation 
of § and i] is reversed. As indicated in the figure, this results in a reversal in the 
orientation of the trajectories. 

If n = r 2 > 0, you can sketch the trajectories by following the same procedure. In 
this event the trajectories are traversed in the outward direction, and the orientation 
of the trajectories with respect to that of (j and rj is also reversed. 
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FIGURE 9.1.4 An improper node, one independent eigenvector; r\=r 2 < 0. (a) The phase 
plane. ( b ) x\ versus t. (c) The phase plane. 


When a double eigenvalue has only a single independent eigenvector, the critical 
point is called an improper or degenerate node. A specific example of this case is 
Example 2 in Section 7.8; the trajectories are shown in Figure 7.8.2. 


CASE 4 Complex Eigenvalues. Suppose that the eigenvalues are X ± ip, where X and p are 
real, X ^ 0, and p > 0. It is possible to write down the general solution in terms of 
the eigenvalues and eigenvectors, as shown in Section 7.6. However, we proceed in 
a different way. 

Systems having the eigenvalues X ± ip are typified by 




( 11 ) 


x' 2 — —px i + Xxi. 


or, in scalar form, 


x[ — Xxi + px 2 , 


(12) 
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We introduce the polar coordinates r, 9 given by 

r 1 = x\ + jc 2 , tan0 = x 2 /x\. 

By differentiating these equations, we obtain 

rr' — xix[ + x 2 x' 2 , (sec 2 0)9' = (x\x ' 2 — x 2 x\)/x\. (13) 

Substituting from Eqs. (12) in the first of Eqs. (13), we find that 

/ = kr, (14) 

and hence 

r = ce xt , (15) 

where c is a constant. Similarly, substituting from Eqs. (12) in the second of Eqs. (13), 
and using the fact that sec 2 9 = r 2 /x 2 , we have 

O' = —fi. (16) 

Hence 

9 = ~iit + 9q, (17) 


where 9q is the value of 9 when t = 0. 

Equations (15) and (17) are parametric equations in polar coordinates of the tra- 
jectories of the system (11). Since p, > 0, it follows from Eq. (17) that 9 decreases 
as t increases, so the direction of motion on a trajectory is clockwise. As t oo, we 
see from Eq. (15) that r — »■ 0 if k < 0 and r — > oo if k > 0. Thus the trajectories are 
spirals, which approach or recede from the origin depending on the sign of k. Both 
possibilities are shown in Figure 9.1.5, along with some typical graphs of x\ versus t. 
The critical point is called a spiral point in this case. Frequently, the terms spiral sink 
and spiral source, respectively, are used to refer to spiral points whose trajectories 
approach, or depart from, the critical point. 

More generally, it is possible to show that for any system with complex eigenvalues 
k ± ill, where k ^ 0, the trajectories are always spirals. They are directed inward or 
outward, respectively, depending on whether k is negative or positive. They may be 
elongated and skewed with respect to the coordinate axes, and the direction of motion 
may be either clockwise or counterclockwise. While a detailed analysis is moderately 
difficult, it is easy to obtain a general idea of the orientation of the trajectories directly 
from the differential equations. Suppose that 


fdx/dt\ fa b\ fx\ 
\dy/dt) \c d) \y) 


(18) 


has complex eigenvalues k ± i/i, and look at the point (0, 1) on the positive y-axis. At 
this point it follows from Eqs. (18) that dx/dt = b and dy/dt — d. Depending on the 
signs of b and d, we can infer the direction of motion and the approximate orientation 
of the trajectories. For instance, if both b and d are negative, then the trajectories 
cross the positive y-axis so as to move down and into the second quadrant. If k < 0 
also, then the trajectories must be inward-pointing spirals resembling the one in 
Figure 9.1.6. Another case was given in Example 1 of Section 7.6, whose trajectories 
are shown in Figure 7.6.2. 
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FIGURE 9.1.5 A spiral point; r\ = X + ip, n = X — ip. (a) X < 0, the phase plane, (ft) X < 0, 
xi versus t. (c) X > 0, the phase plane. ( d ) X > 0, X\ versus t. 



FIGURE 9.1.6 A spiral point; r = A. =t ip with a < 0. 


CASE 5 Pure Imaginary Eigenvalues. In this case 1 = 0 and the system (11) reduces to 




(19) 


with eigenvalues ±ip. Using the same argument as in Case 4, we find that 

r' = 0, 9' = —p, 


( 20 ) 
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and consequently, 

r = c, 9 = -lit + 0 O , (21) 

where c and 6q are constants. Thus the trajectories are circles, with center at the origin, 
that are traversed clockwise if /x > 0 and counterclockwise if /x < 0. A complete 
circuit about the origin is made in a time interval of length 2i r//x, so all solutions are 
periodic with period 27r//x. The critical point is called a center. 

In general, when the eigenvalues are pure imaginary, it is possible to show (see 
Problem 19) that the trajectories are ellipses centered at the origin. A typical situation 
is shown in Figure 9.1.7, which also includes some typical graphs of x\ versus t. See 
also Example 3 in Section 7.6, especially Figures 7.6.3 and 7.6.4. 
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FIGURE 9.1.7 A center; r\ = i/x, r 2 = —in. (a) The phase plane, (b) x\ versus t. 


By reflecting on these five cases and by examining the corresponding figures, we 

can make several observations: 

1. After a long time, each individual trajectory exhibits one of only three types of behavior. 
Asf — ► oo, each trajectory approaches the critical point x = 0, repeatedly traverses a closed 
curve (corresponding to a periodic solution) that surrounds the critical point, or becomes 
unbounded. 

2. Viewed as a whole, the pattern of trajectories in each case is relatively simple. To be 
more specific, through each point (jt 0 ,.yo) in the phase plane there is only one trajectory; 
thus the trajectories do not cross each other. Do not be misled by the figures, in which it 
sometimes appears that many trajectories pass through the critical point x = 0. In fact, 
the only solution passing through the origin is the equilibrium solution x = 0. The other 
solutions that appear to pass through the origin actually only approach this point as t — >■ oo 
or t -> —oo. 

3. In each case the set of all trajectories is such that one of three situations occurs. 

(a) All trajectories approach the critical point x = 0 as t -»• oo. This is the case if the 
eigenvalues are real and negative or complex with negative real part. The origin is 
either a nodal or a spiral sink. 

(b) All trajectories remain bounded but do not approach the origin as t -> oo. This is 
the case if the eigenvalues are pure imaginary. The origin is a center. 

(c) Some trajectories, and possibly all trajectories except x = 0, become unbounded 
as t — > oo. This is the case if at least one of the eigenvalues is positive or if the 
eigenvalues have positive real part. The origin is a nodal source, a spiral source, or 
a saddle point. 
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The situations described in 3(a), (b), and (c) above illustrate the concepts of asymp- 
totic stability, stability, and instability, respectively, of the equilibrium solution x = 0 
of the system (2). The precise definitions of these terms are given in Section 9.2, but 
their basic meaning should be clear from the geometrical discussion in this section. 
The information that we have obtained about the system (2) is summarized in Table 
9.1.1. Also see Problems 20 and 21. 


TABLE 9.1.1 Stability Properties of Linear Systems x' = Ax with 
det(A — rl) = 0 and det A ^ 0 


Eigenvalues 

Type of Critical Point 

Stability 

n > r 2 > 0 

Node 

Unstable 

ri < r2 < 0 

Node 

Asymptotically stable 

/2 < 0 < T\ 

Saddle point 

Unstable 

ri = r 2 > 0 

Proper or improper node 

Unstable 

r\ = r 2 < 0 

Proper or improper node 

Asymptotically stable 

n,r 2 = X±ip 

Spiral point 


X > 0 


Unstable 

X < 0 


Asymptotically stable 

n = ip, r 2 = -ip 

Center 

Stable 


The analysis in this section applies only to second order systems x' = Ax whose 
solutions are represented geometrically as curves in the phase plane. A similar, 
though more complicated, analysis can be carried out for an nth order system, with 
an n x n coefficient matrix A, whose solutions are curves in an //-dimensional phase 
space. The cases that can occur in higher order systems are essentially combinations 
of those we have seen in two dimensions. For instance, in a third order system 
with a three-dimensional phase space, one possibility is that solutions in a certain 
plane may spiral toward the origin, while other solutions may tend to infinity along 
a line transverse to this plane. This would be the case if the coefficient matrix has 
two complex eigenvalues with negative real part and one positive real eigenvalue. 
Flowever, because of their complexity, we will not discuss systems of higher than 
second order. 


PROBLEMS For each of the systems in Problems 1 through 12: 

(a) Find the eigenvalues and eigenvectors. 

(b) Classify the critical point (0, 0) as to type and determine whether it is stable, asymptotically 
stable, or unstable. 

(c) Sketch several trajectories in the phase plane and also sketch some typical graphs of X\ 
versus t. 

(d) Use a computer to plot accurately the curves requested in part (c). 
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41 3 - 
41 5 - 
4t 7 - 
41 9 - 
41 n - 


dx 

dt 

dx 

dt 

dx 

dt 

dx 

dt 

dx 

dt 



41 4 - 
41 6 - 
41 8 - 
10 . 

41 12 . 


dx 

dt 

dx 

dt 

dx 

dt 

dx 

dt 

dx 

dt 



In each of Problems 13 through 16 determine the critical point x = x°, and then classify its 
type and examine its stability by making the transformation x = x° + u. 



17. The equation of motion of a spring-mass system with damping (see Section 3.7) is 


d 2 u du 

m—— + c— + ku = 0, 
dt- dt 


where m, c , and k are positive. Write this second order equation as a system of two 
first order equations for x = u, y = du/dt. Show that x = 0, y = 0 is a critical point, and 
analyze the nature and stability of the critical point as a function of the parameters m , c, 
and k. A similar analysis can be applied to the electric circuit equation (see Section 3.7) 



dl 1 

+ R— + -1 = 0. 
dt C 


18. Consider the system x' = Ax, and suppose that A has one zero eigenvalue. 

(a) Show that x = 0 is a critical point and that, in addition, every point on a certain straight 
line through the origin is also a critical point. 

(b) Let r\ = 0 and r 2 7^ 0, and let $ (1) and £ <2) be corresponding eigenvectors. Show that 
the trajectories are as indicated in Figure 9.1.8. What is the direction of motion on the 
trajectories? 

19. In this problem we indicate how to show that the trajectories are ellipses when the eigen- 
values are pure imaginary. Consider the system 



(i) 


(a) Show that the eigenvalues of the coefficient matrix are pure imaginary if and only if 


a u + a 22 = 0, 


(ii) 


a U a 22 ~ a 12 a 21 > 0 - 
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FIGURE 9.1.8 Nonisolated critical points; r\ — 0, r 2 ^ 0. Every point on the line through 
£ (1) is a critical point. 


(b) The trajectories of the system (i) can be found by converting Eqs. (i) into the single 
equation 

dy _ dy/dt _ a 2 ix + any 
dx dx/dt a\\x + any 

Use the first of Eqs. (ii) to show that Eq. (iii) is exact. 

(c) By integrating Eq. (iii), show that 

a .2 ix 2 + 2a 22 xy — any 2 = k, 

where k is a constant. Use Eqs. (ii) to conclude that the graph of Eq. (iv) is always an 
ellipse. 

Hint: What is the discriminant of the quadratic form in Eq. (iv)? 

20. Consider the linear system 

dx/dt = a\\x + any, dy/dt — a 2 \X + a 2 2 y, 

where an, . . . ,o 22 are real constants. Let p = an + a 22 , q = a n« 22 — ana 2 i, and 
A = p 2 — 4 q. Observe that p and q are the trace and determinant, respectively, of the 
coefficient matrix of the given system. Show that the critical point (0, 0) is a 

(a) Node if q > 0 and A > 0; (b) Saddle point if q < 0; 

(c) Spiral point if p ^ 0 and A < 0; (d) Center if p = 0 and q > 0. 

Hint: These conclusions can be obtained by studying the eigenvalues ri and r 2 . It may also 
be helpful to establish, and then to use, the relations r\r 2 = q and r t + r 2 = p. 

21. Continuing Problem 20, show that the critical point (0, 0) is 

(a) Asymptotically stable if q > 0 and p < 0; 

(b) Stable if q > 0 and p = 0; 

(c) Unstable if q < 0 or p > 0. 

The results of Problems 20 and 21 are summarized visually in Figure 9.1.9. 


(iii) 


(iv) 
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FIGURE 9.1.9 Stability diagram. 


9.2 Autonomous Systems and Stability 


In this section we begin to draw together, and to expand on, the geometrical ideas 
introduced in Section 2.5 for certain first order equations and in Section 9.1 for 
systems of two first order linear homogeneous equations with constant coefficients. 
These ideas concern the qualitative study of differential equations and the concept 
of stability, an idea that will be defined precisely later in this section. 

Autonomous Systems. We are concerned with systems of two simultaneous differential 
equations of the form 

dx/dt = F(x,y), dy /dt — G(x,y). (1) 

We assume that the functions F and G are continuous and have continuous partial 
derivatives in some domain D of the xy -plane. If (xo,yo) is a point in this domain, 
then by Theorem 7.1.1 there exists a unique solution x — (p(t),y — %l/(t) oi the system 
(1) satisfying the initial conditions 

x(to) = x 0 , y(t 0 ) = y 0 . (2) 

The solution is defined in some time interval I that contains the point to. 

Frequently, we will write the initial value problem (1), (2) in the vector form 

dx/dt = f(x), x(fo) = x°, (3) 

where x = x\ + yj, f(x) = F(x,y) i + G(x,y)j, and x° = xe\ + yoj. In this case the 
solution is expressed as x = </>(t), where $(Y) = cp(t)i+ i/r(f)j. As usual, we interpret a 
solution x = 0(f) as a curve traced by a moving point in the xy-plane, the phase plane. 
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Observe that the functions F and G in Eqs. (1) do not depend on the independent 
variable t, but only on the dependent variables x and y. A system with this property 
is said to be autonomous. The system 

x' = Ax, (4) 

where A is a constant 2x2 matrix, is a simple example of a two-dimensional au- 
tonomous system. On the other hand, if one or more of the elements of the co- 
efficient matrix A is a function of the independent variable t, then the system is 
nonautonomous. The distinction between autonomous and nonautonomous sys- 
tems is important because the geometrical qualitative analysis in Section 9.1 can be 
effectively extended to two-dimensional autonomous systems in general, but is not 
nearly as useful for nonautonomous systems. 

In particular, the autonomous system (1) has an associated direction field that is 
independent of time. Consequently, there is only one trajectory passing through each 
point (xo,yo) ' n the phase plane. In other words, all solutions that satisfy an initial 
condition of the form (2) lie on the same trajectory, regardless of the time t 0 at which 
they pass through (x 0 ,yo)- Thus, just as for the constant coefficient linear system (4), 
a single phase portrait simultaneously displays important qualitative information 
about all solutions of the system (1). We will see this fact confirmed repeatedly in 
this chapter. 

Autonomous systems occur frequently in applications. Physically, an autonomous 
system is one whose configuration, including physical parameters and external forces 
or effects, is independent of time. The response of the system to given initial condi- 
tions is then independent of the time at which the conditions are imposed. 

Stability and Instability. The concepts of stability, asymptotic stability, and instability 
have already been mentioned several times in this book. It is now time to give a 
precise mathematical definition of these concepts, at least for autonomous systems 
of the form 

x' = f(x). (5) 

In the following definitions, and elsewhere, we use the notation ||x|| to designate the 
length, or magnitude, of the vector x. 

The points, if any, where f(x) = 0 are called critical points of the autonomous 
system (5). At such points x' = 0 also, so critical points correspond to constant, or 
equilibrium, solutions of the system of differential equations. A critical point x° of 
the system (5) is said to be stable if, given any e > 0, there is a S > 0 such that every 
solution x = 0(f) of the system (1), which at t — 0 satisfies 

||0(O) — x°|| < <5, (6) 

exists for all positive t and satisfies 

110(0 -x°||<e (7) 

for all t > 0. This is illustrated geometrically in Figures 9.2.1a and 9.2.1 b. These 
mathematical statements say that all solutions that start “sufficiently close” (that is, 
within the distance 8) to x° stay “close” (within the distance e) to x°. Note that in 
Figure 9.2. In the trajectory is within the circle ||x — x°|| = <5 at t = 0 and, although 
it soon passes outside of this circle, it remains within the circle ||x — x°|| = e for all 
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t > 0. However, the trajectory of the solution does not have to approach the critical 
point x° as t — > oo, as illustrated in Figure 9.2.1 b. A critical point that is not stable is 
said to be unstable. 




FIGURE 9.2.1 (a) Asymptotic stability, (b) Stability. 


A critical point x° is said to be asymptotically stable if it is stable and if there exists 
a Sq (5o > 0) such that if a solution x = 0(f) satisfies 

||0(O) — x° || < <5 0 , (8) 

then 

lim 0(f) = x°. (9) 

t^-OO 

Thus trajectories that start “sufficiently close’’ to x° not only must stay “close” but 
must eventually approach x° as t — ► oo. This is the case for the trajectory in Fig- 
ure 9.2.1 a but not for the one in Figure 9.2. lb. Note that asymptotic stability is a 
stronger property than stability, since a critical point must be stable before we can 
even consider whether it might be asymptotically stable. On the other hand, the 
limit condition (9), which is an essential feature of asymptotic stability, does not by 
itself imply even ordinary stability. Indeed, examples can be constructed in which all 
the trajectories approach x° as f — > oo, but for which x° is not a stable critical point. 
Geometrically, all that is needed is a family of trajectories having members that start 
arbitrarily close to x° and then depart an arbitrarily large distance before eventually 
approaching x° as t -* oo. 

In this chapter we are concentrating on systems of two equations, but the definitions 
just given are independent of the size of the system. If you interpret the vectors in 
Eqs. (5) through (9) as rc-dimensional, then the definitions of stability, asymptotic 
stability, and instability apply also to systems of n equations. These definitions can 
be made more concrete by interpreting them in terms of a specific physical problem. 

The Oscillating Pendulum. The concepts of asymptotic stability, stability, and instability 
can be easily visualized in terms of an oscillating pendulum. Consider the configu- 
ration shown in Figure 9.2.2, in which a mass m is attached to one end of a rigid, but 
weightless, rod of length L. The other end of the rod is supported at the origin O , 
and the rod is free to rotate in the plane of the paper. The position of the pendulum 


500 


Chapter 9. Nonlinear Differential Equations and Stability 


is described by the angle 0 between the rod and the downward vertical direction, 
with the counterclockwise direction taken as positive. The gravitational force mg 
acts downward, while the damping force c\dQ/dt\, where c is positive, is always op- 
posite to the direction of motion. We assume that both 9 and dO /dt are positive. The 
equation of motion can be quickly derived from the principle of angular momentum, 
which states that the time rate of change of angular momentum about any point is 
equal to the moment of the resultant force about that point. The angular momentum 
about the origin is mL 2 (dO/dt), so the governing equation is 

j d 2 0 d6 

mL ——r — —cL— mgL sin0. (10) 

dt 1 dt 

The factors L and L sin 0 on the right side of Eq. (10) are the moment arms of the 
resistive force and of the gravitational force, respectively; the minus signs are due 
to the fact that the two forces tend to make the pendulum rotate in the clockwise 
(negative) direction. You should verify, as an exercise, that the same equation is 
obtained for the other three possible sign combinations of 6 and dQ/dt. 



FIGURE 9.2.2 An oscillating pendulum. 


By straightforward algebraic operations, we can write Eq. (10) in the standard 
form 


d 2 0 

dt 2 


c d9 
mL dt 


+ sin 9 — 0, 


( 11 ) 


or 


d 2 0 

dt 2 


d0 7 . 

y f or sin 6 = 0, 

dt 


( 12 ) 


where y = c/mL and co 2 = g/L. To convert Eq. (12) to a system of two first order 
equations, we let x = 0 and y = d0/df, then 


dx dy 2 . 

— =y, — = -orsm x-yy. (13) 


Since y and o> 2 are constants, the system (13) is an autonomous system of the 
form (1). 
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The critical points of Eqs. (13) are found by solving the equations 
y = 0, — to 2 sinx — yy = 0. 

We obtain y = 0 and x = ±m r , where n is an integer. These points correspond to two 
physical equilibrium positions, one with the mass directly below the point of support 
( 9 — 0) and the other with the mass directly above the point of support (6 = n). Our 
intuition suggests that the first is stable and the second is unstable. 

More precisely, if the mass is slightly displaced from the lower equilibrium posi- 
tion, it will oscillate back and forth with gradually decreasing amplitude, eventually 
approaching the equilibrium position as the initial potential energy is dissipated by 
the damping force. This type of motion illustrates asymptotic stability and is shown 
in Figure 9.2.3 a. 



(a) ( b ) (c) 

FIGURE 9.2.3 Qualitative motion of a pendulum, (a) With air resistance. ( b ) With or 
without air resistance, (c) Without air resistance. 

On the other hand, if the mass is slightly displaced from the upper equilibrium po- 
sition, it will rapidly fall, under the influence of gravity, and will ultimately approach 
the lower equilibrium position in this case also. This type of motion illustrates in- 
stability. See Figure 9.2.3 b. In practice, it is impossible to maintain the pendulum in 
its upward equilibrium position for any extended length of time without an external 
constraint mechanism, since the slightest perturbation will cause the mass to fall. 

Finally, consider the ideal situation in which the damping coefficient c (or y) is 
zero. In this case, if the mass is displaced slightly from its lower equilibrium position, 
it will oscillate indefinitely with constant amplitude about the equilibrium position. 
Since there is no dissipation in the system, the mass will remain near the equilibrium 
position but will not approach it asymptotically. This type of motion is stable but 
not asymptotically stable, as indicated in Figure 9.2.3c. In general, this motion is 
impossible to achieve experimentally, because the slightest degree of air resistance 
or friction at the point of support will eventually cause the pendulum to approach its 
rest position. 

Solutions of the pendulum equations are discussed in more detail in the next sec- 
tion. 

The Importance of Critical Points. Critical points correspond to equilibrium solutions, 
that is, solutions for which x(f) and y(t) are constant. For such a solution, the system 
described by x and y is not changing; it remains in its initial state forever. It might 
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seem reasonable to conclude that such points are not very interesting. Recall, how- 
ever, that for linear homogeneous systems with constant coefficients, x' = Ax, the 
nature of the critical point at the origin determines to a large extent the behavior of 
trajectories throughout the xy-plane. 

For nonlinear autonomous systems this is no longer true, for at least two reasons. 
First, there may be several, or many, critical points that are, so to speak, competing 
for influence on the trajectories. Second, the nonlinearities in the system are also of 
great importance, especially far away from the critical points. Nevertheless, critical 
points of nonlinear autonomous systems can be classified just as for linear systems. 
We will discuss this in detail in Section 9.3. Here we illustrate how it can be done 
graphically, assuming that you have software that can construct direction fields and 
perhaps plot good numerical approximations to a few trajectories. 


EXAMPLE 

1 


Consider the system 

dx/dt = —(x — y)(l — x — y), dy /dt = x{2 + y). (14) 

Find the critical points for this system and draw direction fields on rectangles containing the 
critical points. By inspecting the direction fields, classify each critical point as to type and state 
whether it is asymptotically stable, stable, or unstable. 

The critical points are found by solving the algebraic equations 

(x-y)(l-x-y) =0, x(2 + y) = 0. (15) 

One way to satisfy the second equation is by choosing x = 0. Then the first equation becomes 
y{ 1 — y) = 0,soy = Oory = 1. More solutions can be found by choosing y = —2 in the second 
equation. Then the first equation becomes (x + 2) (3 — x) = 0, so x = — 2 or x = 3. Thus we 
have obtained the four critical points (0, 0), (0, 1), (—2, —2), and (3, —2). 
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FIGURE 9.2.4 Direction field containing 
(0,0) and (0, 1). 



FIGURE 9.2.5 Direction field containing 
(—2, —2) and (3, —2). 
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EXAMPLE 

2 


Figure 9.2.4 shows a direction field containing the first two of the critical points. By com- 
paring this figure with those in Section 9.1 and in Chapter 7, it should be clear that the origin 
is a saddle point and that (0, 1) is a spiral point. Of course, the saddle point is unstable. The 
trajectories near the spiral point appear to be approaching this point, so we conclude that it is 
asymptotically stable. A direction field for the other two critical points is shown in Figure 9.2.5. 
Each of these points is a node. The arrows point toward the point (—2, —2) and away from the 
point (3, —2); thus the former is asymptotically stable and the latter is unstable. 


For a two-dimensional autonomous system with at least one asymptotically stable 
critical point, it is often of interest to determine where in the phase plane the tra- 
jectories lie that ultimately approach a given critical point. Let P be a point in the 
xy -plane with the property that a trajectory passing through P ultimately approaches 
the critical point as t -> oo. Then this trajectory is said to be attracted by the critical 
point. Further, the set of all such points P is called the basin of attraction or the 
region of asymptotic stability of the critical point. A trajectory that bounds a basin 
of attraction is called a separatrix because it separates trajectories that approach a 
particular critical point from other trajectories that do not do so. Determination 
of basins of attraction is important in understanding the large-scale behavior of the 
solutions of an autonomous system. 


Consider again the system (14) from Example 1. Describe the basin of attraction for each of 
the asymptotically stable critical points. 

Figure 9.2.6 shows a phase portrait for this system with a direction field in the background. 
Observe that there are two trajectories that approach the saddle point at the origin as t — > oo, 
One of these lies in the fourth quadrant and is almost a straight line from the unstable node 
at (3, —2). The other also originates at the unstable node, then passes into the first quadrant, 
loops around the spiral point, and ultimately approaches the saddle point through the second 
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FIGURE 9.2.6 Trajectories of the system (14). 
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quadrant. These two trajectories are separatrices; the region between them (but not including 
the separatrices themselves) is the basin of attraction for the spiral point at (0, 1). This region 
is shaded in Figure 9.2.6. 

The basin of attraction for the asymptotically stable node at (—2, —2) consists of the rest of 
the xy-plane, with only a handful of exceptions. The separatrices approach the saddle point, as 
we have noted already, rather than the node. The saddle point itself and the unstable node are 
equilibrium solutions, and thus remain fixed for all time. Finally, there is a trajectory lying on 
the line y = — 2 for x > 3 on which the direction of motion is always to the right; this trajectory 
also does not approach the point (—2, —2). 


Figures 9.2.4, 9.2.5, and 9.2.6 show that in the immediate vicinity of a critical point 
the direction field and pattern of trajectories resemble those for a linear system with 
constant coefficients. This becomes even more unmistakable if you use your software 
to zoom in closer and closer to a critical point. Thus we have visual evidence that a 
nonlinear system behaves very much like a linear system, at least in the neighborhood 
of a critical point. We will pursue this idea in the next section. 

Determination of Trajectories. The trajectories of a two-dimensional autonomous system 

dx/dt = F(x,y), dy/dt — G(x,y) (16) 

can sometimes be found by solving a related first order differential equation. From 
Eqs. (16) we have 

dy _ dy/dt _ G{x,y) 
dx dx/dt F(x,y)’ 

which is a first order equation in the variables x and y. Observe that such a reduction 
is not usually possible if F and G depend also on t. If Eq. (17) can be solved by any 
of the methods of Chapter 2, and if we write solutions (implicitly) in the form 

H(x,y) = c, (18) 

then Eq. ( 18) is an equation for the trajectories of the system (16). In other words, the 
trajectories lie on the level curves of Fl{x,y). Keep in mind that there is no general 
way of solving Eq. (17) to obtain the function H, so this approach is applicable only 
in special cases. 


EXAMPLE 

3 


Find the trajectories of the system 

dx/dt = y, dy/dt = x. 

In this case, Eq. (17) becomes 

dy x 
dx y 

This equation is separable since it can be written as 

ydy = x dx, 


(19) 

( 20 ) 


and its solutions are given by 


H(x,y) = y 2 — x 2 = c, 


( 21 ) 
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EXAMPLE 

4 


where c is arbitrary. Therefore the trajectories of the system (19) are the hyperbolas shown 
in Figure 9.2.7. The direction of motion on the trajectories can be inferred from the fact that 
both dx/dt and dy dt are positive in the first quadrant. The only critical point is the saddle 
point at the origin. 



FIGURE 9.2.7 Trajectories of the system (19). 


Another way to obtain the trajectories is to solve the system (19) by the methods of Sec- 
tion 7.5. We omit the details, but the result is 

x = Cie f + C 2 e~', y = c\e' — C 2 e~‘ . 

Eliminating t between these two equations again leads to Eq. (21). 


Find the trajectories of the system 


From the equations 


dx 

dt 


4-2 y, 


4 — 2 y = 0, 


dy 

dt 


12 - 3x 2 . 


12 - 3x 2 = 0 


( 22 ) 


we find that the critical points of the system (22) are the points (—2, 2) and (2, 2). To determine 
the trajectories, note that for this system Eq. (17) becomes 

dy 12 — 3x 2 
dx 4 — 2 y 


(23) 
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Separating the variables in Eq. (23) and integrating, we find that solutions satisfy 

H(x,y ) = 4 y — y * 1 2 — 12x + x 3 4 5 6 7 8 9 = c, (24) 

where c is an arbitrary constant. A computer plotting routine is helpful in displaying the level 
curves of H(x,y), some of which are shown in Figure 9.2.8. The direction of motion on the 
trajectories can be determined by drawing a direction field for the system (22), or by evaluating 
dx/dt and dy/dt at one or two selected points. From Figure 9.2.8 you can see that the critical 
point (2,2) is a saddle point and the point (—2,2) is a center. Observe that one trajectory 
leaves the saddle point (as t -> — oo), loops around the center, and returns to the saddle point 
(as t -> +oo). 



PROBLEMS 


In each of Problems 1 through 4 sketch the trajectory corresponding to the solution satisfying 
the specified initial conditions, and indicate the direction of motion for increasing t. 

1. dx/dt = — x, dy/dt = —2 y; x(0) = 4, y(0) = 2 

2. dx/dt = —x, dy/dt = 2 \y\ x(0) = 4, _y(0) = 2 and x(0) = 4, y(0) = 0 

3. dx/dt = —y, dy/dt = x; x(0) = 4, v(0) = 0 and x(0) = 0, y(0) = 4 

4. dx/dt = ay, dy/dt — —bx, a > 0, b > 0; x(0) = *] a , y(0) = 0 


For each of the systems in Problems 5 through 16: 

(a) Find all the critical points (equilibrium solutions). 

(b) Use a computer to draw a direction field and phase portrait for the system. 

(c) From the plot(s) in part (b), determine whether each critical point is asymptotically stable, 
stable, or unstable, and classify it as to type. 

(d) Describe the basin of attraction for each asymptotically stable critical point. 

5. dx/dt = x — xy, dy/dt = y + 2xy 

6. dx/dt = 1 + 2y, dy/dt = 1 — 3x 2 

7. dx/dt = 2x — x 2 — xy, dy/dt = 3y — 2 y 2 — 3 xy 

8. dx/dt = — (2 + y)(x + y), dy/dt = —y( 1 — x) 

9. dx/dt = y( 2 — x — y), dy/dt = — x — y — 2 xy 
(ffo 10- dx/dt = (2 + x)(y — x), dy/dt = y(2 + x — x 2 ) 
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11. dx/dt = —x + 2 xy. dy/dt = y — x 2 — y 2 

12. dx/dt = y, dy/dt = x — g x 3 — ly 

13. dx/dt = (2 + x)(y — x), dy/dt = (4 — x)(y + x) 

14. dx/dt = (2 — x){y — x), dy/dt = y(2 — x — x 2 ) 

15. dx/dt = x(2 — x — y), dy/dt = — x + 3_y — 2xy 

16. dx/df = x(2 — x — y), dy/dt = (1 — y) (2 + x) 


In each of Problems 17 through 24: 

(a) Find an equation of the form H(x,y) = c satisfied by the trajectories. 

(b) Plot several level curves of the function H. These are trajectories of the given system. 
Indicate the direction of motion on each trajectory. 

17. dx/dt = 2 y, dy/dt = 8x 18. dx/dt = 2 y, dy/dt = — 8x 

19. dx/dt = y, dy/dt = 2x + y 20. dx/dt = — x + y, dy/dt = — x — y 

21. dx/dt = — x + y + x 2 , dy/dt = y — 2xy 

22. dx/dt = 2x 2 y — 3x 2 — 4y, dy/dt = — 2xy 2 + 6xy 

23. Undamped pendulum: dx/dt = y, dy/dt = — sinx 

24. Duffing’s 2 equations: dx/dt = y, dy/dt = — x + (x 3 /6) 

25. Given thatx = c/>(t),y = i/r(r) is a solution of the autonomous system 


dx/dt = F(x,y ), dy/dt = G(x,y) 


for a < t < p, show that x = <t> ff) = 0(r — s), y = >P(r) = i/r(f — s) is a solution for 
<x + x<f< / 8+x for any real number 5. 

26. Prove that for the system 

dx/dt = F(x,y ), dy/dt = G(x,y) 


there is at most one trajectory passing through a given point (xo,yo). 

Hint: Let Co be the trajectory generated by the solution x = 0 O (?), y = ijr 0 (t), with 
(j> 0 (f 0 ) = xo, ^o(to) = yo, and let Cj be the trajectory generated by the solution x = 4>i(t), 
y = with <pi (q) = xo, Va (h) = yo- Use the fact that the system is autonomous, and 
also the existence and uniqueness theorem, to show that Co and Ci are the same. 

27. Prove that if a trajectory starts at a noncritical point of the system 

dx/dt = F(x,y), dy/dt — G(x,y), 


then it cannot reach a critical point (x 0 ,yo) in a finite length of time. 

Hint: Assume the contrary; that is, assume that the solution x = <p(t), y = i/r(f) satisfies 
<p(a) = xo, i //(a) = yo. Then use the fact thatx = xo,y = yo is a solution of the given system 
satisfying the initial condition x = x 0 ,y = y 0 at t = a. 

28. Assuming that the trajectory corresponding to a solution x = 4>(t).y = i !r{t),—oo < t < oo, 
of an autonomous system is closed, show that the solution is periodic. 

Hint: Since the trajectory is closed, there exists at least one point (x 0 ,yo) such that 
<p(t 0 ) =x 0 ,V f (to) = yo and a number T > 0 such that 4> (t 0 + T) =Xo,V r (fo + T) = yo- Show 
thatx = <J> (r) = <p{t + Dandy = >P(f) = xfr(t + T) is a solution, and then use the existence 
and uniqueness theorem to show that <t>(f) = cf>(t ) and T(f) = i jr(t) for all t. 


2 Georg Duffing (1861-1944), a German experimentalist, was a pioneer in the study of the oscillations of 

nonlinear mechanical systems. His most important work was an influential monograph published in 1918. 
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9.3 Locally Linear Systems 


In Section 9.1 we gave an informal description of the stability properties of the 
equilibrium solution x = 0 of the two-dimensional linear system 

x' = Ax. (1) 

The results are summarized in Table 9.1.1. Recall that we required that det A / 0, 
so x = 0 is the only critical point of the system (1). Now that we have defined the 
concepts of asymptotic stability, stability, and instability more precisely, we can restate 
these results in the following theorem. 

Theorem 9.3.1 Th e critical point x = 0 of the linear system (1) is asymptotically stable if the eigen- 
values r\, r 2 are real and negative or have negative real part; stable, but not asymp- 
totically stable, if r\ and r 2 are pure imaginary; unstable if r\ and r 2 are real and 
either is positive, or if they have positive real part. 


Effect of Small Perturbations. It is apparent from this theorem or from Table 9.1.1 that 
the eigenvalues r t , r 2 of the coefficient matrix A determine the type of critical point 
at x = 0 and its stability characteristics. In turn, the values of r\ and r 2 depend on the 
coefficients in the system (1). When such a system arises in some applied field, the 
coefficients usually result from the measurements of certain physical quantities. Such 
measurements are often subject to small uncertainties, so it is of interest to investigate 
whether small changes (perturbations) in the coefficients can affect the stability or 
instability of a critical point and/or significantly alter the pattern of trajectories. 

Recall that the eigenvalues r \ , r 2 are the roots of the polynomial equation 

det (A - rl) = 0. (2) 

It is possible to show that small perturbations in some or all the coefficients are 
reflected in small perturbations in the eigenvalues. The most sensitive situation 
occurs when r\ = ip and r 2 — —ip. that is, when the critical point is a center and the 
trajectories are closed curves (ellipses) surrounding it. If a slight change is made in 
the coefficients, then the eigenvalues r\ and r 2 will take on new values r\ = X' + ip' 
and r ' 2 = X' — ip', where X' is small in magnitude and p' = p (see Figure 9.3.1). If 
X' / 0, which almost always occurs, then the trajectories of the perturbed system are 
spirals rather than ellipses. The system is asymptotically stable if X' < 0 but unstable 
if X' > 0. Thus, in the case of a center, small perturbations in the coefficients may 
well change a stable system into an unstable one, and in any case may be expected 
to change the trajectories from ellipses to spirals (see Problem 27). 

Another slightly less sensitive case occurs if the eigenvalues r\ and r 2 are equal; 
in this case the critical point is a node. Small perturbations in the coefficients will 
normally cause the two equal roots to separate (bifurcate). If the separated roots 
are real, then the critical point of the perturbed system remains a node, but if the 
separated roots are complex conjugates, then the critical point becomes a spiral 
point. These two possibilities are shown schematically in Figure 9.3.2. In this case 
the stability or instability of the system is not affected by small perturbations in the 
coefficients, but the type of the critical point may be changed (see Problem 28). 
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r 2 = O,. 

0 a = -m 

FIGURE 9.3.1 Schematic perturbation of r\ = i/i, r 2 — —in- 
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FIGURE 9.3.2 
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0 
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X rf = A+i;i 
A 

O r 2 = X-in 


Schematic perturbation of ri = r 2 . 


In all other cases the stability or instability of the system is not changed, nor is the 
type of critical point altered, by sufficiently small perturbations in the coefficients of 
the system. For example, if ri and r 2 are real, negative, and unequal, then a small 
change in the coefficients will neither change the sign of r\ and r 2 nor allow them to 
coalesce. Thus the critical point remains an asymptotically stable node. 

Linear Approximations to Nonlinear Systems. Now let us consider a nonlinear two-dimen- 
sional autonomous system 

x' = f(x). (3) 

Our main object is to investigate the behavior of trajectories of the system (3) near 
a critical point x°. Recall that in Example 1 in Section 9.2 we noted that near each 
critical point of that nonlinear system the pattern of trajectories resembles the tra- 
jectories of a certain linear system. This suggests that near a critical point we may 
be able to approximate the nonlinear system (3) by an appropriate linear system, 
whose trajectories are easy to describe. The crucial question is whether and how we 
can find an approximating linear system whose trajectories closely match those of 
the nonlinear system near the critical point 

It is convenient to choose the critical point to be the origin. This involves no loss of 
generality, since if x° ^ 0, it is always possible to make the substitution u = x — x° in 
Eq. (3). Then u will satisfy an autonomous system with a critical point at the origin. 

First, let us consider what it means for a nonlinear system (3) to be “close” to a 
linear system (1). Accordingly, suppose that 

x' =Ax + g(x) (4) 

and that x = 0 is an isolated critical point of the system (4). This means that there 
is some circle about the origin within which there are no other critical points. In 
addition, we assume that det A / 0, so that x = 0 is also an isolated critical point 
of the linear system x' = Ax. For the nonlinear system (4) to be close to the linear 
system x' = Ax, we must assume that g(x) is small. More precisely, we assume that 
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the components of g have continuous first partial derivatives and satisfy the limit 
condition 

||g(x)||/||x|| -» 0 as x — 0; (5) 

that is, ||g|| is small in comparison to ||x|| itself near the origin. Such a system is called 
a locally linear system in the neighborhood of the critical point x = 0. 

It may be helpful to express the condition (5) in scalar form. If we let x r = (x,_y), 
then ||x|| = (x 2 + y 2 ) 1/2 = r. Similarly, if g r (x) = (gi(x,y), g 2 (x,y)), then 

|| g(x) || = [g 2 (x,y) + g\(x,y)] l/1 . Then it follows that condition (5) is satisfied if and 
only if 

8i(x,y)/r — »■ 0, g 2 (x,y)/r-y 0 as r -* 0. (6) 


Determine whether the system 





/ -x 2 -xy \ 

y— 0.75xy - 0.25 y 2 ) 


(V) 


is locally linear in the neighborhood of the origin. 

Observe that the system (7) is of the form (4), that (0, 0) is a critical point, and that det A^0. 
It is not hard to show that the other critical points of Eqs. (7) are 
(0, 2), (1, 0), and (0.5, 0.5); consequently, the origin is an isolated critical point. In check- 
ing the condition (6), it is convenient to introduce polar coordinates by letting x = rcos9, 
y — r sin 9 . Then 

gi(x,y) —x 2 —xy — r 2 cos 2 9 — r 2 sin 9 cos 9 
r r r 

= — r(cos 2 9 + sin 9 cos 9) — > 0 


as r — ¥■ 0. In a similar way you can show that g 2 (x,y)/r — >■ 0 as r — >• 0. Hence the system (7) is 
locally linear near the origin. 


The motion of a pendulum is described by the system [see Eq. (13) of Section 9.2] 

dx dy , . 

— =y, — = -a/sin x-yy. (8) 

The critical points are (0, 0), (±tt, 0), (±27r, 0), . . . , so the origin is an isolated critical point of 
this system. Show that the system is locally linear near the origin. 

To compare Eqs. (8) with Eq. (4), we must rewrite the former so that the linear and nonlinear 
terms are clearly identified. If we write sinx = x + (sinx — x) and substitute this expression 
in the second of Eqs. (8), we obtain the equivalent system 



On comparing Eq. (9) with Eq. (4), we see that gi(x, y) = 0 and gn{x,y) — — w 2 (sinx — x). 
From the Taylor series for sin x, we know that sinx — x behaves like — x 3 /3! = — (r 3 cos 3 9)/ 3! 
when x is small. Consequently, (sinx — x)/r — > 0 as r 0. Thus the conditions (6) are satisfied 
and the system (9) is locally linear near the origin. 
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Let us now return to the general nonlinear system (3), which we write in the scalar 
form 

x' = F{x,y), y' = G(x,y). (10) 

The system (10) is locally linear in the neighborhood of a critical point (xo,yo) when- 
ever the functions F and G have continuous partial derivatives up to order two. 
To show this, we use Taylor expansions about the point (xo,yo) to write F(x,y ) and 
G{x,y) in the form 


F(x,y) = F(x 0 .yo) + F x (x 0 ,y 0 )(x - x 0 ) + F y (x 0 ,y 0 )(y - y 0 ) + m(x,y), 

G(x,y ) = G(x 0 ,y 0 ) + G x (x 0 ,y 0 )(x - x 0 ) + G y (x 0 ,y 0 )(y - y 0 ) + r} 2 (x,y), 

where rn(x,y)/[(x - xq ) 2 + (y-yo) 2 ] 1/2 ^0 as (x,y) ->• (x 0 ,y 0 ), and similarly for 
i) 2 - Note that F(x 0 ,yo) — G(x 0 ,y 0 ) = 0, and that dx/dt = d(x — x 0 )/dt and 
dy/dt = d(y — yo)/dt. Then the system (10) reduces to 

d_/x-x 0 \ _ / F x {x 0 ,y 0 ) , 1;L , 

dt \y - yoj \ G A:(^o,yo) G y (x 0 ,y 0 )J \y - y 0 J \in(x,y)J ’ 

or, in vector notation, 

du d 1 .. 

— =—(x)u + I?(x), (12) 

dt dx 

where u = (x - x 0 ,y - yo) T and j] = (rn, ii 2 ) t ■ 

The significance of this result is twofold. First, if the functions F and G are twice 
differentiable, then the system (10) is locally linear, and it is unnecessary to resort 
to the limiting process used in Examples 1 and 2. Second, the linear system that 
approximates the nonlinear system (10) near (x 0 ,y 0 ) is given by the linear part of 
Eqs. (11) or (12): 


d l_ / U] \ _ / F x (x 0 ,yo ) E y (.v 0 ,yo)\ /wA 
dt \u 2 ) \G x (x 0 .y 0 ) G y (xo,yo)J \u 2 J 


(13) 


where u\ = x — xq and u 2 = y — yo. Equation (13) provides a simple and general 
method for finding the linear system corresponding to a locally linear system near a 
given critical point. 

The matrix 


J = 




(14) 


which appears as the coefficient matrix in Eq. (13), is called the Jacobian 3 matrix of 
the functions F and G with respect to x and y. We need to assume that det(J) is not 
zero at (xo,yo) so that this point is also an isolated critical point of the linear system 
(13). 


3 Carl Gustav Jacob Jacobi (1804-1851), a German analyst who was professor and lecturer at the Uni- 
versities of Konigsberg and Berlin, made important contributions to the theory of elliptic functions. The 
determinant of J and its extension to n functions of;; variables is called the Jacobian because of his notable 
paper in 1841 on the properties of this determinant. The corresponding matrix is also named for Jacobi 
although matrices were not developed until after his death. 
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Theorem 9.3.2 


Use Eq. (13) to find the linear system corresponding to the pendulum equations (8) near the 
origin; near the critical point (n, 0). 

In this case we have, from Eq. (8), 

F(x,y ) = y , G(x,y) = — of sinx — yy; (15) 


since these functions are differentiable as many times as necessary, the system (8) is locally 
linear near each critical point. The derivatives of F and G are 

F x = 0, F y = 1, G x — —co 2 cosx, G y = —y. (16) 


Thus, at the origin the corresponding linear system is 


d lx 
dt l v 




(17) 


which agrees with Eq. (9). 

Similarly, evaluating the partial derivatives in Eq. (16) at (n, 0), we obtain 


d 

dt 






(18) 


where u = x — tt, v = y. This is the linear system corresponding to Eqs. (8) near the point 
(7T,0). 


We now return to the locally linear system (4). Since the nonlinear term g(x) is 
small compared to the linear term Ax when x is small, it is reasonable to hope that 
the trajectories of the linear system (1) are good approximations to those of the 
nonlinear system (4), at least near the origin. This turns out to be true in many (but 
not all) cases, as the following theorem states. 


Let ri and r 2 be the eigenvalues of the linear system (1) corresponding to the locally 
linear system (4). Then the type and stability of the critical point (0, 0) of the linear 
system (1) and the locally linear system (4) are as shown in Table 9.3.1. 


At this stage, the proof of Theorem 9.3.2 is too difficult to give, so we will accept the 
results without proof. The statements for asymptotic stability and instability follow as 
a consequence of a result discussed in Section 9.6, and a proof is sketched in Problems 
10 to 12 of that section. Essentially, Theorem 9.3.2 says that for small x (or x — x°) 
the nonlinear terms are also small and do not affect the stability and type of critical 
point as determined by the linear terms except in two sensitive cases: r\ and r 2 pure 
imaginary, and and r 2 real and equal. Recall that earlier in this section we stated 
that small perturbations in the coefficients of the linear system (1), and hence in the 
eigenvalues ri and r 2 , can alter the type and stability of the critical point only in these 
two cases. It is reasonable to expect that the small nonlinear term in Eq. (4) might 
have a similar substantial effect, at least in these two cases. This is so, but the main 
significance of Theorem 9.3.2 is that in all other cases the small nonlinear term does 
not alter the type or stability of the critical point. Thus, except in the two sensitive 
cases, the type and stability of the critical point of the nonlinear system (4) can be 
determined from a study of the much simpler linear system (1). 
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TABLE 9.3.1 Stability and Instability Properties of Linear and Locally Linear Systems 


n ,r 2 


Linear System 

Locally Linear System 

Type 

Stability 

Type 

Stability 

ri > r2 > 0 

N 

Unstable 

N 

Unstable 

r x < r 2 < 0 

N 

Asymptotically 

N 

Asymptotically 



stable 


stable 

r 2 < 0 < r\ 

SP 

Unstable 

SP 

Unstable 

n =r 2 > 0 

PN or IN 

Unstable 

N or SpP 

Unstable 

n =r 2 < 0 

PN or IN 

Asymptotically 

N or SpP 

Asymptotically 



stable 


stable 

n,r 2 = k± ipt 





k > 0 

SpP 

Unstable 

SpP 

Unstable 

k < 0 

SpP 

Asymptotically 

SpP 

Asymptotically 



stable 


stable 

II 

II 

■£' 

C 

Stable 

C or SpP 

Indeterminate 


Note: N, node; IN, improper node; PN, proper node; SP, saddle point; SpP, spiral point; 
C, center. 


Even if the critical point is of the same type as that of the linear system, the 
trajectories of the locally linear system may be considerably different in appearance 
from those of the corresponding linear system, except very near the critical point. 
However, it can be shown that the slopes at which trajectories “enter” or “leave” the 
critical point are given correctly by the linear system. 


Damped Pendulum. We continue the discussion of the damped pendulum begun in Ex- 
amples 2 and 3. Near the origin the nonlinear equations (8) are approximated by the 
linear system (17), whose eigenvalues are 


n ,r 2 


— y ± \/y 2 — 4<w 2 
2 


(19) 


The nature of the solutions of Eqs. (8) and (17) depends on the sign of y 2 — 4a> 2 as 
follows: 


1. If y 2 — 4 or > 0, then the eigenvalues are real, unequal, and negative. The critical point 
(0, 0) is an asymptotically stable node of the linear system (17) and of the locally linear 
system (8). 

2. If y 2 — 4 a> 2 = 0, then the eigenvalues are real, equal, and negative. The critical point (0, 0) 
is an asymptotically stable (proper or improper) node of the linear system (17). It may be 
either an asymptotically stable node or spiral point of the locally linear system (8). 

3. If y 2 - 4 to 2 < 0, then the eigenvalues are complex with negative real part. The critical 
point (0,0) is an asymptotically stable spiral point of the linear system (17) and of the 
locally linear system (8). 

Thus the critical point (0, 0) is a spiral point of the system (8) if the damping is small 
and a node if the damping is large enough. In either case, the origin is asymptotically 
stable. 

Let us now consider the case y 2 — 4co 2 < 0, corresponding to small damping, in 
more detail. The direction of motion on the spirals near (0,0) can be obtained 
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directly from Eqs. (8). Consider the point at which a spiral intersects the positive 
y-axis (x = 0 and y > 0). At such a point it follows from Eqs. (8) that dx/dt > 0. Thus 
the point (x,y) on the trajectory is moving to the right, so the direction of motion on 
the spirals is clockwise. 

The behavior of the pendulum near the critical points (±«jr, 0), with n even, is the 
same as its behavior near the origin. We expect this on physical grounds since all these 
critical points correspond to the downward equilibrium position of the pendulum. 
The conclusion can be confirmed by repeating the analysis carried out above for the 
origin. Figure 9.3.3 shows the clockwise spirals at a few of these critical points. 


y 

m 1 , 


vs) 

-2k 

K ^ X 

2k 


FIGURE 9.3.3 Asymptotically stable spiral points for the damped pendulum. 


Now let us consider the critical point (7r,0). Flere the nonlinear equations (8) are 
approximated by the linear system (18), whose eigenvalues are 


n,r 2 


— y ± yV 2 + 4co 2 
2 


( 20 ) 


One eigenvalue (r\) is positive and the other (r 2 ) is negative. Therefore, regardless 
of the amount of damping, the critical point x = n, y = 0 is an unstable saddle point 
both of the linear system (18) and of the locally linear system (8). 

To examine the behavior of trajectories near the saddle point (7T, 0) in more detail, 
we write down the general solution of Eqs. (18), namely, 



= C i 



( 21 ) 


where Q and C 2 are arbitrary constants. Since r\ > 0 and r 2 < 0, it follows that the 
solution that approaches zero as t — »■ oo corresponds to C\ = 0. For this solution 
v/u — r 2 , so the slope of the entering trajectories is negative; one lies in the second 
quadrant (C 2 < 0), and the other lies in the fourth quadrant (C 2 > 0). For C 2 = 0 
we obtain the pair of trajectories “exiting” from the saddle point. These trajectories 
have slope r\ > 0; one lies in the first quadrant (Ci > 0), and the other lies in the 
third quadrant (Ci < 0). 

The situation is the same at other critical points (n:r,0) with n odd. These all 
correspond to the upward equilibrium position of the pendulum, so we expect them 
to be unstable. The analysis at (n,0) can be repeated to show that they are saddle 
points oriented in the same way as the one at (;r, 0). Diagrams of the trajectories in 
the neighborhood of two saddle points are shown in Figure 9.3.4. 



9.3 Locally Linear Systems 


515 



EXAMPLE 

4 


The equations of motion of a certain pendulum are 

dx/dt = y, dy/dt = — 9sinx — ^y, (22) 

where x = 9 and y = dB/dt. Draw a phase portrait for this system and explain how it shows 
the possible motions of the pendulum. 

By plotting the trajectories starting at various initial points in the phase plane, we obtain 
the phase portrait shown in Figure 9.3.5. As we have seen, the critical points (equilibrium 

solutions) are the points (m r,0), where n = 0, ±1,±2 Even values of n, including zero, 

correspond to the downward position of the pendulum, while odd values of n correspond to 
the upward position. Near each of the asymptotically stable critical points, the trajectories 
are clockwise spirals that represent a decaying oscillation about the downward equilibrium 
position. The wavy horizontal portions of the trajectories that occur for larger values of | y\ 
represent whirling motions of the pendulum. Note that a whirling motion cannot continue 
indefinitely, no matter how large |y| is; eventually the angular velocity is so much reduced by 
the damping term that the pendulum can no longer go over the top, and instead begins to 
oscillate about its downward position. 



FIGURE 9.3.5 Phase portrait for the damped pendulum of Example 4. 


The basin of attraction for the origin is the shaded region in Figure 9.3.5. It is bounded by 
the trajectories that enter the two adjacent saddle points at (jr,0) and ( — zr , 0). The bounding 
trajectories are separatrices. Each asymptotically stable critical point has its own basin of 
attraction, which is bounded by the separatrices entering the two neighboring saddle points. 
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All of the basins of attraction are congruent to the shaded one; the only difference is that they 
are translated horizontally by appropriate distances. Note that it is mathematically possible 
(but physically unrealizable) to choose initial conditions exactly on a separatrix so that the 
resulting motion leads to a balanced pendulum in a vertically upward position of unstable 
equilibrium. 


An important difference between nonlinear autonomous systems and the linear 
systems discussed in Section 9.1 is illustrated by the pendulum equations. Recall 
that the linear system (1) has only the single critical point x = 0 if det A ^ 0. Thus, 
if the origin is asymptotically stable, then not only do trajectories that start close to 
the origin approach it, but, in fact, every trajectory approaches the origin. In this 
case the critical point x = 0 is said to be globally asymptotically stable. This property 
of linear systems is not, in general, true for nonlinear systems, even if the nonlinear 
system has only one asymptotically stable critical point. Therefore, for nonlinear 
systems it is important to determine (or to estimate) the basin of attraction for each 
asymptotically stable critical point. 


PROBLEMS In each of Problems 1 through 4 verify that (0, 0) is a critical point, show that the system is 
locally linear, and discuss the type and stability of the critical point (0, 0) by examining the 
corresponding linear system. 

1. dx/dt = x — y 2 , dy/dt = x — 2y + x 2 

2. dx/dt = — x + y + 2 xy, dy/dt = — 4x — y + x 2 — y 2 

3. dx/dt = (1 + x) siny, dy/dt = 1 — x — cosy 

4. dx/dt = x + y 2 , dy/dt = x + y 

In each of Problems 5 through 18: 

(a) Determine all critical points of the given system of equations. 

(b) Find the corresponding linear system near each critical point. 

(c) Find the eigenvalues of each linear system. What conclusions can you then draw about 
the nonlinear system? 

(d) Draw a phase portrait of the nonlinear system to confirm your conclusions, or to extend 
them in those cases where the linear system does not provide definite information about the 
nonlinear system. 

5. dx/dt = (2 + x)(y — x), dy/dt = (4 — x)(y + x) 

6. dx/dt = x — x 2 — xy, dy/dt = 3 y — xy — 2y 2 

7. dx/dt = 1 — y, dy/dt = x 2 — y 2 

8. dx/dt = x — x 2 — xy, dy/dt = |y — ^y 2 — ^xy 

9. dx/dt = (2 + y)(y — 0.5x), dy/dt = (2 — x)(y + 0.5x) 

10. dx/dt = x + x 2 + y 2 , dy/dt = y — xy 

11. dx/dt = 2x + y + xy 3 , dy/dt = x — 2y — xy 

12. dx/dt = (1 + x) siny, dy/dt = 1 — x — cosy 

13. dx/dt = x — y 2 , dy/dt = y — x 2 

14. dx/dt = 1 — xy, dy/dt = x — y 3 

15. dx/dt = — 2x — y — x(x 2 + y 2 ), dy/dt = x — y + y(x 2 + y 2 ) 

16. dx/dt = y + x(l — x 2 — y 2 ), dy/dt = — x + y (1 — x 2 — y 2 ) 
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17. dx/dt = 4 — y 2 , dyldt — (1.5 + x){y — x) 

18. dx/dt = (1 — y)(2x — y), dy/dt = (2 + x)(x — 2 y) 

19. Consider the autonomous system 

dx/dt = y, dy/dt = x + 2x 3 . 

(a) Show that the critical point (0, 0) is a saddle point. 

(b) Sketch the trajectories for the corresponding linear system by integrating the equation 
for dy/dx. Show from the parametric form of the solution that the only trajectory on which 
x— >-0,y— >-0ast — >■ oo is y = — x. 

(c) Determine the trajectories for the nonlinear system by integrating the equation for 

dy/dx. Sketch the trajectories for the nonlinear system that correspond to y = — x and 

y = x for the linear system. 

20. Consider the autonomous system 

dx/dt = x, dy/dt = — 2y + x 3 . 

(a) Show that the critical point (0, 0) is a saddle point. 

(b) Sketch the trajectories for the corresponding linear system, and show that the trajec- 
tory for which x — > 0, y -> 0 as t — > oo is given by x = 0. 

(c) Determine the trajectories for the nonlinear system for x / 0 by integrating the equa- 
tion for dy/dx. Show that the trajectory corresponding to x = 0 for the linear system is 
unaltered, but that the one corresponding to y = 0 is y = x 3 /5. Sketch several of the 
trajectories for the nonlinear system. 

21. The equation of motion of an undamped pendulum is d 2 d/dt 2 + or sin# = 0, where 
co 2 = g/L. Letx = 6,y = dd/dt to obtain the system of equations 

dx/dt = y, dy/dt = —co 2 sinx. 

(a) Show that the critical points are (±nn , 0) , n = 0, 1 , 2, . . . , and that the system is locally 
linear in the neighborhood of each critical point. 

(b) Show that the critical point (0, 0) is a (stable) center of the corresponding linear system. 
Using Theorem 9.3.2, what can you say about the nonlinear system? The situation is similar 
at the critical points (±2rar, 0), n = 1, 2, 3, ... . What is the physical interpretation of these 
critical points? 

(c) Show that the critical point (jr,0) is an (unstable) saddle point of the corresponding 

linear system. What conclusion can you draw about the nonlinear system? The situa- 
tion is similar at the critical points [±(2 n — l)n ,Q],n — 1,2,3, What is the physical 

interpretation of these critical points? 

(d) Choose a value for co 2 and plot a few trajectories of the nonlinear system in the 
neighborhood of the origin. Can you now draw any further conclusion about the nature 
of the critical point at (0, 0) for the nonlinear system? 

(e) Using the value of co 2 from part (d), draw a phase portrait for the pendulum. Compare 
your plot with Figure 9.3.5 for the damped pendulum. 

22. (a) By solving the equation for dy/dx, show that the equation of the trajectories of the 
undamped pendulum of Problem 21 can be written as 

^y 2 + co 2 (l — cosx) = c, (i) 

where c is a constant of integration. 
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(b) Multiply Eq. (i) by mL 2 . Then express the result in terms of 6 to obtain 



2 


+ mgL(\ — cos 6) = E, 


(ii) 


where E = mL 2 c. 

(c) Show that the first term in Eq. (ii) is the kinetic energy of the pendulum and that the 
second term is the potential energy due to gravity. Thus the total energy E of the pendulum 
is constant along any trajectory; its value is determined by the initial conditions. 

23. The motion of a certain undamped pendulum is described by the equations 


dx/dt = y, dy/dt =—4smx. 


If the pendulum is set in motion with an angular displacement A and no initial velocity, 
then the initial conditions are x(0) = A,y(0) = 0. 

(a) Let A = 0.25 and plot x versus t. From the graph, estimate the amplitude R and 
period T of the resulting motion of the pendulum. 

(b) Repeat part (a) for A = 0.5, 1.0, 1.5, and 2.0. 

(c) How do the amplitude and period of the pendulum's motion depend on the initial 
position A1 Draw a graph to show each of these relationships. Can you say anything 
about the limiting value of the period as A — >■ 0? 

(d) Let A = 4 and plot x versus t. Explain why this graph differs from those in parts (a) 
and (b). For what value of A does the transition take place? 

24. Consider again the pendulum equations (see Problem 23) 


dx/dt = y, dy/dt =—4smx. 


If the pendulum is set in motion from its downward equilibrium position with angular 
velocity v, then the initial conditions are x(0) = 0,y(0) = v. 

(a) Plot x versus t for v = 2 and also for v = 5. Explain the differing motions of the 
pendulum that these two graphs represent. 

(b) There is a critical value of v, which we denote by v c , such that one type of motion 
occurs for v < v c and the other for v > v c . Estimate the value of v c . 

25. This problem extends Problem 24 to a damped pendulum. The equations of motion are 


dx/dt=y, dy/dt = — 4sinx — yy, 


where y is the damping coefficient, with the initial conditions x(0) = 0,y(0) = v. 

(a) For y = 1/4 plot x versus t for v = 2 and for v — 5. Explain these plots in terms of 
the motions of the pendulum that they represent. Also explain how they relate to the 
corresponding graphs in Problem 24(a). 

(b) Estimate the critical value v c of the initial velocity where the transition from one type 
of motion to the other occurs. 

(c) Repeat part (b) for other values of y and determine how v c depends on y. 

26. Theorem 9.3.2 provides no information about the stability of a critical point of a locally 
linear system if that point is a center of the corresponding linear system. That this must 
be the case is illustrated by the systems 


dx/dt = y + x(x 2 + y 2 ), 
dy/dt = — x + y(x 2 + y 2 ) 


and 
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dx/dt = y — x(x 2 + y 2 ), 
dy/dt = — x — y(x 2 + y 2 ). 

(a) Show that (0, 0) is a critical point of each system and, furthermore, is a center of the 
corresponding linear system. 

(b) Show that each system is locally linear. 

(c) Let r 2 = x 2 + y 2 , and note that x dx/dt + y dy/dt = r dr/dt. For system (ii) show that 
dr/dt < 0 and that r -> 0 as t — > oo; hence the critical point is asymptotically stable. For 
system (i) show that the solution of the initial value problem for r with r = r 0 at t = 0 
becomes unbounded as t — > 1/2 rg, and hence the critical point is unstable. 

27. In this problem we show how small changes in the coefficients of a system of linear equa- 
tions can affect a critical point that is a center. Consider the system 


x' = 



X. 


Show that the eigenvalues are ±i so that (0, 0) is a center. Now consider the system 



where |e| is arbitrarily small. Show that the eigenvalues are e ± i. Thus no matter how 
small |e | 0 is, the center becomes a spiral point. Ife < 0, the spiral point is asymptotically 

stable; if e > 0, the spiral point is unstable. 

28. In this problem we show how small changes in the coefficients of a system of linear equa- 
tions can affect the nature of a critical point when the eigenvalues are equal. Consider 
the system 



Show that the eigenvalues are r\ = — 1, ri = — 1 so that the critical point (0, 0) is an asymp- 
totically stable node. Now consider the system 



where |e| is arbitrarily small. Show that if e > 0, then the eigenvalues are —1 ± iyfe, so 
that the asymptotically stable node becomes an asymptotically stable spiral point. Ife < 0, 
then the roots are —1 ± and the critical point remains an asymptotically stable node. 

29. In this problem we derive a formula for the natural period of an undamped nonlinear 
pendulum [c = 0 in Eq. (10) of Section 9.2], Suppose that the bob is pulled through a 
positive angle a and then released with zero velocity. 

(a) We usually think of 9 and d9/dt as functions of t. However, if we reverse the roles of 
t and 9, we can regard t as a function of 9 and, consequently, can also think of d9/dt as a 
function of 9. Then derive the following sequence of equations: 


1 


d 


-mL — 
2 d9 



—mgL sin 9, 


1 

—m 

2 



— mgL(cos9 — cos a), 
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2 g *Jcos9 — cos a 


L d.9 


Why was the negative square root chosen in the last equation? 

(b) If T is the natural period of oscillation, derive the formula 



(c) By using the identities cos 9 = 1 — 2sin 2 (0/2) and cosa = 1 — 2sin 2 (a/2), followed 
by the change of variable sin(0/2) = /c sin 0 with k = sin(o!/2), show that 



The integral is called the elliptic integral of the first kind. Note that the period depends 
on the ratio L/g and also on the initial displacement a through k = sin(a!/2). 

(d) By evaluating the integral in the expression for T, obtain values for T that you can 
compare with the graphical estimates you obtained in Problem 23. 

30. A generalization of the damped pendulum equation discussed in the text, or a damped 
spring-mass system, is the Lienard 4 equation 


— +c(x)-+g(x) = 0. 


If c(x) is a constant and g(x) = kx, then this equation has the form of the linear pendu- 
lum equation [replace sin 9 with 9 in Eq. (12) of Section 9.2]; otherwise, the damping 
force c(x) dx/dt and the restoring force g(x) are nonlinear. Assume that c is continuously 
differentiable, g is twice continuously differentiable, and g(0) = 0. 

(a) Write the Lienard equation as a system of two first order equations by introducing 
the variable y = dx/dt. 

(b) Show that (0, 0) is a critical point and that the system is locally linear in the neighbor- 
hood of (0,0). 

(c) Show that if c(0) > 0 and g'(0) > 0, then the critical point is asymptotically stable, and 
that if c(0) < 0 or g'(0) < 0, then the critical point is unstable. 

Hint: Use Taylor series to approximate c and g in the neighborhood of x = 0. 


9.4 Competing Species 


In this section and the next, we explore the application of phase plane analysis to 
some problems in population dynamics. These problems involve two interacting 
populations and are extensions of those discussed in Section 2.5, which dealt with 
a single population. Although the equations discussed here are extremely simple 


4 Alfred-Marie Lienard (1869-1958), professor at l’Ecole des Mines in Paris, worked in electricity, mechan- 
ics, and applied mathematics. The results of his investigation of this differential equation were published 
in 1928. 
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compared to the very complex relationships that exist in nature, it is still possible to 
acquire some insight into ecological principles from a study of these model problems. 
The same, or similar, models have also been used to study other types of competitive 
situations, for instance, businesses competing in the same market. 

Suppose that in some closed environment there are two similar species competing 
for a limited food supply — for example, two species of fish in a pond that do not prey 
on each other but do compete for the available food. Letx andy be the populations of 
the two species at time t. As discussed in Section 2.5, we assume that the population 
of each of the species, in the absence of the other, is governed by a logistic equation. 
Thus 

dx / dt — x(e\ — o\x) , (la) 

dy/dt = y(e 2 - cr 2 y), (lb) 

respectively, where e\ and e 2 are the growth rates of the two populations, and e j /a\ 
and e 2 /a 2 are their saturation levels. However, when both species are present, each 
will tend to diminish the available food supply for the other. In effect, they reduce 
each other’s growth rates and saturation populations. The simplest expression for 
reducing the growth rate of species x due to the presence of species y is to replace 

the growth rate factor ei — o\x in Eq. (la) by ei — o\x — ayy, where a\ is a measure 

of the degree to which species y interferes with species x. Similarly, in Eq. (lb) we 
replace e 2 — a 2 y by e 2 — o 2 y — a 2 x. Thus we have the system of equations 

dx/dt — x(ei — <7 \X — a\ y), 

( 2 ) 

dy/dt = y(e 2 - a 2 y - a 2 x). 

The values of the positive constants €\, a\, ai, e 2 , o 2 , and <*2 depend on the particular 
species under consideration and in general must be determined from observations. 
We are interested in solutions of Eqs. (2) for which x and y are nonnegative. In the 
following two examples we discuss two typical problems in some detail. At the end 
of the section we return to the general equations (2). 


EXAMPLE 

1 


Discuss the qualitative behavior of solutions of the system 

dx/dt = x(l — x — y), 

(3) 

dy/dt = y(0.75 — y — 0.5x). 

We find the critical points by solving the system of algebraic equations 

x(l -x-y) = 0, y(0.75 -y- 0.5x) = 0. (4) 


The first equation can be satisfied by choosing x = 0; then the second equation requires that 
y = 0 or y = 0.75. Similarly, the second equation can be satisfied by choosing y = 0, and then 
the first equation requires that x = 0 or x = 1 . Thus we have found three critical points, namely, 
(0, 0), (0, 0.75), and (1,0). If neither x nor y is zero, then Eqs. (4) are also satisfied by solutions 
of the system 

1 — x — y = 0, 0.75 - y - 0.5* = 0, (5) 


which leads to a fourth critical point (0.5, 0.5). These four critical points correspond to equi- 
librium solutions of the system (3). The first three of these points involve the extinction of 
one or both species; only the last corresponds to the long-term survival of both species. Other 
solutions are represented as curves or trajectories in the xy-plane that describe the evolution 
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of the populations in time. To begin to discover their qualitative behavior, we can proceed in 
the following way. 

First observe that the coordinate axes are themselves trajectories. This follows directly 
from Eqs. (3) since dx/dt = 0 on the y-axis (where x = 0) and, similarly, dy/dt = 0 on the 
x-axis (where y — 0). Thus no other trajectories can cross the coordinate axes. For a population 
problem only nonnegative values of x and y are significant, and we conclude that any trajectory 
that starts in the first quadrant remains there for all time. 

A direction field for the system (3) in the positive quadrant is shown in Figure 9.4.1; the 
black dots in this figure are the critical points or equilibrium solutions. Based on the direction 
field, it appears that the point (0.5, 0.5) attracts other solutions and is therefore asymptotically 
stable, while the other three critical points are unstable. To confirm these conclusions, we can 
look at the linear approximations near each critical point. 
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FIGURE 9.4.1 Critical points and direction field for the system (3). 


The system (3) is locally linear in the neighborhood of each critical point. There are two 
ways to obtain the linear system near a critical point (X, Y). First, we can use the substitution 
x = X + u, y = Y + v in Eqs. (3), retaining only the terms that are linear in u and v. Alter- 
natively, we can evaluate the Jacobian matrix J at each critical point to obtain the coefficient 
matrix in the approximating linear system; see Eq. (13) in Section 9.3. When several critical 
points are to be investigated, it is usually better to use the Jacobian matrix. For the system (3) 
we have 

F(x,y) = x(l — x — y), G(x,y) = y(0.75 — y — 0.5x), (6) 

so 

1 — 2 x — y —x 

— 0.5y 0.75 -2 y- 0.5x 




We will now examine each critical point in turn. 
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x = 0, y = 0. This critical point corresponds to a state in which both species die as a result of 
their competition. Bysettingx = y = OinEq. (7), we see that near the origin the corresponding 
linear system is 

d (x\ _ (l 0 \ fx\ 

Jt \yj ~ VO 0.75 ; 


( 8 ) 


The eigenvalues and eigenvectors of the system (8) are 

(V) 


n = T 


? (1) = 


so the general solution of the system is 


= ci 


r 2 = 0.75, 


e' + c 2 | I e" 


S (2) = 


( 9 ) 


(10) 


Thus the origin is an unstable node of both the linear system (8) and the nonlinear system 
(3). In the neighborhood of the origin, all trajectories are tangent to the y-axis except for one 
trajectory that lies along the x-axis. 

x = 1, y = 0. This corresponds to a state in which species x survives the competition but 
species y does not. By evaluating J from Eq. (7) at (1, 0), we find that the corresponding linear 
system is 

d ( u \ _ [~ X -1 

Jt = I 0 0.25 J 


( 11 ) 


Its eigenvalues and eigenvectors are 

M = -1, ^ = 

and its general solution is 


r 2 = 0.25, 


$< 2 > = 


4 


= Cl 


e ' + c 2 


4 


( 12 ) 


(13) 


Since the eigenvalues have opposite signs, the point (1, 0) is a saddle point, and hence is an 
unstable equilibrium point of the linear system (11) and of the nonlinear system (3). The 
behavior of the trajectories near (1,0) can be seen from Eq. (13). If c 2 = 0, then there is one 
pair of trajectories that approaches the critical point along the x-axis. All other trajectories 
depart from the neighborhood of (1,0). As t — >■ — oo, one trajectory approaches the saddle 
point tangent to the eigenvector £ <2) whose slope is —1.25. 

x = 0,y = 0.75. In this case species y survives but x does not. The analysis is similar to that 
for the point (1, 0). The corresponding linear system is 


d lu 
dt In 

The eigenvalues and eigenvectors are 
n = 0.25, $ (1) = 


0.25 0 

-0.375 -0.75 


8 


r 2 = -0.75, 


£(2) _ 


(14) 


(15) 
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so the general solution of Eq. (14) is 



(16) 


Thus the point (0,0.75) is also a saddle point. All trajectories leave the neighborhood of this 
point except one pair that approaches along the y-axis. The trajectory that approaches the 
saddle point as t — >■ — oo is tangent to the line with slope —0.375 determined by the eigenvector 

* (1) . 


x = 0.5, y = 0.5. This critical point corresponds to a mixed equilibrium state, or coexis- 
tence, in the competition between the two species. The eigenvalues and eigenvectors of the 
corresponding linear system 


d (u\ _ /— 0.5 -0.5\ / u\ 

0.25 —0.5 / 


are 


n = (-2 + 721/4 = -0.146, 



r 2 = (-2 - V 2)/4 = - 0 . 854 , 



Therefore the general solution of Eq. (17) is 



(17) 


(18) 


(19) 


Since both eigenvalues are negative, the critical point (0.5, 0.5) is an asymptotically stable node 
of the linear system (17) and of the nonlinear system (3). All trajectories approach the critical 
point as t — >■ oo. One pair of trajectories approaches the critical point along the line with slope 



9.4 Competing Species 


525 


EXAMPLE 

2 


V2/2 determined from the eigenvector £ (2, . All other trajectories approach the critical point 
tangent to the line with slope — V2/2 determined from the eigenvector £ (1) . 

A phase portrait for the system (3) is shown in Figure 9.4.2. By looking closely at the 
trajectories near each critical point, you can see that they behave in the manner predicted by 
the linear system near that point. In addition, note that the quadratic terms on the right side 
of Eqs. (3) are all negative. Since for x and y large and positive these terms are the dominant 
ones, it follows that far from the origin in the first quadrant both x' and y' are negative; that 
is, the trajectories are directed inward. Thus all trajectories that start at a point (xo,yo) with 
x 0 > 0 and_y 0 > 0 eventually approach the point (0.5, 0.5). 


Discuss the qualitative behavior of the solutions of the system 

dx/dt = x(l — x — y), 

(20) 

dy/dt = y(0.5 - 0.25y - 0.75x), 

when x and y are nonnegative. Observe that this system is also a special case of the system (2) 
for two competing species. 

Once again, there are four critical points, namely, (0,0), (1,0), (0,2), and (0.5, 0.5), corre- 
sponding to equilibrium solutions of the system (20). Figure 9.4.3 shows a direction field for 
the system (20), together with the four critical points. From the direction field it appears that 
the mixed equilibrium solution (0.5, 0.5) is a saddle point, and therefore unstable, while the 
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FIGURE 9.4.3 Critical points and direction field for the system (20). 
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points (1, 0) and (0, 2) are asymptotically stable. Thus, for competition described by Eqs. (20), 
one species will eventually overwhelm the other and drive it to extinction. The surviving 
species is determined by the initial state of the system. To confirm these conclusions, we can 
look at the linear approximations near each critical point. For later use we record the Jacobian 
matrix J for the system (20): 


/ F x (x,y) F y (x,y ) 
\ G x (x,y) G y (x,y ) 


1 — 2 x — y — x 

— 0.75y 0.5 - 0.5y - 0.75x 


(21) 


x = 0, y = 0. Using the Jacobian matrix J from Eq. (21) evaluated at (0,0), we obtain the 
linear system 


d 

dt 




( 22 ) 


which is valid near the origin. The eigenvalues and eigenvectors of the system (22) are 


n = 1 , 




(23) 


so the general solution is 



(24) 


Therefore the origin is an unstable node of the linear system (22) and also of the nonlinear 
system (20). All trajectories leave the origin tangent to the y-axis except for one trajectory 
that lies along the x-axis. 

x = 1, y = 0. The corresponding linear system is 


d 


dt 



(25) 


Its eigenvalues and eigenvectors are 


n = - 1 , 



r 2 = -0.25, 



(26) 


and its general solution is 



(27) 


The point (1, 0) is an asymptotically stable node of the linear system (25) and of the nonlinear 
system (20). If the initial values of x andy are sufficiently close to (1,0), then the interaction 
process will lead ultimately to that state; that is, to the survival of species x and the extinction 
of species y. There is one pair of trajectories that approaches the critical point along the x-axis. 
All other trajectories approach (1,0) tangent to the line with slope —3/4 that is determined 
by the eigenvector £ <2) . 
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x = 0, y = 2. The analysis in this case is similar to that for the point (1, 0). The appropriate 
linear system is 


d 


dt 



(28) 


The eigenvalues and eigenvectors of this system are 


n = - 1 , 



r 2 = -0.5, 



(29) 


and its general solution is 



(30) 


Thus the critical point (0, 2) is an asymptotically stable node of both the linear system (28) and 
the nonlinear system (20). All trajectories approach the critical point tangent to the y-axis 
except for one trajectory that approaches along the line with slope 3. 


x = 0.5, y = 0.5. The corresponding linear system is 


d in 
dt Id 

The eigenvalues and eigenvectors are 
-5 + V57 


-0.5 -0.5 \ / u 

— 0.375 -0.125 Hd 


ri = 


ri = 


16 

—5 — V57 
16 


= 0.1594, 


= -0.7844, 


so the general solution is 
' u 


i (1) = 


(—3 — \/57)/8/ V— 1.3187 / ’ 


£ (2) = 


(—3 + \/57)/8 ) (0.5687/ ’ 


vl C ‘\ -1.3187 


0.1594r 

1 0.5687/ 


(31) 


(32) 


(33) 


Since the eigenvalues are of opposite sign, the critical point (0.5, 0.5) is a saddle point and 
therefore is unstable, as we had surmised earlier. All trajectories depart from the neighbor- 
hood of the critical point except for one pair that approaches the saddle point as t -> 00 . As 
they approach the critical point, the entering trajectories are tangent to the line with slope 
(-v/57 — 3)/8 = 0.5687 determined from the eigenvector £ (2) . There is also a pair of trajectories 
that approach the saddle point as t — ► — 00 . These trajectories are tangent to the line with 
slope —1.3187 corresponding to £ (1) 

A phase portrait for the system (20) is shown in Figure 9.4.4. Near each of the critical 
points the trajectories of the nonlinear system behave as predicted by the corresponding 
linear approximation. Of particular interest is the pair of trajectories that enter the saddle 
point. These trajectories form a separatrix that divides the first quadrant into two basins of 
attraction. Trajectories starting above the separatrix ultimately approach the node at (0,2), 
while trajectories starting below the separatrix approach the node at (1,0). If the initial state 
lies precisely on the separatrix, then the solution (x, y) will approach the saddle point as t — > 00 . 
Flowever, the slightest perturbation of the point (x, y) as it follows this trajectory will dislodge 
the point from the separatrix and cause it to approach one of the nodes instead. Thus, in 
practice, one species will survive the competition and the other will not. 
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Examples 1 and 2 show that in some cases the competition between two species 
leads to an equilibrium state of coexistence, while in other cases the competition 
results in the eventual extinction of one of the species. To understand more clearly 
how and why this happens, and to learn how to predict which situation will occur, it is 
useful to look again at the general system (2). There are four cases to be considered, 
depending on the relative orientation of the lines 

ei — o\x — ai y = 0 and ej — oiy — 012 X = 0, (34) 


as shown in Figure 9.4.5. These lines are called the x- and y-nullclines. respectively, 
because x! is zero on the first and y' is zero on the second. In each part of Figure 9.4.5 
the .r-nullclinc is the solid line and the y-nullcline is the dashed line. 

Let (X , Y) denote any critical point in any one of the four cases. As in Examples 
1 and 2, the system (2) is locally linear in the neighborhood of this point because the 
right side of each differential equation is a quadratic polynomial. To study the system 
(2) in the neighborhood of this critical point, we can look at the corresponding linear 
system obtained from Eq. (13) of Section 9.3 


dt \v 



€1-2(7! A - 011 Y 
-a 2 Y 


- ai X 

€2 — 2(72 y — 012X 


( 35 ) 
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FIGURE 9.4.5 The various cases for the competing-species system (2). The x-nullcline is 
the solid line and the y-nullcline is the dashed line. 


We now use Eq. (35) to determine the conditions under which the model described 
by Eqs. (2) permits the coexistence of the two species x and y. Of the four possible 
cases shown in Figure 9.4.5, coexistence is possible only in cases (c) and ( d ). In these 
cases the nonzero values of X and Y are obtained by solving the algebraic equations 
(34); the result is 


^ = 


ei<7 2 ~ pat 
<7102 - Ol\U2 


6 2 CTl ~ 6l«2 
<7102 - aia 2 


(36) 


Further, since e\ — o\X — a\Y = 0 and e 2 — g 2 Y — a 2 X — 0, Eq. (35) immediately 
reduces to 


d / u\ /—o\X — / u\ 

It W = \-<*2Y -<j 2 y) VuJ ' 

The eigenvalues of the system (37) are found from the equation 
r 2 + (g\X + a 2 Y)r + (gig 2 — a\ a 2 )XY = 0. 


(37) 


(38) 


Thus 


-{GiX + g 2 Y) ± y (oiX + g 2 Y) 2 - 4(cti g 2 - on a 2 )XY 


(39) 


If cti( 72 — aio !2 < 0, then the radicand of Eq. (39) is positive and greater than 
(rr i X + g 2 Y) 2 . Thus the eigenvalues are real and of opposite sign. Consequently, 
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the critical point (X, Y) is an (unstable) saddle point, and coexistence is not possi- 
ble. This is the case in Example 2, where oq = 1, oq = 1, oi = 0.25, «2 = 0.75, and 
<71(72 — oqoq = —0.5. 

On the other hand, if cti<t 2 — oqaq > 0, then the radicand of Eq. (39) is less than 
(ctiX + <72 T) 2 . Thus the eigenvalues are real, negative, and unequal, or complex with 
negative real part. A straightforward analysis of the radicand of Eq. (39) shows that 
the eigenvalues cannot be complex (see Problem 7). Thus the critical point is an 
asymptotically stable node, and sustained coexistence is possible. This is illustrated 
by Example 1, where <7! = 1, oq = 1, er 2 = 1, « 2 = 0.5, and eqe^ — oqaq = 0.5. 

Let us relate this result to Figures 9.4.5c and 9.4. 5d. In Figure 9.4.5c we have 

e l O A O 61 tA 

— > — or eioq > 62 < 7 i and — > — or 62011 > 6 i< 72 - (40) 

<7 1 Ot-2 <72 O' 1 

These inequalities, coupled with the condition that X and Y given by Eqs. (36) be 
positive, yield the inequality ay <72 < aqoq- Hence in this case the critical point is a 
saddle point. On the other hand, in Figure 9.4.5d we have 

H O , <?2 ei 

— < — or eio !2 < 62 < 7 i and — < — or 620*1 < ei<72. (41) 

<7 1 (X2 <72 0*1 

Now the condition that X and Y be positive yields < 7 i< 7 2 > aqaq- Hence the critical 
point is asymptotically stable. For this case we can also show that the other critical 
points (0, 0), (ei/cri, 0), and (0, 62 / 0 - 2 ) are unstable. Thus for any positive initial values 
of x and y, the two populations approach the equilibrium state of coexistence given 
by Eqs. (36). 

Equations (2) provide the biological interpretation of the result that whether co- 
existence occurs depends on whether < 7 i< 7 2 — oqaq is positive or negative. The cr’s are 
a measure of the inhibitory effect that the growth of each population has on itself, 
while the a’s are a measure of the inhibiting effect that the growth of each popula- 
tion has on the other species. Thus, when aqoq > oqoq, interaction (competition) is 
“weak" and the species can coexist; when < 7 i <72 < oqoq, interaction (competition) is 
“strong” and the species cannot coexist — one must die out. 


PROBLEMS Each of Problems 1 through 6 can be interpreted as describing the interaction of two species 
with populations x and y. In each of these problems carry out the following steps. 

(a) Draw a direction field and describe how solutions seem to behave. 

(b) Find the critical points. 

(c) For each critical point find the corresponding linear system. Find the eigenvalues and 
eigenvectors of the linear system; classify each critical point as to type, and determine whether 
it is asymptotically stable, stable, or unstable. 

(d) Sketch the trajectories in the neighborhood of each critical point. 

(e) Compute and plot enough trajectories of the given system to show clearly the behavior of 
the solutions. 

(f) Determine the limiting behavior of x and y as t -> 00 , and interpret the results in terms 
of the populations of the two species. 

1. dx/dt = x(1.5 — x — 0.5y) 
dy/dt = y(2 — y — 0.75x) 


4fl 2. dx/dt = x(1.5 — x — 0.5y) 
dy/dt = y(2 — 0.5y — 1.5x) 
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3. 

dx/dt 

= x(1.5 — 0.5x — y) 

41 

4. dx/dt 

= x(1.5 — 

0.5a - y) 



dy/dt 

— y( 2 — y — 1.125T) 


dy/dt 

— y(0.75 - 

- y — 0.125a) 

41 

5 . 

dx/dt 

= x(l — A — y) 

41 

6. dx/dt 

= X(1 — A 

+ 0.5)0 



dy/dt 

= y(l.S-y-x) 


dy/dt 

= y(. 2-5- 

1.5y + 0.25a) 


7 . 

Consider the eigenvalues given 

by Eq. (39) in the text. Show that 


0 o\X + ff2 Y) 2 — 4(iti<T 2 — ci 1 oi2)XY = ( a\X — (T 2 Y ) 2 + 4aia2XY. 


Hence conclude that the eigenvalues can never be complex. 

8. Two species of fish that compete with each other for food, but do not prey on each other, 
are bluegill and redear. Suppose that a pond is stocked with bluegill and redear, and let 
x and y be the populations of bluegill and redear, respectively, at time t. Suppose further 
that the competition is modeled by the equations 

dx/dt = x(ei — o\X — ot\y), 
dy/dt = y(e2 - ayy - oi 2 x). 


(a) If e 2/012 > ei/01 and 62/^2 > ei/ai, show that the only equilibrium populations in the 
pond are no fish, no redear, or no bluegill. What will happen for large f? 

(b) If ei/cri > 62/^2 and ei/oq > 62/^2. show that the only equilibrium populations in the 
pond are no fish, no redear, or no bluegill. What will happen for large f? 

9. Consider the competition between bluegill and redear mentioned in Problem 8. Suppose 
that €2/^2 > ei/oi and ei/<*i > 62/02, so, as shown in the text, there is a stable equilibrium 
point at which both species can coexist. It is convenient to rewrite the equations of 
Problem 8 in terms of the carrying capacities of the pond for bluegill ( B = ei/01) in the 
absence of redear and for redear ( R = 62/02) in the absence of bluegill. 

(a) Show that the equations of Problem 8 take the form 


dx 

dt 


e\x 





dy ( I y 2 

-7- = e 2y [4 - —y - —x 


dt 


R 


R 


where y\ = ai/01 and yi = 02/02. Determine the coexistence equilibrium point ( X , Y) in 
terms of B, R , and y 2 . 

(b) Now suppose that a fisherman fishes only for bluegill with the effect that B is reduced. 
What effect does this have on the equilibrium populations? Is it possible, by fishing, to 
reduce the population of bluegill to such a level that they will die out? 

10. Consider the system (2) in the text, and assume that 0^02 — a\ct2 = 0. 

(a) Find all the critical points of the system. Observe that the result depends on whether 
0162 — <*261 is zero. 

(b) If o - ! 62 — a 2 6i > 0, classify each critical point and determine whether it is asymptoti- 
cally stable, stable, or unstable. Note that Problem 5 is of this type. Then do the same if 

ffl62 - <*261 < 0 . 

(c) Analyze the nature of the trajectories when 0162 — <*261 = 0. 

11. Consider the system (3) in Example 1 of the text. Recall that this system has an asymptot- 
ically stable critical point at (0.5, 0.5), corresponding to the stable coexistence of the two 
population species. Now suppose that immigration or emigration occurs at the constant 
rates of 8 a and 8 b for the species x and y, respectively. In this case Eqs. (3) are replaced by 


dx/dt = a'(1 — x — y) + 8a. 
dy/dt = y(0.75 — y — 0.5x) + 8b. 


The question is what effect this has on the location of the stable equilibrium point. 
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(a) To find the new critical point, we must solve the equations 

x(l — x — y) + 8a = 0 , 

y(0.75 -y- 0.5x) + 8b = 0. ^ 

One way to proceed is to assume that x and y are given by power series in the parameter 
8; thus 

X = X 0 + Xi8 , y = y 0 -\-y l 8-\ . (iii) 

Substitute Eqs. (iii) into Eqs. (ii) and collect terms according to powers of 8. 

(b) From the constant terms (the terms not involving 5), show that x 0 = 0.5 and y 0 = 0.5, 
thus confirming that, in the absence of immigration or emigration, the critical point is 
(0.5, 0.5). 

(c) From the terms that are linear in 8, show that 


x\ = 4a — 4b, yi = —2a + 4b. (iv) 

(d) Suppose that a > 0 and b > 0 so that immigration occurs for both species. Show that 
the resulting equilibrium solution may represent an increase in both populations, or an in- 
crease in one but a decrease in the other. Explain intuitively why this is a reasonable result. 

ryl, 12. The system 

x! = —y, y = —yy — x(x — 0.15)(x — 2) 

results from an approximation to the Hodgkin-Huxley 5 equations, which model the trans- 
mission of neural impulses along an axon. 

(a) Find the critical points and classify them by investigating the approximate linear sys- 
tem near each one. 

(b) Draw phase portraits for y = 0.8 and for y = 1.5. 

(c) Consider the trajectory that leaves the critical point (2,0). Find the value of y for 
which this trajectory ultimately approaches the origin as t — > oo. Draw a phase portrait 
for this value of y . 

Bifurcation Points. Consider the system 

x ' = F(x,y,a), / = G{x,y,a), (i) 

where a is a parameter. The equations 

F(x,y,a) = 0, G(x,y,a) = 0 (ii) 

determine the x- and y-nullclines, respectively; any point where an jr-nullcline and a y-nullcline 
intersect is a critical point. As a varies and the configuration of the nullclines changes, it may 
well happen that, at a certain value of a, two critical points coalesce into one. For further 
variation in a, the critical point may once again separate into two critical points, or it may 
disappear altogether. Or the process may occur in reverse: For a certain value of a, two 
formerly nonintersecting nullclines may come together, creating a critical point, which, for 
further changes in a, may split into two. A value of a at which such phenomena occur is a 
bifurcation point. It is also common for a critical point to experience a change in its type and 
stability properties at a bifurcation point. Thus both the number and kind of critical points 


s Alan L. Hodgkin (1914-1998) and Andrew F. Huxley (1917- ) were awarded the Nobel Prize in phys- 
iology and medicine in 1963 for their work on the excitation and transmission of neural impulses. This 
work was done at Cambridge University; its results were first published in 1952. 
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may change abruptly as a passes through a bifurcation point. Since a phase portrait of a 
system is very dependent on the location and nature of the critical points, an understanding of 
bifurcations is essential to an understanding of the global behavior of the system’s solutions. 

In each of Problems 13 through 16: 

(a) Sketch the nullclines and describe how the critical points move as a increases. 

(b) Find the critical points. 

(c) Let a = 2. Classify each critical point by investigating the corresponding approximate 
linear system. Draw a phase portrait in a rectangle containing the critical points. 

(d) Find the bifurcation point a 0 at which the critical points coincide. Locate this critical point 
and find the eigenvalues of the approximate linear system. Draw a phase portrait. 

(e) For a > a 0 there are no critical points. Choose such a value of a and draw a phase portrait. 

13. x' = —4x + y + x 2 , y' =\a-y 

14. x' = \a - y, y' = -4x + y + x 2 

15. x' = —4x + y + x 2 , y' = -a - x + y 
(fl, 16. x! = -a - x + y, y' = -4x +y + x 2 

Problems 17 through 19 deal with competitive systems much like those in Examples 1 and 2, 
except that some coefficients depend on a parameter a. In each of these problems assume 
that x,y, and a are always nonnegative. In each of Problems 17 through 19: 

(a) Sketch the nullclines in the first quadrant, as in Figure 9.4.5. For different ranges of a your 
sketch may resemble different parts of Figure 9.4.5. 

(b) Find the critical points. 

(c) Determine the bifurcation points. 

(d) Find the Jacobian matrix J and evaluate it for each of the critical points. 

(e) Determine the type and stability property of each critical point. Pay particular attention 
to what happens as a passes through a bifurcation point 

(f) Draw phase portraits for the system for selected values of a to confirm your conclusions. 

17. dx/dt = x(l — x — y), dy/dt = y{a — y — 0.5x) 

18. dx/dt = x(l — x — y), dy/dt = y(0.75 — ay — 0.5x) 

19. dx/dt = x(l — x — y), dy/dt = y[a — y — (2 a — l)x] 


9.5 Predator-Prey Equations 


In the preceding section we discussed a model of two species that interact by 
competing for a common food supply or other natural resource. In this section 
we investigate the situation in which one species (the predator) preys on the other 
species (the prey), while the prey lives on a different source of food. For example, 
consider foxes and rabbits in a closed forest: The foxes prey on the rabbits, the rabbits 
live on the vegetation in the forest. Other examples are bass in a lake as predators 
and redear as prey, or ladybugs as predators and aphids as prey. We emphasize 
again that a model involving only two species cannot fully describe the complex re- 
lationships among species that actually occur in nature. Nevertheless, the study of 
simple models is the first step toward an understanding of more complicated phe- 
nomena. 
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We will denote by x and y the populations of the prey and predator, respectively, 
at time t. In constructing a model of the interaction of the two species, we make the 
following assumptions: 

1. In the absence of the predator, the prey grows at a rate proportional to the current popu- 
lation; thus dx/dt = ax, a > 0, when y = 0. 

2. In the absence of the prey, the predator dies out; thus dy/dt = —cy, c > 0, when x = 0. 

3. The number of encounters between predator and prey is proportional to the product of 
their populations. Each such encounter tends to promote the growth of the predator and 
to inhibit the growth of the prey. Thus the growth rate of the predator is increased by 
a term of the form yxy, while the growth rate of the prey is decreased by a term —ctxy, 
where y and a are positive constants. 

As a consequence of these assumptions, we are led to the equations 
dx/dt — ax — axy = x(a — ay), 
dy/dt — —cy + yxy = y(—c + yx). 

The constants a, c, a, and y are all positive; a and c are the growth rate of the prey 
and the death rate of the predator, respectively, and a and y are measures of the 
effect of the interaction between the two species. Equations (1) are known as the 
Lotka-Volterra equations. They were developed in papers by Lotka 6 in 1925 and by 
Volterra 7 in 1926. Although these are rather simple equations, they do characterize 
a wide class of problems. Ways of making them more realistic are discussed at the 
end of this section and in the problems. Our goal here is to determine the qualitative 
behavior of the solutions (trajectories) of the system (1) for arbitrary positive initial 
values of x and y . We first do this for a specific example and then return to the general 
equations (1) at the end of the section. 


EXAMPLE 

1 


Discuss the solutions of the system 

dx/dt = x(\ — 0.5y) = x — 0.5xy = F{x,y ), 

( 2 ) 

dy/dt = y{— 0.75 + 0.25x) = — 0.75y + 0.25xy = G(x,y) 
for x and y positive. 

The critical points of this system are the solutions of the algebraic equations 

x(l — 0.5y) = 0, y(— 0.75 + 0.25x) = 0, (3) 

namely, the points (0,0) and (3,2). Figure 9.5.1 shows the critical points and a direction field 
for the system (2). From this figure it appears that trajectories in the first quadrant encircle the 


6 Alfred J. Lotka (1880-1949), an American biophysicist, was born in what is now Ukraine and was educated 
mainly in Europe. He is remembered chiefly for his formulation of the Lotka-Volterra equations. He 
was also the author, in 1924, of the first book on mathematical biology; it is now available as Elements of 
Mathematical Biology (New York: Dover, 1956). 

7 Vito Volterra (1860-1940), a distinguished Italian mathematician, held professorships at Pisa, Turin, and 
Rome. He is particularly famous for his work in integral equations and functional analysis. Indeed, one 
of the major classes of integral equations is named for him; see Problem 21 of Section 6.6. His theory 
of interacting species was motivated by data collected by a friend, D 'Ancona, concerning fish catches in 
the Adriatic Sea. A translation of his 1926 paper can be found in an appendix to R. N. Chapman, Animal 
Ecology with Special Reference to Insects (New York: McGraw-Hill, 1931). 
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critical point (3, 2). Whether the trajectories are actually closed curves, or whether they slowly 
spiral in or out, cannot be definitely determined from the direction field. The origin appears 
to be a saddle point. Just as for the competition equations in Section 9.4, the coordinate axes 
are trajectories of Eqs. (1) or (2). Consequently, no other trajectory can cross a coordinate 
axis, which means that every solution starting in the first quadrant remains there for all time. 
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FIGURE 9.5.1 Critical points and direction field for the predator-prey system (2). 


Next we examine the local behavior of solutions near each critical point. Near the origin 
we can neglect the nonlinear terms in Eqs. (2) to obtain the corresponding linear system 



The eigenvalues and eigenvectors of Eq. (4) are 

'•1 = 1, r 2 — —0.75, = (5) 

so its general solution is 



Thus the origin is a saddle point both of the linear system (4) and of the nonlinear system (2), 
and therefore is unstable. One pair of trajectories enters the origin along the y-axis; all other 
trajectories depart from the neighborhood of the origin. 

To examine the critical point (3,2), we can use the Jacobian matrix 



/ 1 — 0.5y — 0.5x \ 

^ 0.25y -0.75 + 0.25.x J ' 


F x (x,y) F y (x,y) 
G x {x,y ) G y (x,y) 


(V) 
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Evaluating J at the point (3,2), we obtain the linear system 


d ( u \ _ (° - L5 \ M 

di\v) ~ ^0.5 0 ) \ J ’ 


( 8 ) 


where u = x — 3 and v = y — 2. The eigenvalues and eigenvectors of this system are 


n 


V3 i 

~ 2 ~’ 




( 9 ) 


Since the eigenvalues are imaginary, the critical point (3,2) is a center of the linear system 
(8) and is therefore a stable critical point for that system. Recall from Section 9.3 that this 
is one of the cases in which the behavior of the linear system may or may not carry over to 
the nonlinear system, so the nature of the point (3,2) for the nonlinear system (2) cannot be 
determined from this information. 

The simplest way to find the trajectories of the linear system (8) is to divide the second of 
Eqs. (8) by the first so as to obtain the differential equation 


or 


civ dv/dt 0.5h u 
du diijdt — 1.5u 3v’ 


u du + 3v dv = 0. 


Consequently, 


u 2 + 3tr = k. 


(10) 

( 11 ) 


where k is an arbitrary nonnegative constant of integration. Thus the trajectories of the linear 
system (8) are ellipses centered at the critical point and elongated somewhat in the horizontal 
direction. 

Now let us return to the nonlinear system (2). Dividing the second of Eqs. (2) by the first, 
we obtain 

dy_ _ y(— 0.75 + 0.25s) 
dx x(\ — 0.5y) 

Equation (12) is a separable equation and can be put in the form 

l-0.5y , —0.75 + 0.25.r , 

— dy = dx, 

y x 

from which it follows that 


0.75 Inx + In y - 0.5y - 0.25.x = c, (13) 

where c is a constant of integration. Although by using only elementary functions we cannot 
solve Eq. (13) explicitly for either variable in terms of the other, it is possible to show that 
the graph of the equation for a fixed value of c is a closed curve surrounding the critical point 
(3, 2). Thus the critical point is also a center of the nonlinear system (2), and the predator and 
prey populations exhibit a cyclic variation. 

Figure 9.5.2 shows a phase portrait of the system (2). For some initial conditions the tra- 
jectory represents small variations in x and y about the critical point, and is almost elliptical 
in shape, as the linear analysis suggests. For other initial conditions the oscillations in x and y 
are more pronounced, and the shape of the trajectory is significantly different from an ellipse. 
Observe that the trajectories are traversed in the counterclockwise direction. The dependence 
of x and y on t for a typical set of initial conditions is shown in Figure 9.5.3. Note that x and 
y are periodic functions of t, as they must be since the trajectories are closed curves. Further, 
the oscillation of the predator population lags behind that of the prey. Starting from a state in 
which both predator and prey populations are relatively small, the prey first increase because 
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there is little predation. Then the predators, with abundant food, increase in population also. 
This causes heavier predation, and the prey tend to decrease. Finally, with a diminished food 
supply, the predator population also decreases, and the system returns to the original state. 



FIGURE 9.5.3 Variations of the prey and predator populations with time for the system (2). 


The general system (1) can be analyzed in exactly the same way as in the example. 
The critical points of the system (1) are the solutions of 

x(a — ay) = 0, y(— c + yx) = 0, 


that is, the points (0,0) and (c/y,a/a). We first examine the solutions of the corre- 
sponding linear system near each critical point. 

In the neighborhood of the origin the corresponding linear system is 



d 

dt 


( 14 ) 
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The eigenvalues and eigenvectors are 

n = a, § (1) = Q ; r 2 = -c, § (2) = ^ , (15) 

so the general solution is 

r) = ei Q e " + e G) e_ “ (16) 

Thus the origin is a saddle point and hence unstable. Entrance to the saddle point is 
along the y-axis; all other trajectories depart from the neighborhood of the critical 
point. 

Next consider the critical point ( c/y ,a/a ). The Jacobian matrix is 


( a — ay — ax 

yy -c + yx 

Evaluating J at (c/y,a/a), we obtain the approximate linear system 

d (u\ _ ( 0 —ac/y\ / u\ 
dt\v) \ya/a 0 / \iy ’ 


(17) 


where u — x — ( c/y ) and v = y — (a/a). The eigenvalues of the system (17) are 
r = ±i^fac, so the critical point is a (stable) center of the linear system. To find the 
trajectories of the system (17), we can divide the second equation by the first to obtain 


or 


dv dv/dt (ya/a)u 

du du/dt (ac/y) u ’ 


y 2 au du + a 2 cv dv = 0. 


Consequently, 


y 2 au 2 + a 2 cv 2 = k, 


(18) 

(19) 

( 20 ) 


where k is a nonnegative constant of integration. Thus the trajectories of the linear 
system (17) are ellipses, just as in the example. 

Returning briefly to the nonlinear system (1), observe that it can be reduced to the 
single equation 

dy dy/dt y(— c + yx) 
dx dx/dt x(a — ay) 

Equation (21) is separable and has the solution 

n In y — ay + c In x — yx = C, 

where C is a constant of integration. Again it is possible to show that, for fixed C, 
the graph of Eq. (22) is a closed curve surrounding the critical point (c/y, a /a). Thus 
this critical point is also a center for the general nonlinear system (1). 


( 21 ) 

( 22 ) 
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The cyclic variation of the predator and prey populations can be analyzed in more 
detail when the deviations from the point ( c/y.a/a ) are small and the linear system 
(17) can be used. The solution of the system (17) can be written in the form 


u — —K cos (Vac t + </>), 
Y 


a c < — 

v — — -K sm(vnc t + </>), 
a V a 


where the constants K and (]> are determined by the initial conditions. Thus 

Q Q 

x— — | — K cos (Vac? + <p ), 

Y Y 

a a [c , — 

y= — I — . -K sin(v«ct + (/>). 
a a V a 


(23) 


(24) 


These equations are good approximations for the nearly elliptical trajectories close 
to the critical point ( c/y,a/a ). We can use them to draw several conclusions about 
the cyclic variation of the predator and prey on such trajectories. 

1. The sizes of the predator and prey populations vary sinusoidally with period 2n / ^ fac . This 
period of oscillation is independent of the initial conditions. 

2. The predator and prey populations are out of phase by one-quarter of a cycle. The prey 
leads and the predator lags, as explained in the example. 

3. The amplitudes of the oscillations are Kc/y for the prey and a^/cK/ot^/a for the predator 
and hence depend on the initial conditions as well as on the parameters of the problem. 

4. The average populations of predator and prey over one complete cycle are c/y and a/a, 
respectively. These are the same as the equilibrium populations; see Problem 10. 

Cyclic variations of predator and prey as predicted by Eqs. (1) have been observed 
in nature. One striking example is described by Odum (pp. 191-192); based on the 
records of the Hudson Bay Company of Canada, the abundance of lynx and snowshoe 
hare, as indicated by the number of pelts turned in over the period 1845-1935, shows 
a distinct periodic variation with period of 9 to 10 years. The peaks of abundance are 
followed by very rapid declines, and the peaks of abundance of the lynx and hare are 
out of phase, with that of the hare preceding that of the lynx by a year or more. 

Since the critical point (c/y , a/a) is a center, we expect that small perturbations of 
the Lotka-Volterra equations may well lead to solutions that are not periodic. Put 
another way, unless the Lotka-Volterra equations exactly describe a given predator- 
prey relationship, the actual fluctuations of the populations may differ substantially 
from those predicted by the Lotka-Volterra equations, due to small inaccuracies in 
the model equations. This has led to many attempts 8 to replace the Lotka-Volterra 
equations by other systems that are less susceptible to the effects of small perturba- 
tions. Problem 13 introduces one such alternative model. 

Another criticism of the Lotka-Volterra equations is that in the absence of the 
predator, the prey will grow without bound. This can be corrected by allowing for 
the natural inhibiting effect that an increasing population has on the growth rate of 
the population. For example, the first of Eqs. (1) can be modified so that when y = 0, 
it reduces to a logistic equation for x. The effects of this modification are explored in 


8 See the book by Brauer and Castillo-Chavez listed in the references for an extensive discussion of 
alternative models for predator-prey relationships. 
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Problems 11 and 12. Problems 14 through 16 deal with harvesting in a predator-prey 
relationship. The results may seem rather counterintuitive. 

Finally, we repeat a warning stated earlier: relationships among species in the 
natural world are often complex and subtle. You should not expect too much of a 
simple system of two differential equations in describing such relationships. Even if 
you are convinced that the general form of the equations is sound, the determination 
of numerical values for the coefficients may present serious difficulties. 


PROBLEMS Each of Problems 1 through 5 can be interpreted as describing the interaction of two species 
with population densities x and y. In each of these problems carry out the following steps. 

(a) Draw a direction field and describe how solutions seem to behave. 

(b) Find the critical points. 

(c) For each critical point find the corresponding linear system. Find the eigenvalues and 
eigenvectors of the linear system; classify each critical point as to type, and determine whether 
it is asymptotically stable, stable, or unstable. 

(d) Sketch the trajectories in the neighborhood of each critical point. 

(e) Draw a phase portrait for the system. 

(f) Determine the limiting behavior of x and y as t -> oo and interpret the results in terms of 
the populations of the two species. 


1. dx/dt = x(1.5 — 0.5y) 
dy/dt = y(— 0.5 + x) 

3. dx/dt = x(l — 0.5x — 0.5y) 
dy/dt = y(— 0.25 + 0.5x) 

5. dx/dt = x( — 1 + 2.5x — 0.3y — x 2 ) 
dy/dt = y{— 1.5 + x) 


2. dx/dt = x(l — 0.5y) 
dy/dt — y(— 0.25 + 0.5x) 

4. dx/dt — x(1.125 — x — 0.5 y) 
dy/dt = y {— 1 + x) 


6. In this problem we examine the phase difference between the cyclic variations of the 
predator and prey populations as given by Eqs. (24) of this section. Suppose we assume 
that K > 0 and that t is measured from the time that the prey population x is a maximum; 
then (f> = 0. 

(a) Show that the predator population y is a maximum at t = n /2j~ac = T /4, where T is 
the period of the oscillation. 

(b) When is the prey population increasing most rapidly? decreasing most rapidly? a 
minimum? 

(c) Answer the questions in part (b) for the predator population. 

(d) Draw a typical elliptic trajectory enclosing the point (c/y,a/a), and mark the points 
found in parts (a), (b), and (c) on it. 

7. (a) Find the ratio of the amplitudes of the oscillations of the prey and predator populations 
about the critical point (c/y.u/a), using the approximation (24), which is valid for small 
oscillations. Observe that the ratio is independent of the initial conditions. 

(b) Evaluate the ratio found in part (a) for the system (2). 

(c) Estimate the amplitude ratio for the solution of the nonlinear system (2) shown in 
Figure 9.5.3. Does the result agree with that obtained from the linear approximation? 

(d) Determine the prey-predator amplitude ratio for other solutions of the system (2), 
that is, for solutions satisfying other initial conditions. Is the ratio independent of the 
initial conditions? 
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8. (a) Find the period of the oscillations of the prey and predator populations, using the ap- 
proximation (24), which is valid for small oscillations. Note that the period is independent 
of the amplitude of the oscillations. 

(b) For the solution of the nonlinear system (2) shown in Figure 9.5.3, estimate the period 
as well as possible. Is the result the same as for the linear approximation? 

(c) Calculate other solutions of the system (2), that is, solutions satisfying other initial 
conditions, and determine their periods. Is the period the same for all initial conditions? 

9. Consider the system 

dx/dt = ax[ 1 — (y/ 2)], dy/dt = by[— 1 + (x/3)], 

where a and b are positive constants. Observe that this system is the same as in the example 
in the text if a = 1 and b = 0.75. Suppose the initial conditions are x(0) = 5 andy(0) = 2. 

(a) Letn = land?) = 1. Plot the trajectory in the phase plane and determine (or estimate) 
the period of the oscillation. 

(b) Repeat part (a) for a = 3 and a = 1/3, with 6 = 1. 

(c) Repeat part (a) for 6 = 3 and 6 = 1/3, with a = 1. 

(d) Describe how the period and the shape of the trajectory depend on a and 6. 

10. The average sizes of the prey and predator populations are defined as 

^ rA+T ^ rA+T 

x = — J x(t) dt , y = — j y(t) dt, 

respectively, where T is the period of a full cycle, and A is any nonnegative constant. 

(a) Using the approximation (24), which is valid near the critical point, show that x = c/y 
andy = a/a. 

(b) For the solution of the nonlinear system (2) shown in Figure 9.5.3, estimate x and y as 
well as you can. Try to determine whether x and y are given by c/y and a/a, respectively, 
in this case. 

Hint: Consider how you might estimate the value of an integral even though you do not 
have a formula for the integrand. 

(c) Calculate other solutions of the system (2), that is, solutions satisfying other initial 
conditions, and determine x and y for these solutions. Are the values of x and y the same 
for all solutions? 

In Problems 11 and 12 we consider the effect of modifying the equation for the prey x by 
including a term —ax 2 so that this equation reduces to a logistic equation in the absence of 
the predatory. Problem 11 deals with a specific system of this kind and Problem 12 takes 
up this modification to the general Lotka-Volterra system. The systems in Problems 3 
and 4 are other examples of this type. 

11. Consider the system 

x' = x(l — ax — 0.5y), y' = y(— 0.75 + 0.250, (25) 

where a > 0. Observe that this system is a modification of the system (2) in Example 1. 

(a) Find all of the critical points. How does their location change as a increases from 
zero? Observe that there is a critical point in the interior of the first quadrant only if 
a <1/3. 

(b) Determine the type and stability property of each critical point. Find the value 
ffi <1/3 where the nature of the critical point in the interior of the first quadrant changes. 
Describe the change that takes place in this critical point as a passes through a\. 
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(c) Draw a direction field and phase portrait for a value of a between zero and eri; for a 
value of a between o\ and 1/3. 

(d) Describe the effect on the two populations as a increases from zero to 1/3. 

12. Consider the system 


dx/dt = x(a — ax — ay), dy/dt = y(— c + yx), 


where a, a, a, c, and y are positive constants. 

(a) Find all critical points of the given system. How does their location change as a 
increases from zero? Assume that a/a > c/y, that is, a < ay /c. Why is this assumption 
necessary? 

(b) Determine the nature and stability characteristics of each critical point. 

(c) Show that there is a value of cr between zero and ay /c where the critical point in the 
interior of the first quadrant changes from a spiral point to a node. 

(d) Describe the effect on the two populations as a increases from zero to ay/c. 

13. In the Lotka-Volterra equations the interaction between the two species is modeled by 
terms proportional to the product xy of the respective populations. If the prey popula- 
tion is much larger than the predator population, this may overstate the interaction; for 
example, a predator may hunt only when it is hungry, and ignore the prey at other times. 
In this problem we consider an alternative model of a type proposed by Rosenzweig and 
MacArthur. 9 

(a) Consider the system 



Find all of the critical points of this system. 

(b) Determine the type and stability characteristics of each critical point. 

(c) Draw a direction field and phase portrait for this system. 

Harvesting in a Predator-Prey Relationship. In a predator-prey situation it may happen 
that one or perhaps both species are valuable sources of food (for example). Or, the prey 
may be regarded as a pest, leading to efforts to reduce their number. In a constant-effort 
model of harvesting we introduce a term — E\x in the prey equation and a term — E 2 y in 
the predator equation, where E\ and E 2 are measures of the effort invested in harvesting 
the respective species. A constant-yield model of harvesting is obtained by including the 
term —Hi in the prey equation and the term —Hi in the predator equation. The constants 
Ei,E 2 ,Hi, and H 2 are always nonnegative. Problems 14 and 15 deal with constant-effort 
harvesting, and Problem 16 with constant-yield harvesting. 

14. Applying a constant-effort model of harvesting to the Lotka-Volterra equations (1), we 
obtain the system 


x! = x(a — ay — E\), y' = y(—c + yx — E 2 ). 


When there is no harvesting the equilibrium solution is (c/y, a/ a). 

(a) Before doing any mathematical analysis, think about the situation intuitively. How 
do you think the populations will change if the prey alone is harvested? if the predator 
alone is harvested? if both are harvested? 


9 See the book by Brauer and Castillo-Chavez for further details. 
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(b) How does the equilibrium solution change if the prey is harvested, but not the predator 
{Ei > 0 : E 2 = 0)? 

(c) How does the equilibrium solution change if the predator is harvested, but not the 
prey (E\ = 0, E 2 > 0)? 

(d) How does the equilibrium solution change if both are harvested (Ei > 0, E 2 > 0)? 

15. If we modify the Lotka-Volterra equations by including a self-limiting term — ax 2 in the 
prey equation, and then assume constant-effort harvesting, we obtain the equations 

x' = x(a — ax — ay — Ei), y' = y(—c + yx — E 2 ). 

In the absence of harvesting the equilibrium solution of interest is x = c/y, 
y = (a/a) - (ac)/(ay). 

(a) How does the equilibrium solution change if the prey is harvested, but not the predator 
(Ei > 0,E 2 = 0)? 

(b) How does the equilibrium solution change if the predator is harvested, but not the 
prey (Ei = 0, E 2 > 0)? 

(c) How does the equilibrium solution change if both are harvested (Ei > 0, E 2 > 0)? 

16. In this problem we apply a constant -yield model of harvesting to the situation in Example 1 . 
Consider the system 

x! = x(\ — 0.5y) — Hi, y' = y(— 0.75 + 0.25x) — H 2 , 

where Hi and H 2 are nonnegative constants. Recall that if Hi = H 2 = 0, then (3, 2) is an 
equilibrium solution for this system. 

(a) Before doing any mathematical analysis, think about the situation intuitively. How 
do you think the populations will change if the prey alone is harvested? if the predator 
alone is harvested? if both are harvested? 

(b) How does the equilibrium solution change if the prey is harvested, but not the predator 
(Hi>0,H 2 = 0)? 

(c) How does the equilibrium solution change if the predator is harvested, but not the 
prey (Hi = 0, H 2 >0)? 

(d) How does the equilibrium solution change if both are harvested (Hi > 0, H 2 > 0)? 
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In Section 9.3 we showed how the stability of a critical point of a locally linear 
system can usually be determined from a study of the corresponding linear system. 
However, no conclusion can be drawn when the critical point is a center of the 
corresponding linear system. Examples of this situation are the undamped pendulum, 
Eqs. (1) and (2) below, and the predator-prey problem discussed in Section 9.5. Also, 
for an asymptotically stable critical point, it may be important to investigate the 
basin of attraction — that is, the domain such that all solutions starting within that 
domain approach the critical point. The theory of locally linear systems provides no 
information about this question. 
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In this section we discuss another approach, known as Liapunov’s 10 second method 
or direct method. The method is referred to as a direct method because no knowledge 
of the solution of the system of differential equations is required. Rather, conclusions 
about the stability or instability of a critical point are obtained by constructing a 
suitable auxiliary function. The technique is a very powerful one that provides a 
more global type of information, for example, an estimate of the extent of the basin 
of attraction of a critical point. Liapunov’s second method can also be used to study 
systems of equations that are not locally linear; however, we will not discuss such 
problems. 

Basically, Liapunov’s second method is a generalization of two physical principles 
for conservative systems, namely, (i) a rest position is stable if the potential energy is 
a local minimum, otherwise it is unstable, and (ii) the total energy is a constant during 
any motion. To illustrate these concepts, again consider the undamped pendulum (a 
conservative mechanical system), which is governed by the equation 


d 2 0 g . 

—r + Y Sind = 0. 

dt 2 L 

The corresponding system of first order equations is 


dx 

dt 


dy g . 

— = sini, 

dt L 


a) 

( 2 ) 


where x — 6 and y = d6 /dt. If we omit an arbitrary constant, the potential energy U 
is the work done in lifting the pendulum above its lowest position, namely, 


U(x,y) — mgL( 1 — cosx); 


( 3 ) 


see Figure 9.2.2. The critical points of the system (2) are x = ±nn,y = 0, n = 0, 1,2, 
3, . . . , corresponding to 9 = ±mt, dd/dt = 0. Physically, we expect the points x = 0, 
y — 0; x = ±27r, y — 0; . . ., corresponding to 0 = 0, ±2jt, . . . , to be stable, since for 
them the pendulum bob is vertical with the weight down; further, we expect the 
points x = ±7i, y = 0;jc = ±3n,y = 0; . . . , corresponding to 0 — ±jt,±3tt , . . . , to be 
unstable, since for them the pendulum bob is vertical with the weight up. This agrees 
with statement (i), for at the former points U is a minimum equal to zero, and at the 
latter points U is a maximum equal to 2 mgL. 

Next consider the total energy V, which is the sum of the potential energy U and 
the kinetic energy | mL 2 (dO/dt ) 2 . In terms of x and y, 

V(x,y) — mgL( 1 — cosx) + | mLry 2 . (4) 

On a trajectory corresponding to a solution x = 0(f), y = 0(f) of Eqs. (2), V can be 
considered a function of f. The derivative of V[<p{t), 0(f)] with respect to f is called 


10 Alexandr M. Liapunov (1857-1918), a student of Chebyshev at St. Petersburg, taught at the University of 
Kharkov from 1885 to 1901 , when he became an academician in applied mathematics at the St. Petersburg 
Academy of Sciences. In 1917 he moved to Odessa because of his wife’s frail health. His research in 
stability encompassed both theoretical analysis and applications to various physical problems. His second 
method formed part of his most influential work, General Problem of Stability of Motion, published in 
1892. 
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the rate of change of V following the trajectory. By the chain rule, 


dV[(j>(t),//(t)] 

dt 




d(h(t) 
-3— + 

dt 


v y m\tm 


d\jf(t) 

dt 


dx 

(. mgL sin x) — 
dt 


+ mL 2 y 


dy 

dt’ 


( 5 ) 


where it is understood that x = y = is(t). Finally, substituting in Eq. (5) for 
dx/dt and dy/dt from Eqs. (2), we find that dV /dt — 0. Flence V is a constant along 
any trajectory of the system (2), which is statement (ii). 

It is important to note that, at any point (x,y), the rate of change of V along the 
trajectory through that point was computed without actually solving the system (2). 
It is precisely this fact that enables us to use Liapunov’s second method for systems 
whose solution we do not know, which is the main reason for its importance. 

At the stable critical points, x = ±2nn,y = 0,n = 0, 1,2, . . . , the energy V is zero. If 
the initial state, say, (x\,yi), of the pendulum is sufficiently near a stable critical point, 
then the energy V(xi,yi) is small, and the motion (trajectory) associated with this 
energy stays close to the critical point. It can be shown that if V (x\,y\) is sufficiently 
small, then the trajectory is closed and contains the critical point. For example, 
suppose that (xi,yi) is near (0,0) and that V{xi,y\) is very small. The equation of 
the trajectory with energy V{x\.yi) is 

V(x,y) = mgL(\ - cos*) + \mLry 2 = V(xi,yi). 


For x small we have 1 — cosx = 1 — (1 — x 2 /2! + ••■)= x 2 /2. Thus the equation of 
the trajectory is approximately 

\mgLx 2 + \mL 2 y 2 — F(xi,yi), 


or 

2 2 

* + i = 1. 

2V (x\,y\)/mgL 2V(x 1 ,y 1 )/mL 2 

This is an ellipse enclosing the critical point (0, 0); the smaller V (xj ,y\ ) is, the smaller 
are the major and minor axes of the ellipse. Physically, the closed trajectory corre- 
sponds to a solution that is periodic in time — the motion is a small oscillation about 
the equilibrium point. 

If damping is present, however, it is natural to expect that the amplitude of the 
motion decays in time and that the stable critical point (center) becomes an asymp- 
totically stable critical point (spiral point). See the phase portrait for the damped 
pendulum in Figure 9.3.5. This can almost be argued from a consideration of dV /dt. 
For the damped pendulum, the total energy is still given by Eq. (4), but now, from 
Eqs. (13) of Section 9.2, dx/dt = y and dy/dt = — ( g/L ) sinx — ( c/mL)y . Substitut- 
ing for dx/dt and dy/dt in Eq. (5) gives dV /dt — —cLy 2 < 0. Thus the energy is 
nonincreasing along any trajectory, and except for the line y — 0, the motion is such 
that the energy decreases. Hence each trajectory must approach a point of minimum 
energy — a stable equilibrium point. If dV /dt < 0 instead of dV /dt < 0, it is reason- 
able to expect that this would be true for all trajectories that start sufficiently close 
to the origin. 
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To pursue these ideas further, consider the autonomous system 

dx/dt — F(x,y), dy/dt = G(x,y), (6) 

and suppose that the point* = 0,y = 0 is an asymptotically stable critical point. Then 
there exists some domain D containing (0,0) such that every trajectory that starts in 
D must approach the origin as t -> oo. Suppose that there exists an “energy” function 
V such that V > 0 for (x,y) in D with V — 0 only at the origin. Since each trajectory 
in D approaches the origin as t -> oo, then following any particular trajectory, V 
decreases to zero as t approaches infinity. The type of result we want to prove is 
essentially the converse: if, on every trajectory, V decreases to zero as t increases, 
then the trajectories must approach the origin as f — > oo, and hence the origin is 
asymptotically stable. First, however, it is necessary to make several definitions. 

Let V be defined on some domain D containing the origin. Then V is said to 
be positive definite on D if V(0,0) = 0 and V(x,y) > 0 for all other points in D. 
Similarly, V is said to be negative definite on D if V (0, 0) = 0 and V (x, y) < 0 for all 
other points in D. If the inequalities > and < are replaced by > and <, then V is said to 
be positive semidefinite and negative semidefinite, respectively. We emphasize that 
when we speak of a positive definite (negative definite, ...) function on a domain D 
containing the origin, the function must be zero at the origin in addition to satisfying 
the proper inequality at all other points in D. 


The function 


V(x,y) = sin(x 2 + y 1 ) 

is positive definite on x 2 + y 2 < 7r/2 since V(0,0) = 0 and V(x,y) > 0 for 0 < x 2 + y 2 <n/2. 
However, the function 

V(x,y) = (x + y) 2 

is only positive semidefinite since V(x,y) = 0 on the liney = — x. 


We also want to consider the function 


V(x,y) = V x {x,y)F(x,y) + V y (x,y)G(x,y), (7) 


where F and G are the same functions as in Eqs. (6). We choose this notation because 
V (x, y) can be identified as the rate of change of V along the trajectory of the system 
(6) that passes through the point (x,y). That is, if * = 0(f), y — 0(f) is a solution of 
the system (6), then 


dvm),jf(t)] 

dt 


y c [0(f),0(f)] 


dip (f) 
dt 




d\j/{t) 

dt 


= V x (x,y)F(x,y) + V y (x,y)G(x,y) 

= V(x,y). (8) 

The function V is sometimes referred to as the derivative of V with respect to the 
system (6). 

We now state two Liapunov theorems, the first dealing with stability, the second 
with instability. 
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Theorem 9.6.2 


Suppose that the autonomous system (6) has an isolated critical point at the origin. 
If there exists a function V that is continuous and has continuous first partial deriva- 
tives, is positive definite, and for which the function V given by Eq. (7) is negative 
definite on some domain D in the xy-plane containing (0,0), then the origin is an 
asymptotically stable critical point. If V is negative semidefinite, then the origin is 
a stable critical point. 


Let the origin be an isolated critical point of the autonomous system (6). Let V be a 
function that is continuous and has continuous first partial derivatives. Suppose that 
V (0, 0) = 0 and that in every neighborhood of the origin there is at least one point 
at which V is positive (negative). If there exists a domain D containing the origin 
such that the function V given by Eq. (7) is positive definite (negative definite) on 
D, then the origin is an unstable critical point. 


The function V is called a Liapunov function. Before sketching geometrical argu- 
ments for Theorems 9.6.1 and 9.6.2, we note that the difficulty in using these theorems 
is that they tell us nothing about how to construct a Liapunov function, assuming that 
one exists. In cases where the autonomous system (6) represents a physical prob- 
lem, it is natural to consider first the actual total energy function of the system as 
a possible Liapunov function. However, Theorems 9.6.1 and 9.6.2 are applicable in 
cases where the concept of physical energy is not pertinent. In such cases a judicious 
trial-and-error approach may be necessary. 

Now consider the second part of Theorem 9.6.1, that is, the case V < 0. Let c > 0 
be a constant and consider the curve in the xy-plane given by V ( x , y) = c. For c = 0 
the curve reduces to the single point x = 0, y = 0. We assume that if 0 < ci < C 2 , 
then the curve V(x,y) = c\ contains the origin and lies within the curve V(x,y) = C 2 , 
as illustrated in Figure 9.6. In. We show that a trajectory starting inside a closed 
curve V(x,y) = c cannot cross to the outside. Thus, given a circle of radius e about 
the origin, by taking c sufficiently small, we can ensure that every trajectory starting 
inside the closed curve V(x,y) = c stays within the circle of radius e; indeed, it stays 
within the closed curve V (x,y) = c itself. Thus the origin is a stable critical point. 



(a) ( b ) 

FIGURE 9.6.1 Geometrical interpretation of Liapunov’s method. 
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To show this, recall from calculus that the vector 

W(x,y) = V x (x,y) i + V y (x,y) j, (9) 

known as the gradient of V, is normal to the curve V (x, y) = c and points in the direc- 
tion of increasing V. In the present case, V increases outward from the origin, so V V 
points away from the origin, as indicated in Figure 9.6.1 b. Next, consider a trajectory 
x — 0(f), y — 0(f) of the system (6), and recall that the vector T(f) = <p'(t) i + 0'(f) j 
is tangent to the trajectory at each point; see Figure 9. 6. lb. Letxi = </>(fi),yi = 0(fi) 
be a point of intersection of the trajectory and a closed curve V (x, y) = c. At this 
point 0'(fi) = F{x\,y{), 0'(fi) = G(xi,yi), so from Eq. (7) we obtain 

V{xi,yx) = V x (xuyi)(t>'(h) + V y (xi,yi)f'(h) 

= [V x (,Xi,y l )i + V y (x u y\)\] • [0'(fi)i + 0'(fi)j] 

= VV{x l ,y l )-T(h). (10) 

Thus V(x\,y\) is the scalar product of the vector W(x\,yi) and the vector T(fi). 
Since V(x\,yi) < 0, it follows that the cosine of the angle between VU(xi,yi) and 
T (fj ) is also less than or equal to zero; hence the angle itself is in the range [jt/ 2, 37r /2] . 
Thus the direction of motion on the trajectory is inward with respect to V(x\,y\) = c 
or, at worst, tangent to this curve. Trajectories starting inside a closed curve 
V(xi , yi) = c (no matter how small c is) cannot escape, so the origin is a stable point. 
If V(x\,y\) < 0, then the trajectories passing through points on the curve are actually 
pointed inward. As a consequence, it can be shown that trajectories starting suffi- 
ciently close to the origin must approach the origin; hence the origin is asymptotically 
stable. 

A geometric argument for Theorem 9.6.2 follows in a somewhat similar manner. 
Briefly, suppose that V is positive definite, and suppose that given any circle about the 
origin, there is an interior point (xi,_yi) at which V(x\,y\) > 0. Consider a trajectory 
that starts at (xi,yi). Along this trajectory it follows from Eq. (8) that V must increase 
since V(x\,y\) > 0; furthermore, since V (x\,yi) > 0, the trajectory cannot approach 
the origin because V (0, 0) = 0. This shows that the origin cannot be asymptotically 
stable. By further exploiting the fact that V(x,y) > 0, it is possible to show that the 
origin is an unstable point; however, we will not pursue this argument. 


Use Theorem 9.6.1 to show that (0, 0) is a stable critical point for the undamped pendulum 
equations (2). Also use Theorem 9.6.2 to show that (n, 0) is an unstable critical point. 

Let V be the total energy given by Eq. (4): 

V (x, y) = mgL(l — cosx) + ^mL 2 y 2 . (4) 

If we take D to be the domain —n/2 < x < n/2, — oo < y < oo, then V is positive there except 
at the origin, where it is zero. Thus V is positive definite on D. Further, as we have already 
seen, 

V = (mgL sin x){y) + (mL 2 y)(— g sin x)/L = 0 

for all x and y. Thus V is negative semidefinite on D. Consequently, by the last statement 
in Theorem 9.6.1, the origin is a stable critical point for the undamped pendulum. Observe 
that this conclusion cannot be obtained from Theorem 9.3.2 because (0, 0) is a center for the 
corresponding linear system. 
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Now consider the critical point (n, 0). The Liapunov function given by Eq. (4) is no longer 
suitable because Theorem 9.6.2 calls for a function V for which V is either positive or negative 
definite. To analyze the point (zr , 0) , it is convenient to move this point to the origin by the 
change of variables x = n + u,y = v. Then the differential equations (2) become 

g 

du/dt = v, dv/dt= — sinu, (11) 

and the critical point is (0, 0) in the z<i>-plane. Consider the function 

V(u,v) = vsinu (12) 

and let D be the domain — jr/4 < u < jr/4, — oo < v < oo. Then 

V = (v cos u)(v) + (sin u)[(g/L) sin u\ = v 2 cos u + ( g/L ) sin 2 u (13) 

is positive definite in D. The only remaining question is whether there are points in every 
neighborhood of the origin where V itself is positive. From Eq. (12) we see that V(u, v) > 0 
in the first quadrant (where both sinzz and v are positive) and in the third quadrant (where 
both are negative). Thus the conditions of Theorem 9.6.2 are satisfied, and the point (0,0) in 
the Mr-plane, corresponding to the point (zr, 0) in the xy-plane, is unstable. 

The damped pendulum equations are discussed in Problem 7. 


From a practical point of view, we are often interested in the basin of attraction. 
The following theorem provides some information on this subject. 


Theorem 9.6.3 L et t ^ ie origin be an isolated critical point of the autonomous system (6). Let the 
function V be continuous and have continuous first partial derivatives. If there is a 
bounded domain Dk containing the origin where V (x, y) < K for some positive K , 
V is positive definite, and V is negative definite, then every solution of Eqs. (6) that 
starts at a point in D K approaches the origin as t approaches infinity. 


In other words. Theorem 9.6.3 says that if x = <p(t), y = is the solution of 
Eqs. (6) for initial data lying in D K , then (x,y) approaches the critical point (0, 0) as 
t — > oo. Thus Dk gives a region of asymptotic stability; of course, it may not be the 
entire basin of attraction. This theorem is proved by showing that (i) there are no 
periodic solutions of the system (6) in Dk, and (ii) there are no other critical points 
in Dk- It then follows that trajectories starting in Dk cannot escape and therefore 
must tend to the origin as t tends to infinity. 

Theorems 9.6.1 and 9.6.2 give sufficient conditions for stability and instability, re- 
spectively, but these conditions are not necessary. Also, our failure to find a suitable 
Liapunov function does not mean that there is no such function. Unfortunately, there 
are no general methods for the construction of Liapunov functions; however, there 
has been extensive work on the construction of Liapunov functions for special classes 
of equations. An elementary algebraic result that is often useful in constructing pos- 
itive definite or negative definite functions is stated without proof in the following 
theorem. 
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Theorem 9.6.4 


EXAMPLE 

3 


EXAMPLE 

4 


The function 


V (x,y) = ax 2 + bxy + cy 2 
is positive definite if, and only if, 

a > 0 and 4ac — b 2 > 0, 
and is negative definite if, and only if, 

a < 0 and 4 ac — b 2 > 0. 


The use of Theorem 9.6.4 is illustrated in the following example. 


(14) 

(15) 

(16) 


Show that the critical point (0, 0) of the autonomous system 

dx/dt = —x — xy 2 , dy/dt = —y — x 2 y (17) 

is asymptotically stable. 

We try to construct a Liapunov function of the form (14). Then V x (x,y) = lax + by , 
V y (x,y) = bx + Icy, so 

V(x,y) = (2ax + by)(—x — xy 2 ) + (bx + 2 cy)(—y — x 2 _y) 

= - [2 a(x 2 + x 2 y 2 ) + b(2xy + xy 3 + x 3 y) + 2 c(y 2 + x 2 y 2 )] . 

If we choose b = 0, and a and c to be any positive numbers, then V is negative definite and V 
is positive definite by Theorem 9.6.4. Thus, by Theorem 9.6.1, the origin is an asymptotically 
stable critical point. 


Consider the system 


dx/dt = x(l — x — y), 
dy/dt = y( 0.75 — y — 0.5x). 


(18) 


In Example 1 of Section 9.4 we found that this system models a certain pair of competing 
species, and that the critical point (0.5, 0.5) is asymptotically stable. Confirm this conclusion 
by finding a suitable Liapunov function. 

It is helpful to transform the point (0.5, 0.5) to the origin. To this end, let 

x = 0.5 + it, y = 0.5 + r. (19) 

Then, substituting for x and y in Eqs. (18), we obtain the new system 

du/dt = —0.5;/ — 0.5u — u 2 — uv, 

, (20) 

dv/dt = —0.25a — 0.5v — 0.5uv — v . 


To keep the calculations relatively simple, consider the function V (u, v ) — u 2 + v 2 as a possible 
Liapunov function. This function is clearly positive definite, so we need only to determine 


9.6 Liapunov’s Second Method 


551 


PROBLEMS 


whether there is a region containing the origin in the wu-plane where the derivative V with 
respect to the system (20) is negative definite. We compute V (u, v) and find that 

du d v 

V(u,v) = V u — + V v — 
dt dt 

= 2u(—0.5u — 0.5u — u 2 — uv ) + 2u(— 0.25w — 0.5t> — Q.Suv — ir), 


or 

V (u , v) = — [(« 2 + 1.5 uv + v 2 ) + (2u 3 4 + 2 u 2 v + uv 2 + 2v 3 )] , (21) 

where we have collected together the quadratic and cubic terms. We want to show that the 
expression in square brackets in Eq. (21) is positive definite, at least for u and v sufficiently 
small. Observe that the quadratic terms can be written as 

u 2 + 1.5 uv + v * 1 = 0.25(i< 2 + v 2 ) + 0.75(w + i>) 2 , (22) 

so these terms are positive definite. On the other hand, the cubic terms in Eq. (21) may be of 
either sign. Thus we must show that, in some neighborhood of u = 0, v = 0, the cubic terms 
are smaller in magnitude than the quadratic terms; that is, 

|2« 3 + 2u 2 v + uv 2 + 2d 3 | < 0.25(m 2 + a 2 ) + 0.75(« + a) 2 . (23) 

To estimate the left side of Eq. (23), we introduce polar coordinates u = r cos#, a = rsin#. 
Then 


1 2 w 3 + 2u 2 v + uv 2 + 2a 3 1 = r 3 |2cos 3 9 + 2 cos 2 9 sin# + cos 9 sin 2 9 + 2 sin 3 9 \ 

< r 3 [2 1 cos 3 9 1 + 2 cos 2 9 \ sin 0 1 + | cos6»| sin 2 9 + 2\ sin 3 9 1] 

<7 r 3 , 

since | sin f? | , | cos#| < 1. To satisfy Eq. (23), it is now certainly sufficient to satisfy the more 
stringent requirement 

7 r 3 < 0.25 Ur + a 2 ) = 0.25r 2 , 

which yields r < 1/28. Thus, at least in this disk, the hypotheses of Theorem 9.6.1 are satisfied, 
so the origin is an asymptotically stable critical point of the system (20). The same is then true 
of the critical point (0.5, 0.5) of the original system (18). 

If we refer to Theorem 9.6.3, the preceding argument also shows that the disk with center 
(0.5, 0.5) and radius 1/28 is a region of asymptotic stability for the system (18). This is a 
severe underestimate of the full basin of attraction, as the discussion in Section 9.4 shows. 
To obtain a better estimate of the actual basin of attraction from Theorem 9.6.3, we would 
have to estimate the terms in Eq. (23) more accurately, use a better (and presumably more 
complicated) Liapunov function, or both. 


In each of Problems 1 through 4 construct a suitable Liapunov function of the form ax 2 + cy 2 , 
where a and c are to be determined. Then show that the critical point at the origin is of the 
indicated type. 

1. dx/dt = —x 3 + xy 2 , dy/dt = — 2x 2 y — y 3 \ asymptotically stable 

2. dx/dt = — \x 3 + 2xy 2 , dy/dt=—y 3 \ asymptotically stable 

3. dx/dt = — x 3 + 2y 3 , dy/dt = —2xy 2 ; stable (at least) 

4. dx/dt = x 3 — y 3 , dy/dt = 2xy 2 + 4x 2 y + 2y 3 ; unstable 
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5. Consider the system of equations 


dx/dt = y — xf{x,y), dy/dt = — x — yf(x,y). 


where/ is continuous and has continuous first partial derivatives. Show that if/(x,y) > 0 
in some neighborhood of the origin, then the origin is an asymptotically stable critical 
point, and if f(x,y) < 0 in some neighborhood of the origin, then the origin is an unstable 
critical point. 

Hint: Construct a Liapunov function of the form c(x 2 + y 2 ). 

6. A generalization of the undamped pendulum equation is 


d 2 u/dt 2 + g(u) = 0, 


where g(0) = 0, g(u) > 0 for 0 < u < k, and g(u) < 0 for —k < u < 0; that is, ug(u) > 0 
for u 7^ 0, —k < u < k. Notice that g(u) = sinu has this property on (—n/2,n/2). 

(a) Letting x = u, y = du/dt , write Eq. (i) as a system of two equations, and show that 
x = 0, y = 0 is a critical point. 

(b) Show that 



—k < x < k 


(ii) 


is positive definite, and use this result to show that the critical point (0,0) is stable. 
Note that the Liapunov function V given by Eq. (ii) corresponds to the energy function 
V(x,y) = \y 2 + (1 — cosx) for the case g(u) = sin u. 

7. By introducing suitable dimensionless variables, we can write the system of nonlinear 
equations for the damped pendulum [Eqs. (8) of Section 9.3] as 


dx/dt = y, dy/dt = — y — sinx. 


(a) Show that the origin is a critical point. 

(b) Show that while V(x,y) = x 2 + y 2 is positive definite, V(x,y) takes on both positive 
and negative values in any domain containing the origin, so that V is not a Liapunov 
function. 

Hint: x — sinx > 0 for x > 0 and x — sinx < 0 for x < 0. Consider these cases with y 
positive but y so small that y 2 can be ignored compared to y. 

(c) Using the energy function V(x,y) = \y 2 + (1 — cosx) mentioned in Problem 6(b), 
show that the origin is a stable critical point. Since there is damping in the system, we 
can expect that the origin is asymptotically stable. However, it is not possible to draw this 
conclusion using this Liapunov function. 

(d) To show asymptotic stability, it is necessary to construct a better Liapunov function 
than the one used in part (c). Show that V (x, y) = | (x + y) 2 + x 2 + ^y 2 is such a Liapunov 
function, and conclude that the origin is an asymptotically stable critical point. 

Hint: From Taylor’s formula with a remainder it follows that sinx = x — ax 3 / 3!, where 
a depends on x but 0 < a < 1 for —n/2 < x < n/2. Then, letting x = rcos$,y = rsinO, 
show that U(rcos0,rsin0) = —r 2 [ 1 + h(r,0)], where \h(r,9)\ < 1 if r is sufficiently small. 

8. The Lienard equation (Problem 30 of Section 9.3) is 



where g satisfies the conditions of Problem 6 and c(u ) > 0. Show that the point u = 0, 
du/dt = 0 is a stable critical point. 
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9. (a) A special case of the Lienard equation of Problem 8 is 


d 2 u du 

_ + _ +g (n) = 0. 


where g satisfies the conditions of Problem 6. Letting x = u, y = du/dt, show that the 
origin is a critical point of the resulting system. This equation can be interpreted as 
describing the motion of a spring-mass system with damping proportional to the velocity 
and a nonlinear restoring force. Using the Liapunov function of Problem 6, show that 
the origin is a stable critical point, but note that even with damping, we cannot conclude 
asymptotic stability using this Liapunov function. 

(b) Asymptotic stability of the critical point (0, 0) can be shown by constructing a better 
Liapunov function, as was done in part (d) of Problem 7. However, the analysis for a 
general function g is somewhat sophisticated, and we mention only that an appropriate 
form for V is 


V(x,y) = iy 2 + 


Ayg(x)+ [ g(s)ds, 
Jo 


where A is a positive constant to be chosen so that V is positive definite and V is negative 
definite. For the pendulum problem g (x) = sin x; use V as given by the preceding equation 
with A = | to show that the origin is asymptotically stable. 

Hint: Use sinx = x — ax 3 /3\ and cosx = 1 — fix 2 / 2!, where a and fi depend on x, and 
0 < a < 1 and 0 < fi < 1 for —n/2 < x < tt/2; let x = rcosd, y = rsin#, and show 
that V(r cos8,r sin#) = — ^r 2 [l + \ sin 20 + h(r,9)], where \h(r,8)\ < \ if r is sufficiently 
small. To show that V is positive definite, use cosx = 1 — x 2 /2 + yx 4 /4!, where y depends 
on x, and 0 < y < 1 for —n/2 < x < jz / 2. 

In Problems 10 and 11 we will prove part of Theorem 9.3.2: if the critical point (0,0) of the 

locally linear system 


dx/dt = «nx + any + F\(x,y), dy/dt = a 2 ix + a 2 2 y + Gi{x,y) (i) 


is an asymptotically stable critical point of the corresponding linear system 


dx/dt = flux + «i 2 y, dy/dt = 021 * + a 2 iy , (ii) 

then it is an asymptotically stable critical point of the locally linear system (i). Problem 12 
deals with the corresponding result for instability. 

10. Consider the linear system (ii). 

(a) Since (0, 0) is an asymptotically stable critical point, show that an + a 22 < 0 and 
«ii «22 — aua 2 i > 0. (See Problem 21 of Section 9.1.) 

(b) Construct a Liapunov function V(x,y) = Ax 2 + Bxy + Cy 2 such that V is positive 
definite and V is negative definite. One way to ensure that V is negative definite is to 
choose A, B , and C so that V (x,y) = — x 2 — y 2 . Show that this leads to the result 

a 21 + a^ 2 + (ana 22 — ai 2 a 2 \) a\ 2 a 22 + an« 2 i 

’ B= A ’ 

fl ll + a \l + ( fl ll a 22 - fll2fl2l) 

2A ’ 


A = — 

C = - 


where A = (an + a 22 )(a n a 22 - «i 2 a 2 i)- 
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(c) Using the result of part (a), show that A > 0 and then show (several steps of algebra 
are required) that 

. d 2 ( fl ii + a v + a 2 i + a 22 )( a na 22 ~ «i 2 « 2 i) + 2(a n a 22 - a 12 a 21 ) 2 

4/1 C — D = > U. 

A 2 

Thus, by Theorem 9.6.4, V is positive definite. 

11. In this problem we show that the Liapunov function constructed in the preceding problem 
is also a Liapunov function for the locally linear system (i). We must show that there is 
some region containing the origin for which V is negative definite. 

(a) Show that 

V(x,y) = -(x 2 + y 2 ) + (2Ax + By)F\{x,y) + ( Bx + 2Cy)Gi(x,y). 

(b) Recall that /■) (x,y)/r 0 and Gi{x,y)/r -> 0 as r = (x 2 + y 2 ) 1/2 ->• 0. This means 

that, given any e > 0, there exists a circle r — R about the origin such that for 0 < r < R, 
\FAx,y)\ < er and \G\{x,,y)\ < er. Letting M be the maximum of \2A\, \B\, and |2C|, show 
by introducing polar coordinates that R can be chosen so that V (x,y) < 0 for r < R. 
Flint: Choose e sufficiently small in terms of M. 

12. In this problem we prove a part of Theorem 9.3.2 related to instability. 

(a) Show that if a n + a 22 > 0 and ana 22 — a 12 a 21 > 0, then the critical point (0, 0) of the 
linear system (ii) is unstable. 

(b) The same result holds for the locally linear system (i). As in Problems 10 and 11, 
construct a positive definite function V such that V(x,y) = x 2 + y 2 and hence is positive 
definite, and then invoke Theorem 9.6.2. 


9.7 Periodic Solutions and Limit Cycles 


In this section we discuss further the possible existence of periodic solutions of two- 
dimensional autonomous systems 

x' = f(x). (1) 

Such solutions satisfy the relation 

x(f + T) = \(t) (2) 

for all t and for some nonnegative constant T called the period. The corresponding 
trajectories are closed curves in the phase plane. Periodic solutions often play an 
important role in physical problems because they represent phenomena that occur 
repeatedly. In many situations a periodic solution represents a “final state” that is ap- 
proached by all “neighboring” solutions as the transients due to the initial conditions 
die out. 

A special case of a periodic solution is a constant solution x = x°, which corresponds 
to a critical point of the autonomous system. Such a solution is clearly periodic 
with any period. In this section, when we speak of a periodic solution, we mean a 
nonconstant periodic solution. In this case the period T is positive and is usually 
chosen as the smallest positive number for which Eq. (2) is valid. 
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Recall that the solutions of the linear autonomous system 

x' = Ax 


(3) 


are periodic if and only if the eigenvalues of A are pure imaginary. In this case the 
critical point at the origin is a center, as discussed in Section 9.1. We emphasize 
that if the eigenvalues of A are pure imaginary, then every solution of the linear 
system (3) is periodic, while if the eigenvalues are not pure imaginary, then there 
are no (nonconstant) periodic solutions. The predator-prey equations discussed in 
Section 9.5, although nonlinear, behave similarly: all solutions in the first quadrant 
are periodic. The following example illustrates a different way in which periodic 
solutions of nonlinear autonomous systems can occur. 


Discuss the solutions of the system 


EXAMPLE 



1 


(4) 


It is not difficult to show that (0, 0) is the only critical point of the system (4), and also that 
the system is locally linear in the neighborhood of the origin. The corresponding linear system 



(5) 


has eigenvalues 1 ± i. Therefore the origin is an unstable spiral point for both the linear system 
(5) and the nonlinear system (4). Thus any solution that starts near the origin in the phase 
plane will spiral away from the origin. Since there are no other critical points, we might think 
that all solutions of Eqs. (4) correspond to trajectories that spiral out to infinity. However, we 
now show that this is incorrect, because far away from the origin the trajectories are directed 
inward. 

It is convenient to introduce polar coordinates r and 0, where 


( 6 ) 


x=rcos9, y = rsin9. 


and r > 0. If we multiply the first of Eqs. (4) by x , the second by y, and add, we then obtain 



(V) 


Since r 2 = x 2 + y 2 and r(dr/dt) = x(dx/dt) + y(dy/dt), it follows from Eq. (7) that 



( 8 ) 


This equation is similar to the equations discussed in Section 2.5. The critical points (for r > 0) 
are the origin and the point r = 1, which corresponds to the unit circle in the phase plane. From 
Eq. (8) it follows that dr/dt > 0 if r < 1 and dr/dt < 0 if r > 1. Thus, inside the unit circle 
the trajectories are directed outward, while outside the unit circle they are directed inward. 
Apparently, the circle r = 1 is a limiting trajectory for this system. 

To determine an equation for 9 , we multiply the first of Eqs. (4) by y, the second by x, and 
subtract, obtaining 


dx dy 


u y 2.2 

■ = x + y . 



( 9 ) 
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Upon calculating dx/dt and dy/dt from Eqs. 
— r 2 (d6/dt ), so Eq. (9) reduces to 

d9 _ 
dt 


(6), we find that the left side of Eq. (9) is 


- 1 . 


( 10 ) 


The system of equations (8), (10) for r and 9 is equivalent to the original system (4). One 
solution of the system (8), (10) is 


r = 1, 9 = — t + 


( 11 ) 


where t 0 is an arbitrary constant. As t increases, a point satisfying Eqs. (11) moves clockwise 
around the unit circle. Thus the autonomous system (4) has a periodic solution. Other solutions 
can be obtained by solving Eq. (8) by separation of variables; if r j=- 0 and r ^ 1, then 


dr 

r ( 1 - r 2 ) 


= dt. 


( 12 ) 


Equation (12) can be solved by using partial fractions to rewrite the left side and then in- 
tegrating. By performing these calculations, we find that the solution of Eqs. (10) and (12) 
is 


1 

x/l + c 0 e~ 2 ' 


9 — —t + to. 


(13) 


where c 0 and t 0 are arbitrary constants. The solution (13) also contains the solution (11), which 
is obtained by setting c 0 = 0 in the first of Eqs. (13). 

The solution satisfying the initial conditions r = p,9 = a at t = 0 is given by 


1 

r = 9 = —{t — a). 

71 + [(1/p 2 ) - l]e -2 ' 


(14) 


If p < 1, then r — > 1 from the inside as t —*■ oo; if p > 1, then r — > 1 from the outside as t — > oo. 
Thus in all cases the trajectories spiral toward the circle r = 1 as t — > oo. Several trajectories 
are shown in Figure 9.7.1. 



FIGURE 9.7.1 Trajectories of the system (4); a limit cycle. 
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In this example, the circle r = 1 not only corresponds to periodic solutions of the 
system (4), but it also attracts other nonclosed trajectories that spiral toward it as 
t — ► oo. In general, a closed trajectory in the phase plane such that other nonclosed 
trajectories spiral toward it, either from the inside or the outside, as t — »■ oo, is called 
a limit cycle. Thus the circle r — 1 is a limit cycle for the system (4). If all trajectories 
that start near a closed trajectory (both inside and outside) spiral toward the closed 
trajectory as t — »■ oo, then the limit cycle is asymptotically stable. Since the limiting 
trajectory is itself a periodic orbit rather than an equilibrium point, this type of 
stability is often called orbital stability. If the trajectories on one side spiral toward 
the closed trajectory, while those on the other side spiral away as t — »■ oo, then the limit 
cycle is said to be semistable. If the trajectories on both sides of the closed trajectory 
spiral away as t — > oo, then the closed trajectory is unstable. It is also possible to 
have closed trajectories that other trajectories neither approach nor depart from — 
for example, the periodic solutions of the predator-prey equations in Section 9.5. In 
this case the closed trajectory is stable. 

In Example 1 the existence of an asymptotically stable limit cycle was established 
by solving the equations explicitly. Unfortunately, this is usually not possible, so it 
is worthwhile to know general theorems concerning the existence or nonexistence 
of limit cycles of nonlinear autonomous systems. In discussing these theorems, it is 
convenient to rewrite the system (1) in the scalar form 

dx/dt = F(x,y), dy/dt = G(x,y). (15) 


Theorem 9.7.1 Let the functions F and G have continuous first partial derivatives in a domain D 
of the xy-plane. A closed trajectory of the system (15) must necessarily enclose 
at least one critical (equilibrium) point. If it encloses only one critical point, the 
critical point cannot be a saddle point. 


Although we omit the proof of this theorem, it is easy to show examples of it. 
One is given by Example 1 and Figure 9.7.1 in which the closed trajectory encloses 
the critical point (0,0), a spiral point. Another example is the system of predator- 
prey equations in Section 9.5; see Figure 9.5.2. Each closed trajectory surrounds the 
critical point (3, 2); in this case the critical point is a center. 

Theorem 9.7.1 is also useful in a negative sense. If a given region contains no 
critical points, then there can be no closed trajectory lying entirely in the region. The 
same conclusion is true if the region contains only one critical point, and this point is 
a saddle point. For instance, in Example 2 of Section 9.4, an example of competing 
species, the only critical point in the interior of the first quadrant is the saddle point 
(0.5, 0.5). Therefore this system has no closed trajectory lying in the first quadrant. 

A second result about the nonexistence of closed trajectories is given in the fol- 
lowing theorem. 


Theorem 9.7.2 Let the functions F and G have continuous first partial derivatives in a simply 
connected domain D of the xy-plane. If F x + G y has the same sign throughout D, 
then there is no closed trajectory of the system (15) lying entirely in D. 
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A simply connected two-dimensional domain is one with no holes. Theorem 9.7.2 
is a straightforward consequence of Green’s theorem in the plane; see Problem 13. 
Note that if F x + G } changes sign in the domain, then no conclusion can be drawn; 
there may or may not be closed trajectories in D. 

To illustrate Theorem 9.7.2, consider the system (4). A routine calculation shows 
that 

F x (x,y) + G y (x,y) = 2 - 4(x 2 + y 2 ) = 2(1 - 2r 2 ), (16) 

where, as usual, r 2 = x 2 + y 2 . Hence F x + G y is positive for 0 < r < 1/V2, so there 
is no closed trajectory in this circular disk. Of course, we showed in Example 1 that 
there is no closed trajectory in the larger region r < 1. This illustrates that the infor- 
mation given by Theorem 9.7.2 may not be the best possible result. Again referring 
to Eq. (16), note that F x + G y < 0 for r > l/\/2. However, the theorem is not ap- 
plicable in this case because this annular region is not simply connected. Indeed, as 
shown in Example 1, it does contain a limit cycle. 

The following theorem gives conditions that guarantee the existence of a closed 
trajectory. 

Theorem 9.7.3 (Poincare-Bendixson 11 Theorem) Let the functions F and G have continuous first 
partial derivatives in a domain D of the xy-plane. Let D\ be a bounded subdomain 
in D, and let R be the region that consists of D\ plus its boundary (all points of R are 
in D ). Suppose that R contains no critical point of the system (15). If there exists a 
constant to such that x = <f>(t),y = \jr(t) is a solution of the system (15) that exists and 
stays in R for all t > to, then either x = (p{t),y = i jr(t) is a periodic solution (closed 
trajectory), ori= <p(t), y = i jr{t) spirals toward a closed trajectory as t oo. In 
either case, the system (15) has a periodic solution in R. 

Note that if R does contain a closed trajectory, then necessarily, by Theorem 9.7.1, 
this trajectory must enclose a critical point. However, this critical point cannot be in 
R. Thus R cannot be simply connected; it must have a hole. 

As an application of the Poincare-Bendixson theorem, consider again the system 
(4). Since the origin is a critical point, it must be excluded. For instance, we can 
consider the region R defined by 0.5 < r < 2. Next, we must show that there is a 
solution whose trajectory stays in R for all t greater than or equal to some t 0 . This 
follows immediately from Eq. (8). For r = 0.5, dr/dt > 0, so r increases, while for 
r — 2, dr/dt < 0, so r decreases. Thus any trajectory that crosses the boundary of R 
is entering R. Consequently, any solution of Eqs. (4) that starts in R at t = to cannot 
leave but must stay in R for t > to. Of course, other numbers could be used instead 
of 0.5 and 2; all that is important is that r = 1 is included. 

One should not infer from this discussion of the preceding theorems that it is easy 
to determine whether a given nonlinear autonomous system has periodic solutions; 
often it is not a simple matter at all. Theorems 9.7.1 and 9.7.2 are frequently inconclu- 
sive, and for Theorem 9.7.3 it is often difficult to determine a region R and a solution 
that always remains within it. 


11 Ivar Otto Bendixson (1861-1935) a Swedish mathematician, received his doctorate from Uppsala Uni- 
versity and was professor and for many years rector of Stockholm University. This theorem improved on 
an earlier result of Poincare and was published in a paper by Bendixson in Acta Mathemcitica in 1901. 
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EXAMPLE 

2 


We close this section with another example of a nonlinear system that has a limit 
cycle. 


The van der Pol 12 equation 

u" — n( 1 — u 2 )u' + u = 0, (17) 

where fi is a nonnegative constant, describes the current u in a triode oscillator. Discuss the 
solutions of this equation. 

If + = 0, Eq. (17) reduces to u" + u = 0, whose solutions are sine or cosine waves of period 
2n. For + > 0, the second term on the left side of Eq. (17) must also be considered. This is the 
resistance term, proportional to u', with a coefficient — +(1 — u 2 ) that depends on u. For large 
u, this term is positive and acts as usual to reduce the amplitude of the response. However, for 
small u, the resistance term is negative and so causes the response to grow. This suggests that 
perhaps there is a solution of intermediate size that other solutions approach as t increases. 

To analyze Eq. (17) more carefully, we write it as a system of two equations by introducing 
the variables x = u,y = u'. Then it follows that 

x' = y, y'= -x + +(1 - x 2 )y. (18) 

The only critical point of the system (18) is the origin. Near the origin the corresponding linear 
system is 



whose eigenvalues are (/r ± yj — 4) /2. Thus the origin is an unstable spiral point for 
0 < fi < 2 and an unstable node for fi >2. In all cases, a solution that starts near the ori- 
gin grows as t increases. 

With regard to periodic solutions, Theorems 9.7.1 and 9.7.2 provide only partial information. 
From Theorem 9.7.1 we conclude that if there are closed trajectories, they must enclose the 
origin. Next we calculate F x (x,y ) + G y (x,y), with the result that 

F x (x,y) + G y (x,y) = n(l - x 2 ). (20) 

Then it follows from Theorem 9.7.2 that closed trajectories, if there are any, are not contained 
in the strip \x\ < 1 where F x + G y > 0. 

The application of the Poincare-Bendixson theorem to this problem is not nearly as simple 
as for Example 1. If we introduce polar coordinates, we find that the equation for the radial 
variable r is 

r’ = +(1 — r 2 cos 2 9)r sin 2 0. (21) 

Again, consider an annular region R given by r y < r < r 2 , where r\ is small and r 2 is large. 
When r = r\, the linear term on the right side of Eq. (21) dominates, and r' > 0 except on 
the x-axis, where sin® =0 and consequently r' = 0 also. Thus trajectories are entering R at 
every point on the circle r = r y , except possibly for those on the x-axis, where the trajectories 
are tangent to the circle. When r = r 2 , the cubic term on the right side of Eq. (21) is the 
dominant one. Thus r' < 0, except for points on the x-axis where r' = 0 and for points near 
the y-axis where r 2 cos 2 9 < 1 and the linear term makes r' > 0. Thus, no matter how large 
a circle is chosen, there will be points on it (namely, the points on or near the y-axis) where 


12 Balthasar van der Pol (1889-1959) was a Dutch physicist and electrical engineer who worked at the 
Philips Research Laboratory in Eindhoven. He was a pioneer in the experimental study of nonlinear 
phenomena and investigated the equation that bears his name in a paper published in 1926. 
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trajectories are leaving R. Therefore, the Poincare-Bendixson theorem is not applicable unless 
we consider more complicated regions. 

It is possible to show, by a more intricate analysis, that the van der Pol equation does have a 
unique limit cycle. However, we will not follow this line of argument further. We turn instead 
to a different approach in which we plot numerically computed approximations to solutions. 
Experimental observations indicate that the van der Pol equation has an asymptotically stable 
periodic solution whose period and amplitude depend on the parameter /r. By looking at 
graphs of trajectories in the phase plane and of u versus ?, we can gain some understanding of 
this periodic behavior. 

Figure 9.7.2 shows two trajectories of the van der Pol equation in the phase plane for 
fi — 0.2. The trajectory starting near the origin spirals outward in the clockwise direction; this 
is consistent with the behavior of the linear approximation near the origin. The other trajectory 
passes through (—3,2) and spirals inward, again in the clockwise direction. Both trajectories 
approach a closed curve that corresponds to a stable periodic solution. In Figure 9.7.3 we 
show the plots of u versus t for the solutions corresponding to the trajectories in Figure 9.7.2. 
The solution that is initially smaller gradually increases in amplitude, while the larger solution 
gradually decays. Both solutions approach a stable periodic motion that corresponds to the 



FIGURE 9.7.2 Trajectories of the van der Pol equation (17) for // = 0.2. 



FIGURE 9.7.3 Plots of u versus t for the trajectories in Figure 9.7.2. 
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limit cycle. Figure 9.7.3 also shows that there is a phase difference between the two solutions as 
they approach the limit cycle. The plots of u versus t are nearly sinusoidal in shape, consistent 
with the nearly circular limit cycle in this case. 

Figures 9.7.4 and 9.7.5 show similar plots for the case /x = 1. Trajectories again move 
clockwise in the phase plane, but the limit cycle is considerably different from a circle. The plots 
of it versus t tend more rapidly to the limiting oscillation, and again show a phase difference. 
The oscillations are somewhat less symmetric in this case, rising somewhat more steeply than 
they fall. 



FIGURE 9.7.4 Trajectories of the van der Pol equation (17) for /x = 1. 



FIGURE 9.7.5 Plots of u versus t for the trajectories in Figure 9.7.4. 


Figure 9.7.6 shows the phase plane for /x = 5. The motion remains clockwise, and the limit 
cycle is even more elongated, especially in the y direction. Figure 9.7.7 is a plot of u versus t. 
Although the solution starts far from the limit cycle, the limiting oscillation is virtually reached 
in a fraction of a period. Starting from one of its extreme values on the x-axis in the phase 
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plane, the solution moves toward the other extreme position slowly at first, but once a certain 
point on the trajectory is reached, the remainder of the transition is completed very swiftly. 
The process is then repeated in the opposite direction. The waveform of the limit cycle, as 
shown in Figure 9.7.7, is quite different from a sine wave. 
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FIGURE 9.7.6 Trajectories of the van der Pol equation (17) for fj. = 5. 



FIGURE 9.7.7 Plot of u versus t for the outward spiralling trajectory in Figure 9.7.6. 


These graphs clearly show that, in the absence of external excitation, the van der Pol os- 
cillator has a certain characteristic mode of vibration for each value of /r. The graphs of u 
versus t show that the amplitude of this oscillation changes very little with /r, but the period 
increases as /r increases. At the same time, the waveform changes from one that is very nearly 
sinusoidal to one that is much less smooth. 

The presence of a single periodic motion that attracts all (nearby) solutions, that is, an asymp- 
totically stable limit cycle, is one of the characteristic phenomena associated with nonlinear 
differential equations. 
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PROBLEMS In each of Problems 1 through 6 an autonomous system is expressed in polar coordinates. 

Determine all periodic solutions, all limit cycles, and determine their stability characteristics. 

1. dr/dt = /•“ (1 — r 2 ), dd/dt = 1 2. dr/dt = r( 1 — r) 2 , dd/dt = —1 

3. dr/dt = r{r — l)(r — 3), dd/dt = 1 4. dr/dt = r(l — r)(r — 2), dd/dt = —l 

5. dr/dt = smnr, dd/dt = 1 

6. dr/dt = r|r — 2| (/• — 3), dd/dt =— 1 

7. Ifx = rcos#,y = r sin 0, show that y{dx/dt) —x(dy/dt) = —r 2 (dd /dt). 

8. (a) Show that the system 

dx/dt = — y + xf(r)/r, dy/dt = x + yf(r)/r 


has periodic solutions corresponding to the zeros of /(/•). What is the direction of motion 
on the closed trajectories in the phase plane? 

(b) Let f(r) = r(r — 2) 2 (r 2 — 4r + 3). Determine all periodic solutions and determine 
their stability characteristics. 

9. Determine the periodic solutions, if any, of the system 


dx x oo dy 

-r =y+ -=={x 2 + y 2 -2), — = -x + ' , 

dt yjx 1 + y 2 dt y x 2 + y- 


: (x 2 +y 2 -2). 


10. Using Theorem 9.7.2, show that the linear autonomous system 

dx/dt = sip + fl^y, dy/dt = « 2 iX + fl 2 ?y 


does not have a periodic solution (other than x = 0, y = 0) if an + a 2 2 7 ^ 0. 

In each of Problems 11 and 12 show that the given system has no periodic solutions other than 
constant solutions. 

11. dx/dt = x + y + x 3 — y 2 , dy/dt = — x + 2y + x 2 y + y 3 / 3 

12. dx/dt = — 2x — 3y — xy 2 , dy/dt = y + x 3 — x 2 y 

13. Prove Theorem 9.7.2 by completing the following argument. According to Green’s the- 
orem in the plane, if C is a sufficiently smooth simple closed curve, and if F and G are 
continuous and have continuous first partial derivatives, then 

J [F(x,y) dy - G(x,y) dx] = JJ [F x (x,y) + G y (x,y)] dA, 

where C is traversed counterclockwise and R is the region enclosed by C. Assume that 
x = <p(t),y = i jr(t) is a solution of the system (15) that is periodic with period T. Let C be 
the closed curve given by x = <p(t), y — i lr(t) for 0 < t < T. Show that for this curve the 
line integral is zero. Then show that the conclusion of Theorem 9.7.2 must follow. 

14. (a) By examining the graphs of u versus t in Figures 9.7.3, 9.7.5, and 9.7.7, estimate the 
period T of the van der Pol oscillator in these cases. 

(b) Calculate and plot the graphs of solutions of the van der Pol equation for other values 
of the parameter /r. Estimate the period T in these cases also. 

(c) Plot the estimated values of T versus /r. Describe how T depends on fi. 
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15. The equation 

u" — /x(l — \u' 2 )u' + u = 0 
is often called the Rayleigh 13 equation. 

(a) Write the Rayleigh equation as a system of two first order equations. 

(b) Show that the origin is the only critical point of this system. Determine its type and 
whether it is asymptotically stable, stable, or unstable. 

(c) Let /x = 1. Choose initial conditions and compute the corresponding solution of the 
system on an interval such as 0 < t < 20 or longer. Plot u versus t and also plot the 
trajectory in the phase plane. Observe that the trajectory approaches a closed curve (limit 
cycle). Estimate the amplitude A and the period T of the limit cycle. 

(d) Repeat part (c) for other values of /x, such as /x = 0.2, 0.5, 2, and 5. In each case 
estimate the amplitude A and the period T. 

(e) Describe how the limit cycle changes as /x increases. For example, make a table of 
values and/or plot A and T as functions of /x. 

16. Consider the system of equations 

x' = fix + y - x(x 2 + y 2 ), / = — x + fiy - y(x 2 +y 2 ), (i) 

where /x is a parameter. Observe that this system is the same as the one in Example 1, 
except for the introduction of /x. 

(a) Show that the origin is the only critical point. 

(b) Find the linear system that approximates Eqs. (i) near the origin and find its eigen- 
values. Determine the type and stability of the critical point at the origin. How does this 
classification depend on /x? 

(c) Referring to Example 1 if necessary, rewrite Eqs. (i) in polar coordinates. 

(d) Show that when /x > 0, there is a periodic solution r = yjl. By solving the system 
found in part (c), or by plotting numerically computed approximate solutions, conclude 
that this periodic solution attracts all other nonzero solutions. 

Note: As the parameter /x increases through the value zero, the previously asymptotically 
stable critical point at the origin loses its stability, and simultaneously a new asymptotically 
stable solution (the limit cycle) emerges. Thus the point /x = 0 is a bifurcation point; this 
type of bifurcation is called a Hopf 14 bifurcation. 

17. Consider the van der Pol system 

= y, y = -x + m( 1 - * 2 )y, 
where now we allow the parameter /x to be any real number. 

(a) Show that the origin is the only critical point. Determine its type and stability property, 
and how these depend on /x. 


13 John William Strutt (1842-1919), the third Lord Rayleigh, made notable contributions in several areas of 
mathematical physics. Apart from five years as Cavendish Professor of Physics at Cambridge, he worked 
primarily in his private laboratory at home. He was awarded the Nobel Prize in 1904 for the discovery of 
argon. 

14 Eberhard Hopf (1902-1983) was born in Austria and educated at the University of Berlin but spent 
much of his life in the United States, mainly at Indiana University. Hopf bifurcations are named for him 
in honor of his rigorous treatment of them in a 1942 paper. 


9. 7 Periodic Solutions and Limit Cycles 


565 


(b) Let = —1; draw a phase portrait and conclude that there is a periodic solution that 
surrounds the origin. Observe that this periodic solution is unstable. Compare your plot 
with Figure 9.7.4. 

(c) Draw a phase portrait for a few other negative values of /r. Describe how the shape 
of the periodic solution changes with fi. 

(d) Consider small positive or negative values of /r. By drawing phase portraits, determine 
how the periodic solution changes as fi — *■ 0. Compare the behavior of the van der Pol 
system as fi increases through zero with the behavior of the system in Problem 16. 

Problems 18 and 19 extend the consideration of the Rosenzweig-MacArthur predator- 
prey model introduced in Problem 13 of Section 9.5. 

18. Consider the system 



Observe that this system differs from that in Problem 13 of Section 9.5 only in the growth 
rate for the prey. 

(a) Find all of the critical points. 

(b) Determine the type and stability of each critical point. 

(c) Draw a phase portrait in the first quadrant and conclude that there is an asymptotically 
stable limit cycle. Thus this model predicts a stable long-term oscillation of the prey and 
predator populations. 

19. Consider the system 



where a is a positive parameter. Observe that this system includes the one in Problem 18 
above and also the one in Problem 13 in Section 9.5. 

(a) Find all of the critical points. 

(b) Consider the critical point in the interior of the first quadrant. Find the eigenvalues of 
the approximate linear system. Determine the value a 0 where this critical point changes 
from asymptotically stable to unstable. 

(c) Draw a phase portrait for a value of a slightly greater than a 0 . Observe that a limit 
cycle has appeared. Flow does the limit cycle change as a increases further? 

20. There are certain chemical reactions in which the constituent concentrations oscillate 
periodically over time. The system 


x! = 1 - (b T l)x + x 2 y/4, y' = bx-x 2 y ! 4 


is a special case of a model, known as the Brusselator, of this kind of reaction. Assume 
that b is a positive parameter, and consider solutions in the first quadrant of the xy-plane. 

(a) Show that the only critical point is (1,4 b). 

(b) Find the eigenvalues of the approximate linear system at the critical point. 

(c) Classify the critical point as to type and stability. How does the classification depend 
on bl 

(d) As b increases through a certain value h 0 , the critical point changes from asymptoti- 
cally stable to unstable. What is that value ho? 
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(e) Plot trajectories in the phase plane for values of b slightly less than and slightly greater 
than ho- Observe the limit cycle when b > ho; the Brusselator has a Hopf bifurcation point 
at h 0 . 

(f) Plot trajectories for several values of h > h 0 and observe how the limit cycle deforms 
as h increases. 

21. The system 

x! = 3(x + y — | x 3 — k), y' = — f(x + 0.8y — 0.7) 

is a special case of the Fitzhugh-Nagumo 15 equations, which model the transmission of 
neural impulses along an axon. The parameter k is the external stimulus. 

(a) Show that the system has one critical point regardless of the value of k. 

(b) Find the critical point for k = 0 and show that it is an asymptotically stable spiral 
point. Repeat the analysis for k = 0.5 and show that the critical point is now an unstable 
spiral point. Draw a phase portrait for the system in each case. 

(c) Find the value k 0 where the critical point changes from asymptotically stable to un- 
stable. Find the critical point and draw a phase portrait for the system for k = ko- 

(d) For k > k 0 the system exhibits an asymptotically stable limit cycle; the system has a 
Flopf bifurcation point at k 0 . Draw a phase portrait for k = 0.4, 0.5, and 0.6; observe that 
the limit cycle is not small when k is near k 0 . Also plot x versus t and estimate the period 
T in each case. 

(e) As k increases further, there is a value k\ at which the critical point again becomes 
asymptotically stable and the limit cycle vanishes. Find k\ . 


9.8 Chaos and Strange Attractors: The Lorenz Equations 


In principle, the methods described in this chapter for second order autonomous sys- 
tems can be applied to higher order systems as well. In practice, several difficulties 
arise when we try to do this. One problem is that there is simply a greater num- 
ber of possible cases that can occur, and the number increases with the number of 
equations in the system (and the dimension of the phase space). Another problem 
is the difficulty of graphing trajectories accurately in a phase space of more than two 
dimensions; even in three dimensions it may not be easy to construct a clear and 
understandable plot of the trajectories, and it becomes more difficult as the number 
of variables increases. Finally, and this has been clearly realized only fairly recently, 
there are different and very complex phenomena that can occur, and frequently do 
occur, in systems of third and higher order that are not present in second order sys- 
tems. Our goal in this section is to provide a brief introduction to some of these 
phenomena by discussing one particular three-dimensional autonomous system that 
has been intensively studied. In some respects, the presentation here is similar to the 
treatment of the logistic difference equation in Section 2.9. 


15 Richard Fitzhugh (1922-2007) of the United States Public Flealth Service and Jin-Ichi Nagumo (1926— 
1999) of the University of Tokyo independently proposed a simplification of the Hodgkin-Huxley model 
of neural transmission around 1961. 
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An important problem in meteorology, and in other applications of fluid dynamics, 
concerns the motion of a layer of fluid, such as the earth’s atmosphere, that is warmer 
at the bottom than at the top; see Figure 9.8.1. If the vertical temperature difference 
AT is small, then there is a linear variation of temperature with altitude but no 
significant motion of the fluid layer. Flowever, if A T is large enough, then the warmer 
air rises, displacing the cooler air above it, and a steady convective motion results. If 
the temperature difference increases further, then eventually the steady convective 
flow breaks up and a more complex and turbulent motion ensues. 


T Temperature 
J difference AT 

Cooler 


Warmer 

FIGURE 9.8.1 A layer of fluid heated from below. 


While investigating this phenomenon, Edward N. Lorenz 16 was led (by a process 
too involved to describe here) to the nonlinear autonomous three-dimensional sys- 
tem 


dx/dt = <j(—x + y ), 

dy/dt = rx — y — xz, (1) 

dz/dt = — bz + xy. 

Equations (1) are now commonly referred to as the Lorenz equations. 17 Observe that 
the second and third equations involve quadratic nonlinearities. However, except 
for being a system of three equations, superficially the Lorenz equations appear no 
more complicated than the competing species or predator-prey equations discussed 
in Sections 9.4 and 9.5. The variable x in Eqs. (1) is related to the intensity of the 
fluid motion, while the variables y and z are related to the temperature variations 
in the horizontal and vertical directions. The Lorenz equations also involve three 
parameters a, r, and b, all of which are real and positive. The parameters a and b 
depend on the material and geometrical properties of the fluid layer. For the earth’s 
atmosphere, reasonable values of these parameters are a = 10 and b = 8/3; they will 
be assigned these values in much of what follows in this section. The parameter r, on 
the other hand, is proportional to the temperature difference A T, and our purpose 
is to investigate how the nature of the solutions of Eqs. (1) changes with r. 


16 Edward N. Lorenz (1917-2008), American meteorologist, received his Ph.D. from the Massachusetts 
Institute of Technology in 1948 and has been associated with that institution throughout his scientific 
career. His first studies of the system (1) appeared in a famous 1963 paper dealing with the stability of 
fluid flows in the atmosphere. 

17 A very thorough treatment of the Lorenz equations appears in the book by Sparrow listed in the 
references at the end of this chapter. 
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Before proceeding further, we note that for an autonomous system of three first 
order equations 

dx/dt = F(x,y,z), dy/dt — G(x,y,z), dz/dt = H(x,y,z), (2) 

the Jacobian matrix J is defined by 


/ F x F y F z \ 


J = 


G, G y G z 
H x H y IF 


(3) 


Thus, for the Lorenz equations (1), the Jacobian matrix is 


( 


0 \ 


r — z —1 — x 
\ y x -b 


(4) 


The first step in analyzing the Lorenz equations is to locate the critical points by 
solving the algebraic system 


ax — ay — 0, 
rx — y — xz — 0, 
— bz + xy = 0. 


(5) 


From the first equation we have y — x. Then, eliminating y from the second and third 
equations, we obtain 

x(r - 1 - z) = 0, (6) 

-bz + x 2 = 0. (7) 

One way to satisfy Eq. (6) is to choose x — 0. Then it follows that y = 0 and, from 
Eq. (7), z = 0. Alternatively, we can satisfy Eq. (6) by choosing z = r — 1. Then 
Eq. (7) requires that x = ±*Jb(r — 1) and then y = ±*Jb(r - 1) also. Observe that 
these expressions for x and y are real only when r > 1. Thus (0,0,0), which we will 
denote by Pi, is a critical point for all values of r, and it is the only critical point 
for r < 1. However, when r > 1, there are also two other critical points, namely, 
(. \/b{r — 1), s/b(r — I ), r — 1) and {—■ s /b(r — 1), —y/b(r — 1 ),r — 1). We will denote 
the latter two points by P 2 and P3, respectively. Note that all three critical points 
coincide when r — 1. As r increases through the value 1, the critical point P\ at the 
origin bifurcates, and the critical points P 2 and P 3 come into existence. 

Next we will determine the local behavior of solutions in the neighborhood of 
each critical point. Although much of the following analysis can be carried out for 
arbitrary values of a and b, we will simplify our work by using the values a — 10 and 
b = 8/3. Near the origin (the critical point Pi) the approximating linear system is 
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The eigenvalues 18 are determined from the equation 


- 10-1 10 
r -1-1 

0 0 


0 

0 


= -(8/3 + 1)[1 2 + 111 - 10 (r - 1)] = 0. 


-8/3-1 


( 9 ) 


Therefore 


li = - 


8 

3’ 


^2 


-11 - V81 + 40r 
2 


^"3 


-11 + V81 + 40/' 
2 


(10) 


Note that all three eigenvalues are negative for r < l; for example, when r = 1/2, 
the eigenvalues are li = —8/3, 12 = —10.52494, 13 = —0.47506. Hence the origin is 
asymptotically stable for this range of r both for the linear approximation (8) and 
for the original system (1). However, 1 3 changes sign when r — 1 and is positive for 
r > 1. The value r = 1 corresponds to the initiation of convective flow in the physical 
problem described earlier. The origin is unstable for r > 1; all solutions starting near 
the origin tend to grow, except for those lying precisely in the plane determined by the 
eigenvectors associated with k\ and X 2 [or, for the nonlinear system (1), in a certain 
surface tangent to this plane at the origin]. 

Next let us consider the neighborhood of the critical point — l)/3, 

s/8(r — l)/3, r — 1) for r > 1. If u, v, and w are the perturbations from the criti- 
cal point in the x, y, and z directions, respectively, then the approximating linear 
system is 
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/ -10 
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The eigenvalues of the coefficient matrix of Eq. (11) are determined from the equa- 
tion 


3A 3 + 4U 2 + 8(r + 10)1 + 160(r - 1) = 0, (12) 


which is obtained by straightforward algebraic steps that are omitted here. The 
solutions of Eq. (12) depend on r in the following way: 

For 1 < r < r\ = 1.3456 there are three negative real eigenvalues. 

For i'i < r < r 2 = 24.737 there are one negative real eigenvalue and two complex 
eigenvalues with negative real part. 

For r 2 <r there are one negative real eigenvalue and two complex eigenvalues 
with positive real part. 

The same results are obtained for the critical point P 3 . Thus there are several 
different situations. 

For 0 < r < 1 the only critical point is P\ and it is asymptotically stable. All solu- 
tions approach this point (the origin) as t — 00 . 

For 1 < r < r\ the critical points Pi and P 3 are asymptotically stable and P\ is 
unstable. All nearby solutions approach one or the other of the points P 2 and Pi 
exponentially. 


18 Since r appears as a parameter in the Lorenz equations, we will use \ to denote the eigenvalues. 
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For r\ < r < r 2 the critical points P 2 and P 3 are asymptotically stable and P\ is 
unstable. All nearby solutions approach one or the other of the points P 2 and P 2 ; 
most of them spiral inward to the critical point. 

For r 2 < r all three critical points are unstable. Most solutions near P 2 or P 3 spiral 
away from the critical point. 

However, this is by no means the end of the story. Let us consider solutions for 
r somewhat greater than r 2 . In this case Pi has one positive eigenvalue and each 
of P 2 and P 3 has a pair of complex eigenvalues with positive real part. A trajectory 
can approach any one of the critical points only on certain highly restricted paths. 
The slightest deviation from these paths causes the trajectory to depart from the 
critical point. Since none of the critical points is stable, one might expect that most 
trajectories would approach infinity for large f. However, it can be shown that all 
solutions remain bounded as t — >■ 00 ; see Problem 5. In fact, it can be shown that all 
solutions ultimately approach a certain limiting set of points that has zero volume. 
Indeed, this is true not only for r > r 2 but for all positive values of r. 

A plot of computed values of x versus t for a typical solution with r > r 2 is shown 
in Figure 9.8.2. Note that the solution oscillates back and forth between positive and 
negative values in a rather erratic manner. Indeed, the graph of x versus t resembles 
a random vibration, although the Lorenz equations are entirely deterministic and 
the solution is completely determined by the initial conditions. Nevertheless, the 
solution also exhibits a certain regularity in that the frequency and amplitude of the 
oscillations are essentially constant in time. 



FIGURE 9.8.2 A plot of x versus t for the Lorenz equations (1 ) with r = 28; the initial point 
is (5,5,5). 


The solutions of the Lorenz equations are also extremely sensitive to perturbations 
in the initial conditions. Figure 9.8.3 shows the graphs of computed values of x versus 
t for the two solutions whose initial points are (5,5,5) and (5.01,5,5). The dashed 
graph is the same as the one in Figure 9.8.2, while the solid graph starts at a nearby 
point. The two solutions remain close until t is near 10, after which they are quite 
different and, indeed, seem to have no relation to each other. It was this property that 
particularly attracted the attention of Lorenz in his original study of these equations, 
and caused him to conclude that accurate detailed long-range weather predictions 
are probably not possible. 
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FIGURE 9.8.3 Plots of x versus t for two initially nearby solutions of Lorenz equations with 
r = 28; the initial point is (5, 5, 5) for the dashed curve and is (5.01, 5, 5) for the solid curve. 


The attracting set in this case, although of zero volume, has a rather complicated 
structure and is called a strange attractor. The term chaotic has come into general 
use to describe solutions such as those shown in Figures 9.8.2 and 9.8.3. 

To determine how and when the strange attractor is created, it is illuminating to 
investigate solutions for smaller values of r. For r — 21, solutions starting at three dif- 
ferent initial points are shown in Figure 9.8.4. For the initial point (3,8,0) the solution 





(a) Initial point is (3, 8, 0). ( b ) Initial point is (5, 5, 5). (c) Initial point is (5, 5, 10). 
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begins to converge to the point P 3 almost at once; see Figure 9.8.4a. For the second 
initial point (5, 5, 5) there is a fairly short interval of transient behavior, after which 
the solution converges to Pi\ see Figure 9.8.45. However, as shown in Figure 9.8.4c, 
for the third initial point (5, 5, 10) there is a much longer interval of transient chaotic 
behavior before the solution eventually converges to Pi. As r increases, the dura- 
tion of the chaotic transient behavior also increases. Whenr = = 24.06, the chaotic 

transients appear to continue indefinitely, and the strange attractor comes into being. 

We can also show the trajectories of the Lorenz equations in the three- 
dimensional phase space, or at least projections of them in various planes. Fig- 
ures 9.8.5 and 9.8.6 show projections in the xy- and xz-planes, respectively, of the 
trajectory starting at (5,5,5). Observe that the graphs in these figures appear to 
cross over themselves repeatedly, but this cannot be true for the actual trajectories in 
three-dimensional space because of the general uniqueness theorem. The apparent 
crossings are due wholly to the two-dimensional character of the figures. 



FIGURE 9.8.5 Projections of a trajectory of the Lorenz equations (with r = 28) in the 
xy-plane. 



FIGURE 9.8.6 Projections of a trajectory of the Lorenz equations (with r = 28) in the 
XZ' -plane. 
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The sensitivity of solutions to perturbations of the initial data also has implications 
for numerical computations, such as those reported here. Different step sizes, differ- 
ent numerical algorithms, or even the execution of the same algorithm on different 
machines will introduce small differences in the computed solution, which eventually 
lead to large deviations. For example, the exact sequence of positive and negative 
loops in the calculated solution depends strongly on the precise numerical algorithm 
and its implementation, as well as on the initial conditions. However, the general 
appearance of the solution and the structure of the attracting set are independent of 
all these factors. 

Solutions of the Lorenz equations for other parameter ranges exhibit other in- 
teresting types of behavior. For example, for certain values of r greater than rj, 
intermittent bursts of chaotic behavior separate long intervals of apparently steady 
periodic oscillation. For other ranges of r, solutions show the period-doubling prop- 
erty that we saw in Section 2.9 for the logistic difference equation. Some of these 
features are taken up in the problems. 

Since about 1975 the Lorenz equations and other higher dimensional autonomous 
systems have been studied intensively, and this is one of the most active areas of cur- 
rent mathematical research. Chaotic behavior of solutions appears to be much more 
common than was suspected at first, and many questions remain unanswered. Some 
of these are mathematical in nature, while others relate to the physical applications 
or interpretations of solutions. 


PROBLEMS Problems 1 through 3 ask you to fill in some of the details of the analysis of the Lorenz equations 

in this section. 

1. (a) Show that the eigenvalues of the linear system (8), valid near the origin, are given by 
Eq. (10). 

(b) Determine the corresponding eigenvectors. 

(c) Determine the eigenvalues and eigenvectors of the system (8) in the case where r = 28. 

2. (a) Show that the linear approximation valid near the critical point P 2 is given by Eq. (11). 

(b) Show that the eigenvalues of the system (11) satisfy Eq. (12). 

(c) For r = 28, solve Eq. (12) and thereby determine the eigenvalues of the system (11). 

3. (a) By solving Eq. (12) numerically, show that the real part of the complex roots changes 
sign when r = 24.737. 

(b) Show that a cubic polynomial x 3 4 5 + Ax 2 + Bx + C has one real zero and two pure 
imaginary zeros only if AB — C. 

(c) By applying the result of part (b) to Eq. (12), show that the real part of the complex 
roots changes sign when r = 470/19. 

4. Use the Liapunov function V ( x , y, z) = x 2 + cry 2 + az 2 to show that the origin is a globally 
asymptotically stable critical point for the Lorenz equations (1) if r < 1. 

5. Consider the ellipsoid 


V(x,y,z) = rx 2 + cry 2 + cr(z - 2 r) * 1 = c > 0. 

(a) Calculate dV /dt along trajectories of the Lorenz equations (1). 

(b) Determine a sufficient condition on c so that every trajectory crossing V(x,y,z) = c 
is directed inward. 

(c) Evaluate the condition found in part (b) for the case a = 10, b = 8/3, r = 28. 


574 


Chapter 9. Nonlinear Differential Equations and Stability 


In each of Problems 6 through 10 carry out the indicated investigations of the Lorenz equations. 

6. For r = 28 plot x versus t for the cases shown in Figures 9.8.2 and 9.8.3. Do your graphs 
agree with those shown in the figures? Recall the discussion of numerical computation in 
the text. 

7. For r = 28 plot the projections in the xy- and xz-planes, respectively, of the trajectory 
starting at the point (5, 5, 5). Do the graphs agree with those in Figures 9.8.5 and 9.8.6? 

8. (a) For r = 21 plot x versus t for the solutions starting at the initial points (3, 8, 0), (5, 5, 5), 
and (5, 5, 10). Use a t interval of at least 0 < t < 30. Compare your graphs with those in 
Figure 9.8.4. 

(b) Repeat the calculation in part (a) for r = 22, r = 23, and r = 24. Increase the t interval 
as necessary so that you can determine when each solution begins to converge to one of 
the critical points. Record the approximate duration of the chaotic transient in each case. 
Describe how this quantity depends on the value of r. 

(c) Repeat the calculations in parts (a) and (b) for values of r slightly greater than 24. 
Try to estimate the value of r for which the duration of the chaotic transient approaches 
infinity. 

9. For certain r intervals, or windows, the Lorenz equations exhibit a period-doubling prop- 
erty similar to that of the logistic difference equation discussed in Section 2.9. Careful 
calculations may reveal this phenomenon. 

(a) One period-doubling window contains the value r = 100. Let r = 100 and plot the 
trajectory starting at (5, 5, 5) or some other initial point of your choice. Does the solution 
appear to be periodic? What is the period? 

(b) Repeat the calculation in part (a) for slightly smaller values of r. When r = 99.98, you 
may be able to observe that the period of the solution doubles. Try to observe this result 
by performing calculations with nearby values of r. 

(c) As r decreases further, the period of the solution doubles repeatedly. The next period 
doubling occurs at about r = 99.629. Try to observe this by plotting trajectories for nearby 
values of r. 

10. Now consider values of r slightly larger than those in Problem 9. 

(a) Plot trajectories of the Lorenz equations for values of r between 100 and 100.78. You 
should observe a steady periodic solution for this range of r values. 

(b) Plot trajectories for values of r between 100.78 and 100.8. Determine as best you can 
how and when the periodic trajectory breaks up. 

The Rossler 19 System. The system 

x' = —y - z, y' = x + ay, z! = b + z(x - c), (i) 

where a, b, and c are positive parameters, is known as the Rossler 20 system. It is a relatively 
simple system, consisting of two linear equations and a third equation with a single quadratic 
nonlinearity. In Problems 11 through 15 we ask you to carry out some numerical investigations 
of this system, with the goal of exploring its period-doubling property. To simplify matters set 
a = 0.25, b = 0.5, and let c > 0 remain arbitrary. 


19 Otto E. Rossler (1940- ), German medical doctor and biochemist, was a student and later became a 

faculty member at the University of Tubingen. The equations named for him first appeared in a paper he 
published in 1976. 

20 See the book by Strogatz for a more extensive discussion and further references. 
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11. (a) Show that there are no critical points when c < \/(L5, one critical point for c = VoTS, 
and two critical points when c > V0.5. 

(b) Find the critical point(s) and determine the eigenvalues of the associated Jacobian 
matrix when c = V(L5 and when c = 1. 


(c) How do you think trajectories of the system will behave for c = 1? Plot the trajectory 
starting at the origin. Does it behave the way that you expected? 

(d) Choose one or two other initial points and plot the corresponding trajectories. Do 
these plots agree with your expectations? 


12. (a) Let c = 1.3. Find the critical points and the corresponding eigenvalues. What conclu- 
sions, if any, can you draw from this information? 

(b) Plot the trajectory starting at the origin. What is the limiting behavior of this trajec- 
tory? To see the limiting behavior clearly you may wish to choose a t-interval for your 
plot so that the initial transients are eliminated. 

(c) Choose one or two other initial points and plot the corresponding trajectories. Are 
the limiting behavior(s) the same as in part (b)? 

(d) Observe that there is a limit cycle whose basin of attraction is fairly large (although 
not allofxyz-space). Draw a plot of x,y, or z versus t and estimate the period 7j of motion 
around the limit cycle. 


4fl 13 - 


4fl 14 - 


The limit cycle found in Problem 12 comes into existence as a result of a Hopf bifurcation 
at a value ci of c between 1 and 1.3. Determine, or at least estimate more precisely, the 
value of ci. There are several ways in which you might do this. 

(a) Draw plots of trajectories for different values of c. 

(b) Calculate eigenvalues at critical points for different values of c. 

(c) Use the result of Problem 3(b) above. 

(a) Let c = 3. Find the critical points and the corresponding eigenvalues. 

(b) Plot the trajectory starting at the point (1,0, —2). Observe that the limit cycle now 
consists of two loops before it closes; it is often called a 2-cycle. 

(c) Plot x, y, or z versus t and show that the period T 2 of motion on the 2-cycle is very 
nearly double the period 7j of the simple limit cycle in Problem 12. There has been a 
period-doubling bifurcation of cycles for a certain value of c between 1.3 and 3. 


15. (a) Let c = 3.8. Find the critical points and the corresponding eigenvalues. 

(b) Plot the trajectory starting at the point (1,0, —2). Observe that the limit cycle is now a 
4-cycle. Find the period T 4 of motion. Another period-doubling bifurcation has occurred 
for c between 3 and 3.8. 


(c) For c = 3.85 show that the limit cycle is an 8-cycle. Verify that its period is very close 
to eight times the period of the simple limit cycle in Problem 12. 

Note: As c increases further there is an accelerating cascade of period-doubling bifurca- 
tions. The bifurcation values of c converge to a limit, which marks the onset of chaos. 
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CHAPTER 


Partial 
Differential 
Equations and 
Fourier Series 


In many important physical problems there are two or more independent variables, 
so the corresponding mathematical models involve partial, rather than ordinary, dif- 
ferential equations. This chapter treats one important method for solving partial dif- 
ferential equations, a method known as separation of variables. Its essential feature 
is the replacement of the partial differential equation by a set of ordinary differen- 
tial equations, which must be solved subject to given initial or boundary conditions. 
The first section of this chapter deals with some basic properties of boundary value 
problems for ordinary differential equations. The desired solution of the partial dif- 
ferential equation is then expressed as a sum, usually an infinite series, formed from 
solutions of the ordinary differential equations. In many cases we ultimately need 
to deal with a series of sines and/or cosines, so part of the chapter is devoted to a 
discussion of such series, which are known as Fourier series. With the necessary math- 
ematical background in place, we then illustrate the use of separation of variables in 
a variety of problems arising from heat conduction, wave propagation, and potential 
theory. 


10.1 Two-Point Boundary Value Problems 


Up to this point in the book we have dealt with initial value problems, consisting 
of a differential equation together with suitable initial conditions at a given point. 
A typical example, which was discussed at length in Chapter 3, is the differential 
equation 


y" + p(t)y' + q(t)y = g(t), 


( 1 ) 
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with the initial conditions 


yUo) = yo, y'(to) = y'o- ( 2 ) 

Physical applications often lead to another type of problem, one in which the value 
of the dependent variable y or its derivative is specified at two different points. Such 
conditions are called boundary conditions to distinguish them from initial conditions 
that specify the value of y and y 1 at the same point. A differential equation and 
suitable boundary conditions form a two-point boundary value problem. A typical 
example is the differential equation 

y" +p{x)y' + q{x)y = g(x) (3) 

with the boundary conditions 

_v(qO = y 0 , y(P) = yi- (4) 

The natural occurrence of boundary value problems usually involves a space coordi- 
nate as the independent variable, so we have used x rather than t in Eqs. (3) and (4). 
To solve the boundary value problem (3), (4), we need to find a function y = <p(x) that 
satisfies the differential equation (3) in the interval a < x < ft and that takes on the 
specified values yo and y i at the endpoints of the interval. Usually, we first seek the 
general solution of the differential equation and then use the boundary conditions 
to determine the values of the arbitrary constants. 

Boundary value problems can also be posed for nonlinear differential equations, 
but we will restrict ourselves to a consideration of linear equations only. An important 
classification of linear boundary value problems is whether they are homogeneous or 
nonhomogeneous. If the function g has the value zero for each x, and if the boundary 
values yo and yi are also zero, then the problem (3), (4) is called homogeneous. 
Otherwise, the problem is nonhomogeneous. 

Although the initial value problem (1), (2) and the boundary value problem (3), 
(4) may superficially appear to be quite similar, their solutions differ in some very 
important ways. Under mild conditions on the coefficients initial value problems 
are certain to have a unique solution. On the other hand, boundary value problems 
under similar conditions may have a unique solution, but they may also have no 
solution or, in some cases, infinitely many solutions. In this respect, linear boundary 
value problems resemble systems of linear algebraic equations. 

Let us recall some facts (see Section 7.3) about the system 

Ax = b, (5) 

where A is a given n x n matrix, b is a given n x 1 vector, and x is an n x 1 vector to 
be determined. If A is nonsingular, then the system (5) has a unique solution for any 
b. However, if A is singular, then the system (5) has no solution unless b satisfies a 
certain additional condition, in which case the system has infinitely many solutions. 
Now consider the corresponding homogeneous system 

Ax = 0, (6) 

obtained from the system (5) when b = 0. The homogeneous system (6) always has 
the solution x = 0, which is often referred to as the trivial solution. If A is nonsingular, 
then this is the only solution, but if A is singular, then there are infinitely many 
nonzero, or nontrivial, solutions. Note that it is impossible for the homogeneous 
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system to have no solution. These results can also be stated in the following way: the 
nonhomogeneous system (5) has a unique solution if and only if the homogeneous 
system (6) has only the solution x = 0 , and the nonhomogeneous system (5) has 
either no solution or infinitely many solutions if and only if the homogeneous system 
(6) has nonzero solutions. 

We now turn to some examples of linear boundary value problems that illustrate 
very similar behavior. A more general discussion of linear boundary value problems 
appears in Chapter 11. 


EXAMPLE 

1 


Solve the boundary value problem 

y" + 2y = 0, y(0) = 1, yin) — 0. (7) 

The general solution of the differential equation (7) is 

y = Ci cos Vlx + C 2 sin V2x. (8) 

The first boundary condition requires that ci = 1. The second boundary condition implies 
that ci cos \fl jt + C 2 sin V 2 n = 0, so C 2 = — cot -Jl. n = —0.2762. Thus the solution of the 
boundary value problem (7) is 

y = cos\/2x — cotv^jr sin->/2x. (9) 

This example illustrates the case of a nonhomogeneous boundary value problem with a unique 
solution. 


EXAMPLE 

2 


Solve the boundary value problem 


y" + y = 0, y(0) = 1, y{n) = a , 


(10) 


where a is a given number. 

The general solution of this differential equation is 

y = c\ cosx + c 2 sinx, (11) 

and from the first boundary condition we find that C\ = 1. The second boundary condition 
now requires that — ci = a. These two conditions on ci are incompatible if a ^ —1, so the 
problem has no solution in that case. However, if a = —1, then both boundary conditions are 
satisfied provided that Ci = 1, regardless of the value of c 2 . In this case there are infinitely 
many solutions of the form 

y = cosx + c 2 sin x, (12) 


where C 2 remains arbitrary. This example illustrates that a nonhomogeneous boundary value 
problem may have no solution, and also that under special circumstances it may have infinitely 
many solutions. 


Corresponding to the nonhomogeneous boundary value problem (3), (4) is the 
homogeneous problem consisting of the differential equation 

y" +P(x)y' + qix)y = 0 


(13) 
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and the boundary conditions 

y(a) = 0, y( fi) = 0. (14) 

Observe that this problem has the solution y = 0 for all x, regardless of the coefficients 
p(x) and q(x). This (trivial) solution is rarely of interest. What we usually want to 
know is whether the problem has other, nonzero, solutions. Consider the following 
two examples. 


EXAMPLE 

3 


Solve the boundary value problem 

y" + 2y = 0, y(0)=0, y(yt) = 0. (15) 

The general solution of the differential equation is again given by Eq. (8), 

y = ci cos\/2x + C 2 sinV2x. 

The first boundary condition requires that ci = 0, and the second boundary condition leads to 
c 2 sin V2 jt = 0. Since sin n ^ 0, it follows that c 2 = 0 also. Consequently, y = 0 for all x is 
the only solution of the problem (15). This example illustrates that a homogeneous boundary 
value problem may have only the trivial solution y = 0. 


EXAMPLE 

4 


Solve the boundary value problem 

y" + y = 0, y(0) = 0, y(jr) = 0. (16) 

The general solution is given by Eq. (11), 

y = c\ cosx + C 2 sinx, 

and the first boundary condition requires that C\ = 0. Since sin 7r = 0, the second boundary 
condition is also satisfied when c\ = 0, regardless of the value of c 2 . Thus the solution of 
the problem (16) is y = c 2 sinx, where c 2 remains arbitrary. This example illustrates that a 
homogeneous boundary value problem may have infinitely many solutions. 


Examples 1 through 4 illustrate (but of course do not prove) that there is the same 
relationship between homogeneous and nonhomogeneous linear boundary value 
problems as there is between homogeneous and nonhomogeneous linear algebraic 
systems. A nonhomogeneous boundary value problem (Example 1) has a unique 
solution, and the corresponding homogeneous problem (Example 3) has only the 
trivial solution. Further, a nonhomogeneous problem (Example 2) has either no 
solution or infinitely many, and the corresponding homogeneous problem (Example 
4) has nontrivial solutions. 

Eigenvalue Problems. Recall the matrix equation 

Ax = Ax (17) 

that we discussed in Section 7.3. Equation (17) has the solution x = 0 for every value 
of A, but for certain values of A, called eigenvalues, there are also nonzero solutions, 
called eigenvectors. The situation is similar for boundary value problems. 
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Consider the problem consisting of the differential equation 

y" + Xy = 0, (18) 

together with the boundary conditions 

y(0) = 0, y(jt) = 0. (19) 

Observe that the problem (18), (19) is the same as the problems in Examples 3 
and 4 if X = 2 and 1=1, respectively. Recalling the results of these examples, we 
note that for X — 2, Eqs. (18), (19) have only the trivial solution y = 0, while for 
1=1, the problem (18), (19) has other, nontrivial, solutions. By extension of the 
terminology associated with Eq. (17), the values of 1 for which nontrivial solutions 
of (18), (19) occur are called eigenvalues, and the nontrivial solutions themselves are 
called eigenfunctions. Restating the results of Examples 3 and 4, we have found that 
1 = 1 is an eigenvalue of the problem (18), (19) and that X = 2 is not. Further, any 
nonzero multiple of sin x is an eigenfunction corresponding to the eigenvalue 1=1. 

Let us now turn to the problem of finding other eigenvalues and eigenfunctions 
of the problem (18), (19). We need to consider separately the cases X > 0, X = 0, 
and X < 0, since the form of the solution of Eq. (18) is different in each of these 
cases. Suppose first that X > 0. To avoid the frequent appearance of radical signs, it 
is convenient to let X = /x 2 and to rewrite Eq. (18) as 

/ + n 2 y = o. (20) 

The characteristic polynomial equation for Eq. (20)isr 2 + /x 2 = 0 with roots r = ±//x, 
so the general solution is 

y = ci cos fix + C 2 sin /xx. (21) 

Note that /x is nonzero (since X > 0) and there is no loss of generality if we also 
assume that /x is positive. The first boundary condition requires that c\ = 0, and then 
the second boundary condition reduces to 

C 2 sin fjLTi = 0. (22) 

We are seeking nontrivial solutions so we must require that C 2 ^ 0. Consequently, 
sin /nr must be zero, and our task is to choose /x so that this will occur. We know that 
the sine function has the value zero at every integer multiple of n, so we can choose 
/x to be any (positive) integer. The corresponding values of X are the squares of the 
positive integers, so we have determined that 

*1 = 1, *2 = 4, * 3 = 9, ..., k„ = n 2 , ... (23) 

are eigenvalues of the problem (18), (19). The eigenfunctions are given by Eq. (21) 

with ci = 0, so they are just multiples of the functions sinnx for n = 1,2,3, 

Observe that the constant C 2 in Eq. (21) is never determined, so eigenfunctions are 
determined only up to an arbitrary multiplicative constant [just as are the eigenvec- 
tors of the matrix problem (17)]. We will usually choose the multiplicative constant 
to be 1 and write the eigenfunctions as 

yi(x) — sinx, y 2 (x) — sin2x, ..., y n (x) = sinnx, ..., (24) 

remembering that multiples of these functions are also eigenfunctions. 
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Now let us suppose that X < 0. If we let X = — q 2 , then Eq. (18) becomes 

y" - H 2 y = 0. (25) 

The characteristic equation for Eq. (25) is r 2 — q 2 = 0 with roots r = ±q, so its gen- 
eral solution can be written as 

y = ci cosh qx + ci sinh qx. (26) 

We have chosen the hyperbolic functions cosh qx and sinh qx, rather than the ex- 
ponential functions exp(qx) and exp(— qx), as a fundamental set of solutions for 
convenience in applying the boundary conditions. The first boundary condition re- 
quires that ci = 0, and then the second boundary condition gives C 2 sinh pit = 0. 
Since q ^ 0, it follows that sinh qjr / 0, and therefore we must have C 2 = 0. Con- 
sequently, y = 0 and there are no nontrivial solutions for X < 0. In other words, the 
problem (18), (19) has no negative eigenvalues. 

Finally, consider the possibility that X = 0. Then Eq. (18) becomes 

y" = o, (27) 

and its general solution is 

y = c\x + c 2 . (28) 

The boundary conditions (19) can be satisfied only by choosing ci = 0 and c 2 = 0, 
so there is only the trivial solution y = 0 in this case as well. That is, X — 0 is not an 
eigenvalue. 

To summarize our results: we have shown that the problem (18), (19) has an infinite 
sequence of positive eigenvalues X n = n 2 for n= 1,2,3, .. . and that the correspond- 
ing eigenfunctions are proportional to sin nx. Further, there are no other real eigen- 
values. There remains the possibility that there might be some complex eigenvalues; 
recall that a matrix with real elements may very well have complex eigenvalues. In 
Problem 23 we outline an argument showing that the particular problem (18), (19) 
cannot have complex eigenvalues. Later, in Section 11.2, we discuss an important 
class of boundary value problems that includes (18), (19). One of the useful proper- 
ties of this class of problems is that all their eigenvalues are real. 

In later sections of this chapter we will often encounter the problem 

y" + ~Xy = o, y(0) = 0, y(L) = 0, (29) 

which differs from the problem (18), (19) only in that the second boundary condition 
is imposed at an arbitrary point x — L rather than at x = n. The solution process 
for a > 0 is exactly the same as before up to the step where the second boundary 
condition is applied. For the problem (29) this condition requires that 

c 2 sin pL = 0 (30) 

rather than Eq. (22), as in the former case. Consequently, n L must be an integer 
multiple of jt, so q = njr/L, where n is a positive integer. Hence the eigenvalues and 
eigenfunctions of the problem (29) are given by 

X n = n 2 jr 2 /L 2 , 


y n (x) = sin(njrx/L), 


n — 1,2,3, ... . 


(31) 
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As usual, the eigenfunctions y n (x) are determined only up to an arbitrary multiplica- 
tive constant. In the same way as for the problem (18), (19), you can show that the 
problem (29) has no eigenvalues or eigenfunctions other than those in Eq. (31). 

The problems following this section explore to some extent the effect of different 
boundary conditions on the eigenvalues and eigenfunctions. A more systematic 
discussion of two-point boundary and eigenvalue problems appears in Chapter 11. 


PROBLEMS 


In each of Problems 1 through 13 either solve the given boundary value problem or else show 
that it has no solution. 


1. y"+y = 0, y(O) = 0, y'(n) = 1 2. y" + 2y = 0, 

3. y" + y = 0, y(0) = 0, y(L) = 0 4. y" + y = 0, 

5. y"+y = x, y(0) = 0, y(n)=0 6. y" + 2y = x, 

7. y" + 4_y = cosx, y(0) = 0, yin) = 0 

8. y" + 4y = sinx, y(0) = 0, y(j r) = 0 

9. y" + 4_y = cosx, y'(0) = 0, y'in) = 0 

10. y" + 3y = cosx, y'(0) = 0, y'in) = 0 

11. x 2 y" - 2xy' + 2y = 0, y(l) = -1, y( 2) = 1 

12. x 2 y” + 3 xy’ + y = ,r 2 , y( 1) = 0, y(e) = 0 

13. x 2 y" + 5 xy' + (4 + n 2 )y = Inx, yi 1) = 0, y(e) = 0 


y'(0) = 1, /(tt) = 0 

y(0) = 1, y(L) = 0 
y(0) = 0, y{n) = 0 


In each of Problems 14 through 20 find the eigenvalues and eigenfunctions of the given bound- 
ary value problem. Assume that all eigenvalues are real. 

14. y" + ;ty = 0, y(0) = 0, /(n) = 0 15. y" + Xy = 0, /(0) = 0, yin) = 0 

16. y" + Xy = 0, /(0) = 0, / in) = 0 17. y" + Xy = 0, y'(0) = 0, y(L) = 0 

18. y" + Xy = 0, y'(0) = 0, y\L) = 0 19. y" - Xy = 0, y(0) = 0, y'(L) = 0 

20. x 2 y" - xy' + )-y = 0, y(l) = 0, y(L) = 0, L>1 

21. The axially symmetric laminar flow of a viscous incompressible fluid through a long 
straight tube of circular cross section under a constant axial pressure gradient is known as 
Poiseuille 1 flow. The axial velocity w is a function of the radial variable r only and satisfies 
the boundary value problem 

1 G 

w" -\ — w' = , uiiR) = 0, wir) bounded for 0 < r < R, 

r [i 

where R is the radius of the tube, G is the pressure gradient, and /x is the coefficient of 
viscosity of the fluid. 

(a) Find the velocity profile wir). 

(b) By integrating wir) over a cross section, show that the total flow rate Q is given by 

Q = nR 4 G/8fi. 


Since Q , R , and G can be measured, this result provides a practical way to determine the 
viscosity /x. 


^ean Louis Marie Poiseuille (1797-1869) was a French physician who was also trained in mathematics 
and physics. He was particularly interested in the flow of blood and published his first paper on the subject 
in 1840. 
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(c) Suppose that R is reduced to 3/4 of its original value. What is the corresponding 
reduction in Q1 This result has implications for blood flow through arteries constricted 
by plaque. 

22. Consider a horizontal metal beam of length L subject to a vertical load/Cx) per unit length. 
The resulting vertical displacement in the beam y(x) satisfies the differential equation 


d 4 y 


where E is Young’s modulus and I is the moment of inertia of the cross section about 
an axis through the centroid perpendicular to the xy-plane. Suppose that/(x)/£7 is a 
constant k. For each of the boundary conditions given below solve for the displacement 
y(x ), and plot y versus x in the case that L = 1 and k = — 1. 

(a) Simply supported at both ends: y(0) = /'( 0) = y(L) = y"(L) = 0 

(b) Clamped at both ends: y(0) = y'(0) = y(L) = y'(L) = 0 

(c) Clamped at x = 0, free at x = L: y(0) = y'(0) = y"(L) = y'"(L) = 0 

23. In this problem we outline a proof that the eigenvalues of the boundary value problem 
(18), (19) are real. 

(a) Write the solution of Eq. (18) as y — k\ exp (i/xx) + /c 2 exp(— i/xx), where X = /x 2 , and 
impose the boundary conditions (19). Show that nontrivial solutions exist if and only if 


exp(i/x7r) — exp(— t/xjr) = 0. 


(i) 


(b) Let /x = v + ia and use Euler’s relation exppvjr) = cos(v7r) + i sin(i’^r) to determine 
the real and imaginary parts of Eq. (i). 

(c) By considering the equations found in part (b), show that v is an integer and that 
a = 0. Consequently, /x is real and so is X. 


10.2 Fourier Series 


Later in this chapter you will find that you can solve many important problems in- 
volving partial differential equations, provided that you can express a given function 
as an infinite sum of sines and/or cosines. In this and the following two sections we 
explain in detail how this can be done. These trigonometric series are called Fourier 
series 2 ; they are somewhat analogous to Taylor series in that both types of series pro- 
vide a means of expressing quite complicated functions in terms of certain familiar 
elementary functions. 


2 Fourier series are named for Joseph Fourier, who made the first systematic use, although not a completely 
rigorous investigation, of them in 1807 and 1811 in his papers on heat conduction. According to Riemann, 
when Fourier presented his first paper to the Paris Academy in 1807, stating that an arbitrary function 
could be expressed as a series of the form (1), the mathematician Lagrange was so surprised that he denied 
the possibility in the most definite terms. Although it turned out that Fourier’s claim of generality was 
somewhat too strong, his results inspired a flood of important research that has continued to the present 
day. See Grattan-Guinness or Carslaw (Historical Introduction) for a detailed history of Fourier series. 


10.2 Fourier Series 


585 


We begin with a series of the form 

OO 

cio / mnx . mnx\ 

y + 2^ \ am cos ^ 77 “ + sm — — J . (1) 

m = 1 

On the set of points where the series (1) converges, it defines a function /, whose 
value at each point is the sum of the series for that value of x. In this case the series 
(1) is said to be the Fourier series for/. Our immediate goals are to determine what 
functions can be represented as a sum of a Fourier series and to find some means of 
computing the coefficients in the series corresponding to a given function. The first 
term in the series (1) is written as ao/2 rather than simply as ao to simplify a formula 
for the coefficients that we derive below. Besides their association with the method 
of separation of variables and partial differential equations, Fourier series are also 
useful in various other ways, such as in the analysis of mechanical or electrical systems 
acted on by periodic external forces. 

Periodicity of the Sine and Cosine Functions. To discuss Fourier series, it is necessary to de- 
velop certain properties of the trigonometric functions sin(/?urx/L) and cos(mnx/L), 
where m is a positive integer. The first property is their periodic character. A function 
/ is said to be periodic with period T > 0 if the domain off contains x + T whenever 
it contains x, and if 

f(x +T)= f(x) (2) 

for every value of x. An example of a periodic function is shown in Figure 10.2.1. It 
follows immediately from the definition that if T is a period off, then 2T is also a 
period, and so indeed is any integral multiple of T. The smallest value of T for which 
Eq. (2) holds is called the fundamental period off. A constant function is a periodic 
function with an arbitrary period but no fundamental period. 


y 






X 




FIGURE 10.2.1 A periodic function. 


Iff and g are any two periodic functions with common period T, then their product 
fg and any linear combination c\f + C 2 g are also periodic with period T. To prove 
the latter statement, let Fix) = c\fix) + C 2 gix)\ then for any x 


Fix +T) = ci fix + T) + c 2 gix + T) = cifix) + c 2 g(x) = F(x). 


( 3 ) 
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Moreover, it can be shown that the sum of any finite number, or even the sum of a 
convergent infinite series, of functions of period T is also periodic with period T. 

In particular, the functions ?,m{mnx/L) and cos(mnx/L), m = 1, 2, 3, ... , are peri- 
odic with fundamental period T — 2 L/m. To see this, recall that sin x and cos x have 
fundamental period 2it and that sin ax and cos ax have fundamental period 2n /a. If 
we choose a — rrm/L , then the period T of sm{mnx/L) and cos(mnx/L) is given by 
T — 2j[L/nm = 2L/m. 

Note also that, since every positive integral multiple of a period is also a period, 
each of the functions sin(mjtx/L) and cos(mnx/L) has the common period 2 L. 


Orthogonality of the Sine and Cosine Functions. To describe a second essential property of 
the functions sin (mnx/L) and cos(>mrx/L), we generalize the concept of orthogonal- 
ity of vectors (see Section 7.2). The standard inner product (u, v) of two real-valued 
functions u and v on the interval a < x < is defined by 


L 


(u, V) = u{x)v{x) dx. 


(4) 


The functions u and v are said to be orthogonal on a < x < fi if their inner product 
is zero — that is, if 


f 


u(x)v(x) dx = 0. 


(5) 


A set of functions is said to be mutually orthogonal if each distinct pair of functions 
in the set is orthogonal. 

The functions sm(mnx/L) and cos(mnx/L), m = 1,2, . . . form a mutually orthog- 
onal set of functions on the interval —L < x < L. In fact, they satisfy the following 
orthogonality relations: 

r -L 


/ 


mnx nnx 

cos cos dx = 

L L 


-L 

j: 

/ L mnx nnx 

sin — - — sin — — dx = 
l L L 


mnx . nnx 
cos sin — — dx — 0, 


0, m ^ 7i, 

L, m — n\ 

all 77i, n; 

0, 771 ^ 77, 

L, 771 = 77. 


( 6 ) 

(7) 

( 8 ) 


These results can be obtained by direct integration. For example, to derive Eq. (8), 
note that 


£ Si 


mnx . nnx 

sm sin dx = - 

L L 2 


il 


L L 


COS 


(777 — n)nx 
~L 


— cos 


(777 + n)nx 

T 


dx 


1 L 

2 n 


sin[(7i7 — n)nx/L\ sin[(777 + n)nx/L\ 1 

77Z — 71 771 + 71 1 


-L 


= 0 


as long as m + n and m — n are not zero. Since m and n are positive, m + n ^ 0. On 
the other hand, if m — n = 0, then m = n , and the integral must be evaluated in a 
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different way. 


In this case 

-L 


L 


mnx nicx 
sin — - — sin — — ax — 

_L L L 


fj 

u 


mnx \ 2 
sin — ^ — J ax 


L 


1 — cos 


2mnx 

L 


sin(2mjtx/ L) 
2 mn/L 


= L. 


dx 

L 

-L 


This establishes Eq. (8);Eqs. (6) and (7) can be verified by similar computations. 


The Euler-Fourier Formulas. Now let us suppose that a series of the form (1) converges, 
and let us call its sum fix): 

OO 

. ao / mnx . mnx\ 

fix) = — + \ a m COS — ~j— + t>m Sin — — j . (9) 

m = 1 

The coefficients a m and b m can be related to f(x) as a consequence of the orthogonality 
conditions (6), (7), and (8). First multiply Eq. (9) by cos(wrx/L), where n is & fixed 
positive integer (n > 0), and integrate with respect to x from — L to L. Assuming 
that the integration can be legitimately carried out term by term, 3 we obtain 

/ L imx flo f L nnx C L mnx nnx 

fix) cos dx = — / cos dx + > a m I cos cos dx 

-L L 2 J_ L L ^ m J-L L L 


°° nL 

- ^ bm / si 

m = 1 — ^ 


mnx nnx 
sin —j— cos — — dx. 


(10) 


Keeping in mind that n is fixed whereas m ranges over the positive integers, it follows 
from the orthogonality relations (6) and (7) that the only nonzero term on the right 
side of Eq. (10) is the one for which m = n in the first summation. Hence 


/ '' nnx 

fix) cos dx — La n , n = 1, 2, 

l L 

To determine oq, we can integrate Eq. (9) from — L to L, obtaining 


(ID 


/ L n L 00 n 

fix) dx = — / dx + V a m / 

L 2 J ~ L m= i J - 


mnx 

cos ^ dx 


OO l 

b m I 

m = 1 — 


. m7TX 

sin dx 

L 


= Lao , 


( 12 ) 


3 This is a nontrivial assumption, since not all convergent series with variable terms can be so integrated. 
For the special case of Fourier series, however, term-by-term integration can always be justified. 
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since each integral involving a trigonometric function is zero. Thus 

a n = y f fix) cos Ax, n = 0,1,2, (13) 

L J-l L 

By writing the constant term in Eq. (9) as flo/2, it is possible to compute all the a n 
from Eq. (13). Otherwise, a separate formula would have to be used for ao- 
A similar expression for h n may be obtained by multiplying Eq. (9) by sin(wrx/L), 
integrating termwise from — L to L, and using the orthogonality relations (7) and (8); 
thus 

1 nit x 

b n = — J fix) sin —j— dx, n = 1,2,3, ... . (14) 

Equations (13) and (14) are known as the Euler-Fourier formulas for the coefficients 
in a Fourier series. Hence, if the series (9) converges to/(x), and if the series can be 
integrated term by term, then the coefficients must be given by Eqs. (13) and (14). 

Note that Eqs. (13) and (14) are explicit formulas for a n and b n in terms off, and 
that the determination of any particular coefficient is independent of all the other 
coefficients. Of course, the difficulty in evaluating the integrals in Eqs. (13) and (14) 
depends very much on the particular function / involved. 

Note also that the formulas (13) and (14) depend only on the values of f(x) in the 
interval — L < x < L. Since each of the terms in the Fourier series (9) is periodic 
with period 2 L, the series converges for all x whenever it converges in — L <x<L, 
and its sum is also a periodic function with period 2 L. Hence fix) is determined for 
all x by its values in the interval — L < x < L. 

It is possible to show (see Problem 27) that if g is periodic with period T, then every 
integral of g over an interval of length T has the same value. If we apply this result to 
the Euler-Fourier formulas (13) and (14), it follows that the interval of integration, 
— L < x < L, can be replaced, if it is more convenient to do so, by any other interval 
of length 2 L. 


EXAMPLE 

1 


Assume that there is a Fourier series converging to the function / defined by 


fix) = 



—2 < x < 0, 
0 < x < 2; 


f(x + 4) = fix). 


(15) 


Determine the coefficients in this Fourier series. 

This function represents a triangular wave (see Figure 10.2.2) and is periodic with period 4. 
Thus in this case L = 2, and the Fourier series has the form 

OO 

. ao yr — x / mnx . mnx\ 

f ix) = ~^“ + 2 _, \ a m cos — 2 ^ b m sm — 2 — ) ’ (l 6 ) 

m = 1 ~ ~ 

where the coefficients are computed from Eqs. (13) and (14) with L = 2. Substituting for/(x) 
in Eq. (13) with m = 0, we have 

If 0 1 [ 2 

ao = - I i~x) dx+ — I xdx — 1 + 1 = 2. 

2 J - 2 2 Jo 


( 17 ) 
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FIGURE 10.2.2 Triangular wave. 


For m > 0, Eq. (13) yields 


1 mnx 1 r~ mnx 

a m = — J (—x) cos — — — ax + — / x cos — - — ax. 


r 


7-2 2 2 J 0 2 

These integrals can be evaluated through integration by parts, with the result that 


1 


2 . mnx / 2 V nmx 

x sin cos 

run 2 \mn ) 2 


o 


2 mnx ( 2 \ mnx 

x sin 1- cos 

mn 2 \ mn ) 2 




+ I I cos mn + I — ) cosnijr — ^ 

mn I \ mn / \ mn J \ mn 


{mn) 


- (cos mn — 1), m = 1,2,... 


—8/(mn) 2 , m odd, 

0, m even. 

Finally, from Eq. (14) it follows in a similar way that 

b m = 0, m = 1,2,.... 


(18) 


(19) 


By substituting the coefficients from Eqs. (17), (18), and (19) in the series (16), we obtain the 
Fourier series for/: 

8 ( nx 1 3nx 1 5nx 

f(x) = 1 r- cos 1 — - cos 1 — — cos 1- • 

J n 2 \ 2 3 2 2 5 2 2 


8 cos(mnx/2) 

Tr2 2- i yy.2 


8 cos(2 n — l)nx/2 

t 2 (2 n — l) 2 


(20) 
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Let 

EXAMPLE 

2 


fix) = 


0, 

1, 

0 , 


—3 < x < — 1, 
— 1 < x < 1, 

1 < x < 3 


( 21 ) 


and suppose that/(x + 6) = /(x); see Figure 10.2.3. Find the coefficients in the Fourier series 
for/. 


1 

1 


y 

l 

i 

i 

i 



-7 

-5 

-3 

-l 

l 

3 

5 

7 f 


FIGURE 10.2.3 Graph of f(x) in Example 2. 


Note that f{x) is not assigned a value at the points of discontinuity, such as x = — 1 and 
x — 1. This has no effect on the values of the Fourier coefficients, because they result from the 
evaluation of integrals, and the value of an integral is not affected by the value of the integrand 
at a single point, or at a finite number of points. Thus the coefficients are the same regardless 
of what value, if any,/(x) is assigned at a point of discontinuity. 

Since / has period 6, it follows that L = 3 in this problem. Consequently, the Fourier series 
for / has the form 

OO 

„ an v— \ / nnx , . mzx\ 

fix) = — + 2_, (fl« cos — + b n sin — j , (22) 

n = 1 

where the coefficients a„ and b n are given by Eqs. (13) and (14) with L = 3. We have 



Similarly, 

i 

2 .rut 

= — sin — , « = 1,2, ..., 

mr 3 

-l 

and 


1 f 1 nnx 1 . nnx 

a n = I cos ax = — sin 

3 J_i 3 nn 3 


b 


n 



. nnx 1 nnx 

sin ax = cos 

3 nn 3 


l 


-l 


= 0, 


n = 1,2 


(23) 


(24) 


(25) 


Thus the Fourier series for / is 


fix) = - + J2 


2 . nn nnx 

— sin — cos 


nn 3 3 


1 V3 
3 n 


cos(2jr.r/3) cos(4^x/3) cos(5jtx/3) 
cos(jtx/ 3) + z + ■ 


(26) 
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EXAMPLE 

3 


Consider again the function in Example 1 and its Fourier series (20). Investigate the speed 
with which the series converges. In particular, determine how many terms are needed so that 
the error is no greater than 0.01 for all x. 

The «7th partial sum in this series 


n (x) - 1 2^2 


8 cos(2n — \)nx/2 


(2 n - l) 2 


(27) 


can be used to approximate the function/. The coefficients diminish as (In — l) -2 , so the series 
converges fairly rapidly. This is borne out by Figure 10.2.4, where the partial sums for m — 1 
and m = 2 are plotted. To investigate the convergence in more detail, we can consider the error 
e m (x) = f(x) — s m (x). Figure 10.2.5 shows a plot of \e6(x)\ versus x for 0 < x <2. Observe that 
|e 6 (x)| is greatest at the points x = 0 and x = 2 where the graph of f(x) has corners. It is more 
difficult for the series to approximate the function near these points, resulting in a larger error 
there for a given m. Similar graphs are obtained for other values of m. 



FIGURE 10.2.4 Partial sums in the Fourier series, Eq. (20), for the triangular wave. 



FIGURE 10.2.5 Plot of |e6(x)| versus x for the triangular wave. 

Once you realize that the maximum error always occurs at x = 0 or x = 2, you can obtain 
a uniform error bound for each m simply by evaluating \e m (x)\ at one of these points. For 
example, for m = 6 we have e 6 (2) = 0.03370, so |e 6 (x)| < 0.034 for 0 <x< 2 and consequently 
for all x. Table 10.2.1 shows corresponding data for other values of m \ these data are plotted 
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in Figure 10.2.6. From this information you can begin to estimate the number of terms that 
are needed in the series in order to achieve a given level of accuracy in the approximation. 
For example, to guarantee that \e m (x)\ < 0.01, we need to choose m = 21. 


TABLE 10.2.1 Values of the error 


e m (2) for the triangular wave 


m 

e m (2) 

2 

0.09937 

4 

0.05040 

6 

0.03370 

10 

0.02025 

15 

0.01350 

20 

0.01013 

25 

0.00810 



5 10 15 20 25 m 

FIGURE 10.2.6 Plot of e m (2) versus m for the triangular wave. 


In this book Fourier series appear mainly as a means of solving certain problems 
in partial differential equations. However, such series have much wider application 
in science and engineering and, in general, are valuable tools in the investigation of 
periodic phenomena. A basic problem is to resolve an incoming signal into its har- 
monic components, which amounts to constructing its Fourier series representation. 
In some frequency ranges the separate terms correspond to different colors or to 
different audible tones. The magnitude of the coefficient determines the amplitude 
of each component. This process is referred to as spectral analysis. 


PROBLEMS 


In each of Problems 1 through 8 determine whether the given function is periodic. If so, find 
its fundamental period. 

1. sin5x 2. cos 2 nx 3. sinh2x 4. sinnx/L 5. tanjrx 6. x 2 


1. f(x) = 


2n — 1 < x < 2n, 
2n < x < 2n + 1; 


n = 0, ±1, ±2, . . . 
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8. fix) = 


(— 1)", 2n — 1 < x < 2 n. 
1, 2 n < x < 2 n + 1; 


n = 0, ±1, ±2, . . . 


9. If f{x) = — xfor— L <x < L,andif/(x + 2L) = /(x),findaformulafor/(x)in the interval 
L < x < 2 L; in the interval —3 L < x < —2 L. 


10. If fix) = 


| x + 1, — 1 < x < 0, 

lx, 0 < x < 1, 

interval 1 < x < 2; in the interval 8 < x < 9. 


and if /(x + 2) = fix), find a formula for fix) in the 


11. If fix) = L — x for 0 < x < 2 L, and if fix + 2L) = fix), find a formula for /(x) in the 
interval — L < x < 0. 

12. Verify Eqs. (6) and (7) in this section by direct integration. 

In each of Problems 13 through 18: 

(a) Sketch the graph of the given function for three periods. 

(b) Find the Fourier series for the given function. 


14. fix) = 

15. fix) = 

16. fix) = 

17. fix) = 


18. fix) = 


— x, — L < x < L; 

/(x + 2L) = fix) 

J 1, — L < x < 0, 

jo, 0 < x < L; 

fix + 2L) = fix) 

1 x, — n < x < 0, 
jo, 0 < x < n; 

fix + 2n) = fix) 

lx + 1, —1 < x < 0, 

1 1 — x, 0<x<l; 

fix + 2) = fix) 

jx + L. —L < x < 0, 

1 L. 0 < x < L; 

fix + 2L)=f 

0, — 2 < x < — 1, 

X, — 1 < X < 1, 

0, 1 < x < 2; 

fix + 4) = /(x) 


In each of Problems 19 through 24: 

(a) Sketch the graph of the given function for three periods. 

(b) Find the Fourier series for the given function. 

(c) Plot s m ix) versus x for m = 5, 10, and 20. 

(d) Describe how the Fourier series seems to be converging. 


41 

41 

41 

41 


19. fix) = 


— 1, —2 < x < 0, 

1, 0 < x < 2; 


fix + 4) = fix) 


20. fix) = x, — 1 < x < 1; fix + 2) = f(x) 

21. fix) = x 2 12. — 2 < x < 2; fix + 4) = /(x) 


22. /(x) = 


x + 2, —2 < x < 0, 
2 — 2x, 0 < x < 2; 


/(x + 4) = f(x) 


41 


23. fix) = 



—2 < x < 0, 
0 < x < 2; 


/(x + 4) = fix) 
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-3 < x < 0, 
0 < x < 3; 


fix + 6) = fix ) 


25. Consider the function/ defined in Problem 21 and let e m (x) = fix ) — s m {x). 

(a) Plot |e„,(x)| versus x for 0 < x < 2 for several values of m. 

(b) Find the smallest value of m for which \e m (x)\ < 0.01 for all x. 

26. Consider the function / defined in Problem 24 and let e m {x) = fix) — s m ix). 

(a) Plot |e m (x)| versus x for 0 < x < 3 for several values of m. 

(b) Find the smallest value of m for which |e m (x)| <0.1 for all x. 

27. Suppose that g is an integrable periodic function with period T. 

(a) If 0 < a < T, show that 




Hint: Show 


Consider the change of variable 


j = x — T in the second integral. 

(b) Show that for any value of a , not necessarily in 0 < a < T, 



(c) Show that for any values of a and b, 



28. If / is differentiable and is periodic with period T, show that /' is also periodic with period 
T . Determine whether 


X 



fit ) dt 


is always periodic. 

29. In this problem we indicate certain similarities between three-dimensional geometric vec- 
tors and Fourier series. 

(a) Let vi, V2, and V3 be a set of mutually orthogonal vectors in three dimensions, and let 
u be any three-dimensional vector. Show that 


u = ai\i + a 2 \2 + (I3V3, 


where 


u • v, 


(ii) 


= 1,2,3. 


flj — 


Show that a, can be interpreted as the projection of u in the direction of v, divided by the 
length of \j. 

(b) Define the inner product (;<, 11) by 



L 


(hi) 
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Also let 


< p n (x ) = cos(nnx/L), n = 0,1,2, ... ; 

x[r n (x) = sm(nnx/L), n = 1,2, ... . 


Show that Eq. (10) can be written in the form 

OO OO 

( f . 072 ) = "^-(0O7 07,)T ^ ’ 77 n) (077 i > 077 ) T ^ ( 077 , - 07 , ) ■ 

m=l 7n=l 


(iv) 


(V) 


(c) Use Eq. (v) and the corresponding equation for (/, 0„), together with the orthogo- 
nality relations, to show that 


(/,0») 

(077 7 077) 


7i = 0,1,2,...; 


(A 07,) 
(07,7 0,,)’ 


71 = 1,2, ... . (vi) 


Note the resemblance between Eqs. (vi) and Eq. (ii). The functions 0„ and 0„ play a role 
for functions similar to that of the orthogonal vectors Vj, v?, and V 3 in three-dimensional 
space. The coefficients a n and b„ can be interpreted as projections of the function/' onto 
the base functions 0„ and 0„ . 

Observe also that any vector in three dimensions can be expressed as a linear combi- 
nation of three mutually orthogonal vectors. In a somewhat similar way, any sufficiently 
smooth function defined on — L < x < L can be expressed as a linear combination of the 
mutually orthogonal functions cos (nnx/L) and sin(7nrx/L), that is, as a Fourier series. 


10.3 The Fourier Convergence Theorem 


In the preceding section we showed that if the Fourier series 

OO 

«o / mTTX . m7TX\ 

y + 2^ \ am COS + b,n sm “ ) V 1 ) 

m = 1 

converges and thereby defines a function /, then / is periodic with period 2 L, and 
the coefficients a m and b m are related to f(x) by the Euler-Fourier formulas: 

1 f L mTTX 

a m = — J f(x) cos — — dx, m = 0, 1, 2, ... ; (2) 


bm ~ L f_ 


mnx 

f{x) sm — — dx, 

L L 


m = 1,2, ... . 


( 3 ) 


In this section we suppose that a function / is given. If this function is periodic 
with period 2 L and integrable on the interval [— L,L], then a set of coefficients a m 
and b m can be computed from Eqs. (2) and (3), and a series of the form (1) can 
be formally constructed. The question is whether this series converges for each 
value of x and, if so, whether its sum is fix). Examples have been discovered 
showing that the Fourier series corresponding to a function / may not converge 
to fix) or may even diverge. Functions whose Fourier series do not converge to 
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the value of the function at isolated points are easily constructed, and examples 
will be presented later in this section. Functions whose Fourier series diverge at 
one or more points are more pathological, and we will not consider them in this 
book. 

To guarantee convergence of a Fourier series to the function from which its coef- 
ficients were computed, it is essential to place additional conditions on the function. 
From a practical point of view, such conditions should be broad enough to cover 
all situations of interest, yet simple enough to be easily checked for particular func- 
tions. Through the years several sets of conditions have been devised to serve this 
purpose. 

Before stating a convergence theorem for Fourier series, we define a term that 
appears in the theorem. A function / is said to be piecewise continuous on an 
interval a < x < b if the interval can be partitioned by a finite number of points 
a — x o < < • • • < x n — b so that 

1. / is continuous on each open subinterval x t _\ < x < x t . 

2. / approaches a finite limit as the endpoints of each subinterval are approached from within 
the subinterval. 

The graph of a piecewise continuous function is shown in Figure 10.3.1. 



FIGURE 10.3.1 A piecewise continuous function. 


The notation /(c+) is used to denote the limit of f(x) as x —>■ c from the right; 
similarly, /(c— ) denotes the limit of f(x) as x approaches c from the left. 

Note that it is not essential that the function even be defined at the partition points 
x t . For example, in the following theorem we assume that f is piecewise continuous; 
but certainly /' does not exist at those points where / itself is discontinuous. It is also 
not essential that the interval be closed; it may also be open, or open at one end and 
closed at the other. 


Theorem 10.3.1 Suppose that / and f are piecewise continuous on the interval —L < x < L. Fur- 
ther, suppose that / is defined outside the interval — L < x < L so that it is periodic 
with period 2 L. Then / has a Fourier series 

OO 

. ao / mnx . mnx\ 

fix) = — + [ a m cos — — + b m sin — — ) , (4) 

m = 1 

whose coefficients are given by Eqs. (2) and (3). The Fourier series converges to 
f(x ) at all points where/ is continuous, and to [f(x+) + f(x—)]/2 at all points where 
/ is discontinuous. 
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Note that [f(x + ) +f(x — )]/2 is the mean value of the right- and left-hand limits 
at the point x. At any point where / is continuous, f(x+) = f(x—) = f(x). Thus it 
is correct to say that the Fourier series converges to \f(x+) +/(*—)]/ 2 at all points. 
Whenever we say that a Fourier series converges to a function /, we always mean 
that it converges in this sense. 

It should be emphasized that the conditions given in this theorem are only sufficient 
for the convergence of a Fourier series; they are by no means necessary. Nor are they 
the most general sufficient conditions that have been discovered. In spite of this, the 
proof of the theorem is fairly difficult and we do not discuss it here. 4 Under more 
restrictive conditions a much simpler convergence proof is possible; see Problem 18. 

To obtain a better understanding of the content of the theorem, it is helpful to con- 
sider some classes of functions that fail to satisfy the assumed conditions. Functions 
that are not included in the theorem are primarily those with infinite discontinuities 
in the interval [— L, L], such as 1/x 2 as x — * 0, or In \x — L\ asi^- L. Functions 
having an infinite number of jump discontinuities in this interval are also excluded; 
however, such functions are rarely encountered. 

It is noteworthy that a Fourier series may converge to a sum that is not differ- 
entiable, or even continuous, in spite of the fact that each term in the series (4) is 
continuous, and even differentiable infinitely many times. The example below is an 
illustration of this, as is Example 2 in Section 10.2. 


Let 

EXAMPLE 

1 



— L < x < 0, 
0 < x < L, 


(5) 


and let / be defined outside this interval so that/(x + 2 L) — f(x) for allx. We will temporarily 
leave open the definition of/ at the points x = 0, ±L. Find the Fourier series for this function 
and determine where it converges. 




y 

L 






1 I 


i 

1 

1 


-3 L 

- 2 L -L 


L 

2 L 

3 L 

X 


FIGURE 10.3.2 Square wave. 


The equation y = f(x) has the graph shown in Figure 10.3.2, extended to infinity in both 
directions. It can be thought of as representing a square wave. The interval [— L,L\ can 
be partitioned to give the two open subintervals (— L,0) and (0, L). In (0 , L),/(x) = L and 
f'{x) = 0. Clearly, both/ and/' are continuous and furthermore have limits asx — *■ 0 from the 
right and as x ^ L from the left. The situation in (— L, 0) is similar. Consequently, both / and 


4 Proofs of the convergence of a Fourier series can be found in most books on advanced calculus. See, for 
example, Kaplan (Chapter 7) or Buck (Chapter 6). 
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f are piecewise continuous on [— L,L), so/ satisfies the conditions ofTheorem 10.3.1. II the 
coefficients a m and b m are computed from Eqs. (2) and (3), the convergence of the resulting 
Fourier series to f(x) is ensured at all points where / is continuous. Note that the values of 
a m and b m are the same regardless of the definition of / at its points of discontinuity. This is 
true because the value of an integral is unaffected by changing the value of the integrand at a 
finite number of points. From Eq. (2), 


1 f L f L 

a 0 = — I f(x) dx = / dx = L; 

L J-L Jo 


-U 


f(x) cos 


-L L 

= 0, m / 0. 




. mnx 
dx = I cos j— dx 


Similarly, from Eq. (3), 


L 


L 


b m = y I fix) sin dx = / sin 


L 


[ L ■ 

/ sir 

Jo 


L 


■ dx 


(1 — cosmn) 

rmt 


0, m even; 

2L / tniz , m odd. 


Hence 


fix) = y + 

2 7T 


L 2 L ( . 7i x 1 . 3 nx 1 . 5 tzx 


sin 1 — sm 1 — sin b • 

L 3 L 5 L 


L 2 L sm(mnx/L) 

9 t r ‘ J rn 


L 2 L sin(2n — 1 )ttx/L 

~ 1 TT £—1 


2 n - 1 


( 6 ) 


At the points x = 0, ±nL , where the function/ in the example is not continuous, all terms in 
the series after the first vanish and the sum is L/2. This is the mean value of the limits from the 
right and left, as it should be. Thus we might as well define / at these points to have the value 
L/2. If we choose to define it otherwise, the series still gives the value L/2 at these points, 
since all of the preceding calculations remain valid. The series simply does not converge to 
the function at those points unless / is defined to have the value L/2. This illustrates the 
possibility that the Fourier series corresponding to a function may not converge to it at points 
of discontinuity unless the function is suitably defined at such points. 

The manner in which the partial sums 


L 2 L 

s„(x) = — 4 

2 71 



1 (2 n — 1 )ttx 

■ sm 


2 n - 1 


L 


n = 1,2,... 


of the Fourier series (6) converge to f(x) is indicated in Figure 10.3.3, where L has been chosen 
to be 1 and the graph of ssH) is plotted. The figure suggests that at points where / is continu- 
ous the partial sums do approach/(x) as n increases. However, in the neighborhood of points of 
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discontinuity, such as x = 0 and x = L, the partial sums do not converge smoothly to the mean 
value. Instead they tend to overshoot the mark at each end of the jump, as though they cannot 
quite accommodate themselves to the sharp turn required at this point. This behavior is typical 
of Fourier series at points of discontinuity and is known as the Gibbs 5 phenomenon. 



FIGURE 10.3.3 


The partial sum ssG) in the Fourier series, Eq. (6), for the square wave. 



0.2 0.4 0.6 0.8 1 x 

FIGURE 10.3.4 A plot of the error \e&(x)\ versus x for the square wave. 


Additional insight is attained by considering the error e n (x) = f(x) — s n {x). Figure 10.3.4 
shows a plot of \e„(x)\ versus x for n = 8 and for L = 1. The least upper bound of \e&(x)\ is 0.5 
and is approached as x — > 0 and as x — > 1. As n increases, the error decreases in the interior 
of the interval [where /(x) is continuous], but the least upper bound does not diminish with 
increasing n. Thus we cannot uniformly reduce the error throughout the interval by increasing 
the number of terms. 

Figures 10.3.3 and 10.3.4 also show that the series in this example converges more slowly 
than the one in Example 1 in Section 10.2. This is due to the fact that the coefficients in the 
series (6) are proportional only to 1/(2 n — 1). 


5 The Gibbs phenomenon is named after Josiah Willard Gibbs (1839-1903), who is better known for his 
work on vector analysis and statistical mechanics. Gibbs was professor of mathematical physics at Yale 
and one of the first American scientists to achieve an international reputation. The Gibbs phenomenon 
is discussed in more detail by Carslaw (Chapter 9). 
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PROBLEMS In each of Problems 1 through 6 assume that the given function is periodically extended outside 

the original interval. 

(a) Find the Fourier series for the extended function. 

(b) Sketch the graph of the function to which the series converges for three periods. 


1. fix) = 

-1, — 1 < x < 0, 

1, 0 < x < 1 

2. fix) = ■ 

0, 

x, 

-71 < X < 0, 

0 < x < jr 

3. fix) = 

L + x, — L < x < 0, 

L-x, 0 < x < L 

4. fix) = l-x 2 

-1 <x < 1 

5. fix) = 

0, — n < x < — jr/2, 

1, — 7r/2 < x < jr/2, 

0, tc/2 < X < 71 

6. fix) = 

0, 

* 2 , 

-1 < x < 0, 

0 < x < 1 


In each of Problems 7 through 12 assume that the given function is periodically extended 
outside the original interval. 

(a) Find the Fourier series for the given function. 

(b) Let e n (x) = fix) — s n (x). Find the least upper bound or the maximum value (if it exists) 
of |e„(x)| for n = 10, 20, and 40. 

(c) If possible, find the smallest n for which |e„(x)| < 0.01 for all x. 


7. f(x) = I ' f{x + 2 tc) = f(x ) (see Section 10.2, Problem 15) 

0, 0 < x < n; 

8. fix) = I X — ’ fix + 2) = fix) (see Section 10.2, Problem 16) 

1 — x, 0 < x < 1; 

9. fix) = x, — 1 < x < 1; fix + 2) = fix) (see Section 10.2, Problem 20) 

10. /(*)= +’ - — fix + 4) = fix) (see Section 10.2, Problem 22) 

•o [o, — 1 < x < 0, 

4fL 11. fix) = \ 2 fix + 2) = fix) (see Problem 6) 

I x, 0 < x < 1; 

12. fix) = x — x 3 , — 1 < x < 1; fix + 2) = fix) 

Periodic Forcing Terms. In this chapter we are concerned mainly with the use of Fourier series 
to solve boundary value problems for certain partial differential equations. However, Fourier 
series are also useful in many other situations where periodic phenomena occur. Problems 13 
through 16 indicate how they can be employed to solve initial value problems with periodic 
forcing terms. 

13. Find the solution of the initial value problem 

y" + ary = sinnf, y(0) = 0, /(0) = 0, 


where n is a positive integer and co 2 n 1 . What happens if ai 2 = » 2 ? 

14. Find the formal solution of the initial value problem 


OO 

y" + co 2 y = ^ b n sin nt, y(0) = 0, /(0) = 0, 

n = 1 
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where u> > 0 is not equal to a positive integer. How is the solution altered if a> = m, where 
m is a positive integer? 

15. Find the formal solution of the initial value problem 


y" + a> 2 y = f{t), y(0) = 0, /(()) = 0, 


where / is periodic with period 2n and 


f(t) = 


1 , 

0 , 

- 1 , 


0 < t < jt; 
t = 0,n,2n; 
Tt < t < 2n. 


See Problem 1. 

16. Find the formal solution of the initial value problem 


y" + co 2 y = f(t), y(0) = 1, >-'(0) = 0, 


where / is periodic with period 2 and 


fit) = 



0 < t < 1; 

1 < t < 2. 


See Problem 8. 

17. Assuming that 


. CIq 

m= 2 + 


oo 


nnx 

cos — ; 1- o„ sin 


L 


nnx\ 
~L~ ) ’ 


(i) 


show formally that 


j J dx = y + Y (a 2 n + b 2 n ). 


This relation between a function / and its Fourier coefficients is known as Parseval’s 6 
equation. This relation is very important in the theory of Fourier series; see Problem 9 in 
Section 11.6. 

Hint: Multiply Eq. (i) by/(x), integrate from —LtoL, and use the Euler-Fourier formulas. 


6 Marc-Antoine Parseval (1755-1836) was a relatively obscure French mathematician for whom an impor- 
tant result has been named. He presented a forerunner of this result in 1799, though not in the context of 
Fourier series. 
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18. This problem indicates a proof of convergence of a Fourier series under conditions more 
restrictive than those in Theorem 10.3.1. 

(a) If / and/' are piecewise continuous on — L < x < L, and if / is periodic with period 
2 L, show that na„ and nb„ are bounded as n —* oo. 

Hint: Use integration by parts. 

(b) If / is continuous on — L <x<L and periodic with period 2 L, and if/' and/" are 
piecewise continuous on — L < x < L, show that n 2 a„ and n 2 b n are bounded as n — > oo. If 
/ is continuous on the closed interval, then it is continuous for allx. Why is this important? 
Hint: Again, use integration by parts. 

OO OO 

(c) Using the result of part (b), show that ^ \a„\ and ^ \b„\ converge. 

n = 1 n = 1 

(d) From the result in part (c), show that the Fourier series (4) converges absolutely 7 for 
all x. 


Acceleration of Convergence. In the next problem we show how it is sometimes possible to 
improve the speed of convergence of a Fourier series. 

19. Suppose that we wish to calculate values of the function g, where 

8(x) = £ i + ( 2n -1)2 Sin(2 ” _ l)TCX ■ {i) 


It is possible to show that this series converges, albeit rather slowly. However, observe that 
for large n the terms in the series (i) are approximately equal to [sin(2« — \)nx\/{2n — 1) 
and that the latter terms are similar to those in the example in the text, Eq. (6). 

(a) Show that 

OO 

Y [sin(2n - l)nx\/{2n - 1) = (;r/2) [/(*) - \\ , (ii) 

n = 1 

where / is the square wave in the example with L = 1. 

(b) Subtract Eq. (ii) from Eq. (i) and show that 


g(x) = y [/(*) 


§]-E 


sin(2» — 1 )nx 
(2n - 1)[1 + (2 n - l) 2 ] ' 


(hi) 


The series (iii) converges much faster than the series (i) and thus provides a better way to 
calculate values of g(x). 


10.4 Even and Odd Functions 


Before looking at further examples of Fourier series, it is useful to distinguish two 
classes of functions for which the Euler-Fourier formulas can be simplified. These 
are even and odd functions, which are characterized geometrically by the property of 
symmetry with respect to the v-axis and the origin, respectively (see Figure 10.4.1). 


7 It also converges uniformly; for an explanation of what this means, see a book on advanced calculus or 
analysis. 
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y 



X 


(a) ( b ) 

FIGURE 10.4.1 (a) An even function, (b) An odd function. 


Analytically, / is an even function if its domain contains the point — x whenever it 
contains the point x, and if 

/(-*)=/(*) ( 1 ) 

for each x in the domain of /. Similarly, / is an odd function if its domain contains 
—x whenever it contains x , and if 


/(-*) = -/(*) ( 2 ) 

for each x in the domain of /. Examples of even functions are 1, x 2 , cos nx, |jc|, 
and x 2n . The functions x, x 3 , sin nx, and x 2n+ 1 are examples of odd functions. Note 
that according to Eq. (2), /( 0) must be zero if / is an odd function whose domain 
contains the origin. Most functions are neither even nor odd, for instance, e x . Only 
one function, / identically zero, is both even and odd. 

Elementary properties of even and odd functions include the following: 

1. The sum (difference) and product (quotient) of two even functions are even. 

2. The sum (difference) of two odd functions is odd; the product (quotient) of two odd 
functions is even. 

3. The sum (difference) of an odd function and an even function is neither even nor odd; the 
product (quotient) of two such functions is odd. 8 

The proofs of all these assertions are simple and follow directly from the definitions. 
For example, if both/i and/2 are odd, and if g(x) = / (x) + f 2 (x), then 

g(~x) +fi(~x ) = -/iW 

= — IfiW +/>(*)] = -g(x), ( 3 ) 

so/i + f 2 is an odd function also. Similarly, if li(x) = f) (x)f 2 (x). then 

h(-x) = /1 ( x)f 2 ( x) = [-fi(x)][-f 2 (x)] = f\ (x)f 2 (x) = h(x), ( 4 ) 

so that /1/2 is even. 

Also of importance are the following two integral properties of even and odd 
functions: 


8 These statements may need to be modified if either function vanishes identically. 
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4. If / is an even function, then 



dx = 2 / f{x ) dx. 

Jo 


(5) 


5. If / is an odd function, then 



dx = 0 . 


( 6 ) 


These properties are intuitively clear from the interpretation of an integral in terms 
of area under a curve, and they also follow immediately from the definitions. For 
example, if / is even, then 





dx. 


Letting x = — 5 in the first term on the right side and using Eq. (1), we obtain 




dx — 2 f fix) 

Jo 


dx. 


The proof of the corresponding property for odd functions is similar. 

Even and odd functions are particularly important in applications of Fourier se- 
ries since their Fourier series have special forms, which occur frequently in physical 
problems. 


Cosine Series. Suppose that / and f are piecewise continuous on — L <x<L and 
that / is an even periodic function with period 2 L. Then it follows from properties 1 
and 3 that/(x) cos(«jrx/L) is even and/(x) sin(/7jrx/L) is odd. As a consequence of 
Eqs. (5) and (6), the Fourier coefficients of / are then given by 


2 f L njtx 

a n = — fix ) cos — — dx, 

L Jo L 

bn = 0, n = 1,2, ... . 


n = 0, 1,2, ... ; 


(7) 


Thus / has the Fourier series 


OO 

n, x 00 , n7tX 

fix ) = — + 2 _a n cos — . 

n = 1 

In other words, the Fourier series of any even function consists only of the even 
trigonometric functions cosinirx/L) and the constant term; it is natural to call such a 
series a Fourier cosine series. From a computational point of view, observe that only 
the coefficients a„,for n = 0, 1,2, ... , need to be calculated from the integral formula 
(7). Each of the b n , for n — 1,2, . . . , is automatically zero for any even function and 
so does not need to be calculated by integration. 

Sine Series. Suppose that / and/' are piecewise continuous on — L < x < L and that 
/ is an odd periodic function of period 2 L. Then it follows from properties 2 and 3 
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1 


that f(x) cos(httx/L) is odd and f(x) sin(njtx/L) is even. In this case the Fourier 
coefficients of / are 


a n — 0, n = 0, 1,2, , 

2 C L . rutx 

b n = y / /W sm dx > n = 1,2,..., 

L Jo L 

and the Fourier series for / is of the form 


(8) 


OO 

. v-v , . mtx 

fix ) = 2_^b n sm— . 

n = 1 

Thus the Fourier series for any odd function consists only of the odd trigonometric 
functions sm(njrx/L); such a series is called a Fourier sine series. Again observe that 
only half of the coefficients need to be calculated by integration, since each a n , for 
n = 0, 1, 2, . . . , is zero for any odd function. 


Let/(x) = x, —L < x < L, and let /(— L) = f(L) = 0. Let / be defined elsewhere so that it is 
periodic of period 2 L (see Figure 10.4.2). The function defined in this manner is known as a 
sawtooth wave. Find the Fourier series for this function. 


y 

L 

i 1/ 1 

/ 1 / 

-3L -2L -L 

-L 

r 

a 

a 

i 


FIGURE 10.4.2 Sawtooth wave. 


Since / is an odd function, its Fourier coefficients are, according to Eq. (8), 
a n = 0, n = 0, 1,2, ; 

mtx 


>-u 


x sin —j— dx 


I . YlTtX 

J cin 

mtx 

/17TX 

pns y 

\ Jill 

1 L 

~ ~7r 

L 1 


= -( — 
L \nn 


= — (-1) B+1 , 17 = 1,2,.... 

17 jr 

Hence the Fourier series for/, the sawtooth wave, is 


2 L 


(-D 


,n+ 1 


fix ) = — > sin — — . 

7 T L ' F7 L 


( 9 ) 
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Observe that the periodic function / is discontinuous at the points ±L, ±3 L, . . . , as shown in 
Figure 10.4.2. At these points the series (9) converges to the mean value of the left and right 
limits, namely, zero. The partial sum of the series (9) for n = 9 is shown in Figure 10.4.3. The 
Gibbs phenomenon (mentioned in Section 10.3) again occurs near the points of discontinuity. 



FIGURE 10.4.3 A partial sum in the Fourier series, Eq. (9), for the sawtooth wave. 


Note that in this example /(—L) = f(L ) = 0, as well as/(0) = 0. This is required 
if the function / is to be both odd and periodic with period 2 L. When we speak of 
constructing a sine series for a function defined on 0 < x < L, it is understood that, 
if necessary, we must first redefine the function to be zero at x = 0 and x = L. 

It is worthwhile to observe that the triangular wave function (Example 1 of Section 
10.2) and the sawtooth wave function just considered are identical on the interval 
0 < x < L. Therefore, their Fourier series converge to the same function, /(x) = x, 
on this interval. Thus, if it is required to represent the function/(x) = x on 0 < x < L 
by a Fourier series, it is possible to do this by either a cosine series or a sine series. In 
the former case / is extended as an even function into the interval — L < x < 0 and 
elsewhere periodically (the triangular wave). In the latter case / is extended into 
— L < x < 0 as an odd function and elsewhere periodically (the sawtooth wave). If 
/ is extended in any other way, the resulting Fourier series will still converge to x in 
0 < x < L but will involve both sine and cosine terms. 

In solving problems in differential equations, it is often useful to expand in a 
Fourier series of period 2 L a function / originally defined only on the interval [0, L]. 
As indicated previously for the function /(x) = x, several alternatives are available. 
Explicitly, we can 

1. Define a function g of period 2 L so that 


g(x) = 


}/(-*), 


0 < x < L, 
—L < x < 0. 


(10) 


The function g is thus the even periodic extension of/. Its Fourier series, which is a cosine 
series, represents / on [0, L] . 
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EXAMPLE 

2 


2. Define a function h of period 2 L so that 


h(x) = 


fix), 

0, 

~f(~x), 


0 < x < L, 
x m 0, L, 
—L < x < 0. 


( 11 ) 


The function h is thus the odd periodic extension of /. Its Fourier series, which is a sine 
series, also represents / on (0, L). 

3. Define a function k of period 2 L so that 


k(x ) = f(x), 0 < x < L, 


( 12 ) 


and let k(x ) be defined for (— L,0) in any way consistent with the conditions of Theorem 
10.3.1. Sometimes it is convenient to define k(x) to be zero for — L < x < 0. The Fourier 
series for k, which involves both sine and cosine terms, also represents/ on [0, L] , regardless 
of the manner in which k(x) is defined in (— L, 0). Thus there are infinitely many such series, 
all of which converge to f(x) in the original interval. 

Usually, the form of the expansion to be used will be dictated (or at least suggested) 
by the purpose for which it is needed. However, if there is a choice as to the kind 
of Fourier series to be used, the selection can sometimes be based on the rapidity 
of convergence. For example, the cosine series for the triangular wave [Eq. (20) 
of Section 10.2] converges more rapidly than the sine series for the sawtooth wave 
[Eq. (9) in this section], although both converge to the same function for 0 < x < L. 
This is because the triangular wave is a smoother function than the sawtooth wave 
and is therefore easier to approximate. In general, the more continuous derivatives 
possessed by a function over the entire interval — oo < x < oo, the faster its Fourier 
series will converge. See Problem 18 of Section 10.3. 


Suppose that 


fix) = 


1 — x, 
0 


0 < x < 1, 

1 < x < 2. 


(13) 


As indicated previously, we can represent / either by a cosine series or by a sine series. Sketch 
the graph of the sum of each of these series for — 6 < x < 6. 

In this example L = 2, so the cosine series for / converges to the even periodic extension 
of / of period 4, whose graph is sketched in Figure 10.4.4. 


y 

l 

i ill i 

i i i i i 

-6 -4 -2 

-1 

2 4 6 x 


FIGURE 10.4.4 Even periodic extension of / (x ) given by Eq. (13). 


Similarly, the sine series for / converges to the odd periodic extension of / of period 4. The 
graph of this function is shown in Figure 10.4.5. 
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FIGURE 10.4.5 Odd periodic extension of fix) given by Eq. (13). 


PROBLEMS 


In each of Problems 1 through 6 determine whether the given function is even, odd, or neither. 

1. x 3 — 2x 2. x 3 — 2x + 1 

3. tan2x 4. secx 

5. |x| 3 6. e~ z 


In each of Problems 7 through 12 a function/ is given on an interval of length L. In each case 
sketch the graphs of the even and odd extensions of / of period 2 L. 


1. fix) = 


0 < x < 2, 
2 < x < 3 


9. f{x) = 2 — x, 0 < x < 2 


11. f(x ) = 


I”: 


0 < x < 1, 

1 < x < 2 


8. fix) = 


I 0 ’ 

\x-l, 


10. fix) = x — 3, 


0 < x < 1, 

1 < x < 2 

0 < x < 4 


12. /(x) = 4 — x * 2 , 0<x<l 


13. Prove that any function can be expressed as the sum of two other functions, one of which is 
even and the other odd. That is, for any function/, whose domain contains — x whenever 
it contains x, show that there are an even function g and an odd function h such that 
fix) = g(x) + hix). 

Hint: What can you say about fix) + /(— x)? 

14. Find the coefficients in the cosine and sine series described in Example 2. 


In each of Problems 15 through 22: 

(a) Find the required Fourier series for the given function. 

(b) Sketch the graph of the function to which the series converges over three periods. 


15. fix) = 


cosine series, period 4 


1, 0 < x < 1, 

0, 1 < x < 2; 

Compare with Example 1 and Problem 5 of Section 10.3. 


16. fix) = jy 

17. fix) = 1, 

18. fix) = 1, 

0 , 
1 , 
2 , 


0 < x < 1, 

sine series, period 4 

1 < x < 2; 

0 < x < jr; cosine series, period 2 n 

0 < x < 7r; sine series, period 2n 

0 < x < 7T, 

n < x <2n, 

2n < x < 3n; 


19. fix) = 


sine series, period 6n 



10.4 Even and Odd Functions 


609 


20. fix) = x, 0 < x < 1; series of period 1 

21. fix) = L — x, 0 <x<L\ cosine series, period 2 L 

Compare with Example 1 of Section 10.2. 

22. f{x) = L — x, 0 < x < L\ sine series, period 2 L 


41 

41 

41 

41 


In each of Problems 23 through 26: 

(a) Find the required Fourier series for the given function. 

(b) Sketch the graph of the function to which the series converges for three periods. 

(c) Plot one or more partial sums of the series. 


23. 

f(x ) = | 

\x, 

lo, 

0 < X < 71. 
tc < x < 2 tc; 

cosine series, period 4 n 

24. 

fix ) = - 

-X, 

—7i < x < 0; 

sine series, period 2n 

25. 

fix ) = 2 

> j^2 

0 < x < 2; 

sine series, period 4 

26. 

fix ) = 

~ — 2x, 

0 < x < 4; 

cosine series, period 8 


In each of Problems 27 through 30 a function is given on an interval 0 < x < L. 

(a) Sketch the graphs of the even extension g(x) and the odd extension h(x) of the given 
function of period 2 L over three periods. 

(b) Find the Fourier cosine and sine series for the given function. 

(c) Plot a few partial sums of each series. 

(d) For each series investigate the dependence on n of the maximum error on [0, L\. 


41 

27. 

II 

U) 

1 

* 

0 < x < 3 

41 

28. 

fix) jo! 

0 < x < 1, 

1 < x < 2 

41 

29. 

fix) = (4x 2 — 4x — 3)/4, 0 < x < 2 

41 

30. 

fix) = x 3 — 5x 

1 + 5jc + 1, 0 < x < : 


31. 

Prove that if/ 

is an odd function, then 



dx = 0. 


32. Prove properties 2 and 3 of even and odd functions, as stated in the text. 

33. Prove that the derivative of an even function is odd and that the derivative of an odd 
function is even. 


34. Let F(x ) = f fit ) dt. Show that if / is even, then F is odd, and that if / is odd, then F is 

Jo 

even. 

35. From the Fourier series for the square wave in Example 1 of Section 10.3, show that 



OO 


E 

n= 0 


(- 1 )" 
2n + 1 


This relation between n and the odd positive integers was discovered by Leibniz in 1674. 

36. From the Fourier series for the triangular wave (Example 1 of Section 10.2), show that 


n 2 1 1 

-- 1+ ^ + ^ + 


OO 


E 

n=0 


1 

(In + 1 ) 2 ' 


8 
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37. Assume that / has a Fourier sine series 


OO 

f(x) = E b n sin(njtx/L ), 0 < x < L. 

n = 1 


(a) Show formally that 

9 rL _°° 

J | f{x)] 2 dx=J2 b2 rr 

L J ° n = 1 


Compare this result (Parseval’s equation) with that of Problem 17 in Section 10.3. What 
is the corresponding result if / has a cosine series? 

(b) Apply the result of part (a) to the series for the sawtooth wave given in Eq. (9), and 
thereby show that 


2 -ii oo 1 

7T 1 1 ^— \ 1 

— 2 2 3 2 ~ 2-, „2' 


This relation was discovered by Euler about 1735. 

More Specialized Fourier Series. Let / be a function originally defined on 0 < x < L and 
satisfying there the continuity conditions of Theorem 10.3.1. In this section we have shown 
that it is possible to represent / by either a sine series or a cosine series by constructing odd 
or even periodic extensions of /, respectively. Problems 38 through 40 concern some other, 
more specialized Fourier series that converge to the given function / on (0, L). 

38. Let / be extended into ( L , 2 L\ in an arbitrary manner. Then extend the resulting function 
into (— 2L,0) as an odd function and elsewhere as a periodic function of period 4L (see 
Figure 10.4.6). Show that this function has a Fourier sine series in terms of the functions 
&m(nnx/2L),n = 1,2,3, ... ; that is, 


OO 

fix) = E b n sm(nnx/2L), 

n = l 


where 


1 f 2L 

b n = — / f(x) sin(rcjrx/2L) dx. 

L Jo 

This series converges to the original function on (0, L). 

39. Let / first be extended into (L, 2 L) so that it is symmetric about x = L; that is, so as 
to satisfy f(2L — x) = f(x) for 0 < x < L. Let the resulting function be extended into 
(—2 L, 0) as an odd function and elsewhere as a periodic function of period 4L (see Figure 
10.4.7). Show that this function has a Fourier series in terms of the functions sin(7rx/2L), 
sin(37rx/2L), sin(5jrx/2L), . . . ; that is, 


OO 

f{x) = ^2,b n sin 

K = 1 


(2 n — 1 )nx 
2 L 


where 


b 


n 



(2 n — l)nx 
2 1 


dx. 


This series converges to the original function on (0, L]. 
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FIGURE 10.4.6 Graph of the 
function in Problem 38. 
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FIGURE 10.4.7 Graph of the 
function in Problem 39. 


40. (a) How should /, originally defined on [0, L], be extended so as to obtain a Fourier 
series involving only the functions cos(nx/2L), cos(3nx/2L), cos(5nx/2L ), . . .? Refer to 
Problems 38 and 39. 

(b) If/(X) = x for 0 < x < L, sketch the function to which the Fourier series converges 
for — 4L < x < 4L. 


10.5 Separation of Variables; Heat Conduction in a Rod 


The basic partial differential equations of heat conduction, wave propagation, and 
potential theory that we discuss in this chapter are associated with three distinct 
types of physical phenomena: diffusive processes, oscillatory processes, and time- 
independent or steady processes. Consequently, they are of fundamental impor- 
tance in many branches of physics. They are also of considerable significance from 
a mathematical point of view. The partial differential equations whose theory is 
best developed and whose applications are most significant and varied are the linear 
equations of second order. All such equations can be classified into one of three cate- 
gories: the heat conduction equation, the wave equation, and the potential equation, 
respectively, are prototypes of each of these categories. Thus a study of these three 
equations yields much information about more general second order linear partial 
differential equations. 

During the last two centuries several methods have been developed for solving 
partial differential equations. The method of separation of variables is the oldest 
systematic method, having been used by D’Alembert, Daniel Bernoulli, and Euler 
about 1750 in their investigations of waves and vibrations. It has been considerably 
refined and generalized in the meantime, and it remains a method of great importance 
and frequent use today. To show how the method of separation of variables works, we 
consider first a basic problem of heat conduction in a solid body. The mathematical 
study of heat conduction originated 9 about 1800, and it continues to command the 


9 The first important investigation of heat conduction was carried out by Joseph Fourier (1768-1830) while 
he was serving as prefect of the department of Isere (Grenoble) from 1801 to 1815. He presented basic 
papers on the subject to the Academy of Sciences of Paris in 1807 and 1811. However, these papers were 
criticized by the referees (principally Lagrange) for lack of rigor and so were not published. Fourier 
continued to develop his ideas and eventually wrote one of the classics of applied mathematics, Theorie 
analytique de la chaleur, published in 1822. 
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attention of modern scientists. For example, analysis of the dissipation and transfer 
of heat away from its sources in high-speed machinery is frequently an important 
technological problem. 

Let us now consider a heat conduction problem for a straight bar of uniform cross 
section and homogeneous material. Let the x-axis be chosen to he along the axis 
of the bar, and let x = 0 and x = L denote the ends of the bar (see Figure 10.5.1). 
Suppose further that the sides of the bar are perfectly insulated so that no heat passes 
through them. We also assume that the cross-sectional dimensions are so small that 
the temperature u can be considered constant on any given cross section. Then u is 
a function only of the axial coordinate x and the time t. 


u{x, t) 



x = 0 x = L 

FIGURE 10.5.1 A heat-conducting solid bar. 


The variation of temperature in the bar is governed by a partial differential equa- 
tion whose derivation appears in Appendix A at the end of this chapter. The equation 
is called the heat conduction equation and has the form 

of — Uf , 0 x L , t ■■■'" 0 , (1) 

where a 2 is a constant known as the thermal diffusivity. The parameter a 2 depends 
only on the material from which the bar is made and is defined by 

a 2 = ic/ps, (2) 

where k is the thermal conductivity, p is the density, and s is the specific heat of the 
material in the bar. The units of a 2 are (length) 2 /time. Typical values of a 2 are given 
in Table 10.5.1. 


TABLE 10.5.1 Values of the Thermal 
Diffusivity for Some Common 
Materials 


Material 

a 2 (cm 2 /s) 

Silver 

1.71 

Copper 

1.14 

Aluminum 

0.86 

Cast iron 

0.12 

Granite 

0.011 

Brick 

0.0038 

Water 

0.00144 
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In addition, we assume that the initial temperature distribution in the bar is given; 
thus 

u(x. 0) = fix), 0 < x < L, (3) 

where/ is a given function. Finally, we assume that the ends of the bar are held at fixed 
temperatures: the temperature T\ at x = 0 and the temperature Ti^tx — L. How- 
ever, it turns out that we need only consider the case where T\— Tj — ^. We show 
in Section 10.6 how to reduce the more general problem to this special case. Thus in 
this section we will assume that u is always zero when x = 0 or x = L: 

m(0, t) = 0, u(L,t) — 0, t > 0. (4) 

The fundamental problem of heat conduction is to find u(x, t) that satisfies the differ- 
ential equation (1) for 0 < x < L and for t > 0, the initial condition (3) when t = 0, 
and the boundary conditions (4) at x = 0 and x — L. 

The problem described by Eqs. (1), (3), and (4) is an initial value problem in the 
time variable t; an initial condition is given and the differential equation governs 
what happens later. However, with respect to the space variable x, the problem 
is a boundary value problem; boundary conditions are imposed at each end of the 
bar and the differential equation describes the evolution of the temperature in the 
interval between them. Alternatively, we can consider the problem as a boundary 
value problem in the xf-plane (see Figure 10.5.2). The solution u(x,t) of Eq. (1) 
is sought in the semi-infinite strip 0 < x < L, t > 0, subject to the requirement that 
u(x, t ) must assume a prescribed value at each point on the boundary of this strip. 



FIGURE 10.5.2 Boundary value problem for the heat conduction equation. 

The heat conduction problem (1), (3), (4) is linear since u appears only to the first 
power throughout. The differential equation and boundary conditions are also ho- 
mogeneous. This suggests that we might approach the problem by seeking solutions 
of the differential equation and boundary conditions, and then superposing them to 
satisfy the initial condition. The remainder of this section describes how this plan 
can be implemented. 

One solution of the differential equation (1) that satisfies the boundary conditions 
(4) is the function u(x, t ) = 0, but this solution does not satisfy the initial condition 
(3) except in the trivial case in which fix) is also zero. Thus our goal is to find 
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other, nonzero solutions of the differential equation and boundary conditions. To 
find the needed solutions, we start by making a basic assumption about the form 
of the solutions that has far-reaching, and perhaps unforeseen, consequences. The 
assumption is that u(x, t) is a product of two functions, one depending only on x and 
the other depending only on t; thus 

u (x, t) = X (x) T{t). (5 ) 

Substituting from Eq. (5) for u in the differential equation (1) yields 

a 2 X"T = XT', (6) 


where primes refer to ordinary differentiation with respect to the independent vari- 
able, whether x or t. Equation (6) is equivalent to 


X" _ 1 T 
~X ~ Y’ 


( 7 ) 


in which the variables are separated; that is, the left side depends only on x and the 
right side only on t. 

It is now crucial to realize that for Eq. (7) to be valid for 0 < x < L, t > 0, it is 
necessary that both sides of Eq. (7) must be equal to the same constant. Otherwise, 
if one independent variable (say, x) were kept fixed and the other were allowed to 
vary, one side (the left in this case) of Eq. (7) would remain unchanged while the 
other varied, thus violating the equality. If we call this separation constant —X, then 
Eq. (7) becomes 


X" 1 T _ 

Yc - YY ~ 


( 8 ) 


Hence we obtain the following two ordinary differential equations for A (x) and T (t): 


X" + XX = 0, (9) 

T' + a 2 XT = 0. (10) 


We denote the separation constant by —A (rather than X) because it turns out that it 
must be negative, and it is convenient to exhibit the minus sign explicitly. 

The assumption (5) has led to the replacement of the partial differential equation 
(1) by the two ordinary differential equations (9) and (10). Each of these equations is 
linear and homogeneous, with constant coefficients, and so can be readily solved for 
any value of X. The product of two solutions of Eq. (9) and (10), respectively, provides 
a solution of the partial differential equation (1). However, we are interested only 
in those solutions of Eq. (1) that also satisfy the boundary conditions (4). As we now 
show, this severely restricts the possible values of X. 

Substituting for u(x, t) from Eq. (5) in the boundary condition at x = 0, we obtain 


n(0, 0 = X(0)T(t) = 0. 


( 11 ) 
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If Eq. (11) is satisfied by choosing T ( t ) to be zero for all t, then u(x, t) is zero for all 
x and t, and we have already rejected this possibility. Therefore Eq. (11) must be 
satisfied by requiring that 

JT(O) = 0. (12) 

Similarly, the boundary condition at x — L requires that 

X(L) = 0. (13) 

We now want to consider Eq. (9) subject to the boundary conditions (12) and (13). 
This is an eigenvalue problem and, in fact, is the same problem that we discussed in 
detail at the end of Section 10.1; see especially the paragraph following Eq. (29) in 
that section. The only difference is that the dependent variable there was called y 
rather than X. If we refer to the results obtained earlier [Eq. (31) of Section 10.1], 
the only nontrivial solutions of Eqs. (9), (12), and (13) are the eigenfunctions 

X n (x) = sin(niTx/L), n= 1,2,3, .. . (14) 

associated with the eigenvalues 

X n = n 2 7T 2 /L 2 , n = 1,2,3,.... (15) 

Turning now to Eq. (10) for T (t) and substituting n 2 ir 2 /L 1 for X, we have 

T + {n 2 n 2 or /L 2 )T = 0. (16) 

Thus T(t) is proportional to exp(— n 2 jr 2 a 2 t/L 2 ). Hence multiplying solutions of 
Eqs. (9) and (10) together, and neglecting arbitrary constants of proportionality, we 
conclude that the functions 

u n {x,t) = e ~ n2nla2tlL2 sm(mzx/L), n = 1,2,3,... (17) 

satisfy the partial differential equation (1) and the boundary conditions (4) for each 
positive integer value of n. The functions u n are sometimes called fundamental 
solutions of the heat conduction problem (1), (3), and (4). 

It remains only to satisfy the initial condition (3) 

u(x,0)=f(x), 0 <x<L. (18) 

Recall that we have often solved initial value problems by forming linear combina- 
tions of a set of fundamental solutions and then choosing the coefficients to satisfy 
the initial conditions. The analogous step in the present problem is to form a lin- 
ear combination of the functions (17) and then to choose the coefficients to satisfy 
Eq. (18). The main difference from earlier problems is that there are infinitely many 
functions (17), so a general linear combination of them is an infinite series. Thus we 
assume that 

OO OO 

U (X, t) = Y, c n u n (x, t) = T, c„e“" 27r2 “ 2r/i2 sin (19) 

n= 1 n= 1 

where the coefficients c n are as yet undetermined. The individual terms in the series 
(19) satisfy the differential equation (1) and boundary conditions (4). We will assume 
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that the infinite series of Eq. (19) converges and also satisfies Eqs. (1) and (4). To 
satisfy the initial condition (3), we must have 


OO 

u(x, 0) = E 

n = 1 


nnx 

c n srn-^-- =/(*). 


( 20 ) 


In other words, we need to choose the coefficients c„ so that the series of sine functions 
in Eq. (20) converges to the initial temperature distribution f(x) for 0 < x < L. The 
series in Eq. (20) is just the Fourier sine series for/; according to Eq. (8) of Section 
10.4, its coefficients are given by 


Cn — 



( 21 ) 


Hence the solution of the heat conduction problem of Eqs. (1), (3), and (4) is given 
by the series in Eq. (19) with the coefficients computed from Eq. (21). 


EXAMPLE 

1 


Find the temperature u(x, t) at any time in a metal rod 50 cm long, insulated on the sides, which 
initially has a uniform temperature of 20°C throughout and whose ends are maintained at 0°C 
for all t > 0. 

The temperature in the rod satisfies the heat conduction problem (1), (3), (4) with L = 50 
and/(x) = 20 for 0 < x < 50. Thus, from Eq. (19), the solution is 


u(x,t ) = ^c„e-' ,2lr2 “ 2,/2500 sin 

n = 1 


nnx 

~ 50 9 


( 22 ) 


where, from Eq. (21), 


Cn 


4 

5 



. nnx 

sin ax 

50 


40 

nn 


(1 — cosrar) = 


1 80 /nn, 

to. 


n odd; 
n even. 


(23) 


Finally, by substituting for c n in Eq. (22), we obtain 


u(x, t ) 


80 

7T 


E 

n=l,3,5,... 


_ n 2 jr 2 ot 2 r/2500 

n 


nnx 

IT' 


(24) 


The expression (24) for the temperature is moderately complicated, but the negative expo- 
nential factor in each term of the series causes the series to converge quite rapidly, except for 
small values of (ora 2 . Therefore accurate results can usually be obtained by using only a few 
terms of the series. 

In order to display quantitative results, let us measure t in seconds; then a 2 has the units 
of cm 2 /s. If we choose a 2 = 1 for convenience, this corresponds to a rod of a material whose 
thermal properties are somewhere between copper and aluminum. The behavior of the solu- 
tion can be seen from the graphs in Figures 10.5.3 through 10.5.5. In Figure 10.5.3 we show the 
temperature distribution in the bar at several different times. Observe that the temperature 
diminishes steadily as heat in the bar is lost through the end points. The way in which the 
temperature decays at a given point in the bar is indicated in Figure 10.5.4, where temperature 
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is plotted against time for a few selected points in the bar. Finally, Figure 10.5.5 is a three- 
dimensional plot of u versus both x and t. Observe that we obtain the graphs in Figures 10.5.3 
and 10.5.4 by intersecting the surface in Figure 10.5.5 by planes on which either t or x is con- 
stant. The slight waviness in Figure 10.5.5 at t = 0 results from using only a finite number of 
terms in the series for u(x , t) and from the slow convergence of the series for t = 0. 



FIGURE 10.5.3 Temperature distributions at several times for the heat conduction problem 
of Example 1. 



FIGURE 10.5.4 Dependence of temperature on time at several locations for the heat con- 
duction problem of Example 1. 


A problem with possible practical implications is to determine the time r at which the entire 
bar has cooled to a specified temperature. For example, when is the temperature in the entire 
bar no greater than 1°C? Because of the symmetry of the initial temperature distribution and 
the boundary conditions, the warmest point in the bar is always the center. Thus r is found by 
solving u{ 25, t) = 1 for t. Using one term in the series expansion (24), we obtain 

r = 2^0 i n ( 80 /jr) ^ 820 s. 

7i 
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PROBLEMS 



FIGURE 10.5.5 Plot of temperature u versus x and t for the heat conduction problem of 
Example 1. 


In each of Problems 1 through 6 determine whether the method of separation of variables 
can be used to replace the given partial differential equation by a pair of ordinary differential 
equations. If so, find the equations. 

1. xiij X + u, = 0 2. tu xx + xu, = 0 

3. u xx + u xt + u, = 0 4. \p(x)Ux\ x - r{x)u„ = 0 

5. u xx + (x 4- y)u yy = 0 6. u xx + u yy + xu = 0 

7. Find the solution of the heat conduction problem 

100u xl = u t , 0 < x < 1, t > 0; 

u(0,t)=0, m(1,0 = 0, t > 0; 

;/(x,0) = sin2jrx — sin5jrx, 0 < x < 1. 


8. Find the solution of the heat conduction problem 

u xx — 4 u t , 0 < x < 2, t > 0; 

«(0, t) — 0, u(2, t) = 0, t > 0; 

u(x, 0) = 2sin(7rx/2) — sin nx + 4 sin 2nx, 0 < x < 2. 


Consider the conduction of heat in a rod 40 cm in length whose ends are maintained at 0°C 
for all t > 0. In each of Problems 9 through 12 find an expression for the temperature u(x, t) 
if the initial temperature distribution in the rod is the given function. Suppose that a * 1 2 * * * * * 8 9 = 1. 

9. u(x, 0) = 50, 0 < x < 40 


10. u(x, 0) 


x, 

40 -x, 


0 < x < 20, 
20 < x < 40 
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11 . 


12 . 


4fl 13 - 


4fl 14. 


41 15. 
41 16- 
41 11 ■ 


41 18. 


19. 

20 . 


u(x, 0) 


0, 

50, 

0, 


0 < x < 10, 
10 < x < 30, 
30 < x < 40 


u(x, 0) = x, 0 < x < 40 


Consider again the rod in Problem 9. For t = 5 and x = 20 determine how many terms 
are needed to find the solution correct to three decimal places. A reasonable way to do 
this is to find n so that including one more term does not change the first three decimal 
places of m( 20, 5). Repeat for t = 20 and t = 80. Form a conclusion about the speed of 
convergence of the series for u(x, t). 

For the rod in Problem 9: 

(a) Plot u versus x for t = 5, 10, 20, 40, 100, and 200. Put all of the graphs on the same 
set of axes and thereby obtain a picture of the way in which the temperature distribution 
changes with time. 

(b) Plot u versus t for x = 5, 10, 15, and 20. 

(c) Draw a three-dimensional plot of u versus x and t. 

(d) How long does it take for the entire rod to cool off to a temperature of no more than 
1°C? 


Follow the instructions in Problem 14 for the rod in Problem 10. 

Follow the instructions in Problem 14 for the rod in Problem 11. 

For the rod in Problem 12: 

(a) Plot u versus x for t = 5, 10, 20, 40, 100, and 200. 

(b) For each value of t used in part (a) estimate the value of x for which the temperature 
is greatest. Plot these values versus t to see how the location of the warmest point in the 
rod changes with time. 

(c) Plot u versus t for x = 10, 20, and 30. 

(d) Draw a three-dimensional plot of u versus x and t. 

(e) How long does it take for the entire rod to cool off to a temperature of no more than 
1°C? 


Let a metallic rod 20 cm long be heated to a uniform temperature of 100° C. Suppose that 
at t = 0 the ends of the bar are plunged into an ice bath at 0°C, and thereafter maintained 
at this temperature, but that no heat is allowed to escape through the lateral surface. Find 
an expression for the temperature at any point in the bar at any later time. Determine 
the temperature at the center of the bar at time r = 30 s if the bar is made of (a) silver, 
(b) aluminum, or (c) cast iron. 

For the rod of Problem 18 find the time that will elapse before the center of the bar cools 
to a temperature of 5°C if the bar is made of (a) silver, (b) aluminum, or (c) cast iron. 

In solving differential equations, the computations can almost always be simplified by the 

use of dimensionless variables. 

(a) Show that if the dimensionless variable f = x/L is introduced, the heat conduction 
equation becomes 


3 2 u L 2 du 

W = a 1 9f ’ 


0 < £ < 1, t > 0. 


(b) Since L 1 /a 1 has the units of time, it is convenient to use this quantity to define a 
dimensionless time variable r = ( a 2 /L 2 )t . Then show that the heat conduction equation 
reduces to 
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21. Consider the equation 


bu, + cu = 0, 


where a, b , and c are constants. 

(a) Let u(x, t) = e s ‘w(x, t), where & is constant, and find the corresponding partial differ- 
ential equation for w. 

(b) If b / 0, show that <5 can be chosen so that the partial differential equation found in 
part (a) has no term in w. Thus, by a change of dependent variable, it is possible to reduce 
Eq. (i) to the heat conduction equation. 

22. The heat conduction equation in two space dimensions is 



Assuming that u(x,y,t ) = X (x)Y (y)T (f) , find ordinary differential equations that are sat- 
isfied by X (x), Y(y), and T(t). 

23. The heat conduction equation in two space dimensions may be expressed in terms of polar 
coordinates as 


a 2 [u rr + (1 /r)u r + (1 /r 2 )u ee \ = u t . 


Assuming that u(r,6,t ) = R(r)®(9)T(t), find ordinary differential equations that are sat- 
isfied by R(r), 0(0), and T(t). 


10.6 Other Heat Conduction Problems 


In Section 10.5 we considered the problem consisting of the heat conduction equation 


cru xx — u t , 0 < x < L, t > 0, 


(1) 


the boundary conditions 


«(0, t) = 0, u(L,t) = 0, t > 0, 


( 2 ) 


and the initial condition 


u(x, 0) = /(A), 0 < x < L. 


( 3 ) 


We found the solution to be 


OO 



sin — — , 


nnx 


( 4 ) 


n=l 


where the coefficients c n are the same as in the series 


OO 



( 5 ) 
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The series in Eq. (5) is just the Fourier sine series for/; according to Section 10.4, its 
coefficients are given by 



( 6 ) 


Hence the solution of the heat conduction problem, Eqs. (1) to (3), is given by the 
series in Eq. (4) with the coefficients computed from Eq. (6). 

We emphasize that at this stage the solution (4) must be regarded as a formal 
solution; that is, we obtained it without rigorous justification of the limiting processes 
involved. Such a justification is beyond the scope of this book. However, once 
the series (4) has been obtained, it is possible to show that in 0 < x < L, t > 0 it 
converges to a continuous function, that the derivatives u xx and u, can be computed 
by differentiating the series (4) term by term, and that the heat conduction equation 
(1) is indeed satisfied. The argument relies heavily on the fact that each term of 
the series (4) contains a negative exponential factor, and this results in relatively 
rapid convergence of the series. A further argument establishes that the function u 
given by Eq. (4) also satisfies the boundary and initial conditions; this completes the 
justification of the formal solution. 

It is interesting to note that although / satisfies the conditions of the Fourier con- 
vergence theorem (Theorem 10.3.1), it may have points of discontinuity. In this case 
the initial temperature distribution u(x, 0) = f(x) is discontinuous at one or more 
points. Nevertheless, the solution u(x, t) is continuous for arbitrarily small values of 
t > 0. This illustrates the fact that heat conduction is a diffusive process that instantly 
smooths out any discontinuities that may be present in the initial temperature dis- 
tribution. Finally, since / is bounded, it follows from Eq. (6) that the coefficients c n 
are also bounded. Consequently, the presence of the negative exponential factor in 
each term of the series (4) guarantees that 


( 7 ) 


lim u(x,t) = 0 


for all x regardless of the initial condition. This is in accord with the result expected 
from physical intuition. 

We now consider two other problems of one-dimensional heat conduction that can 
be handled by the method developed in Section 10.5. 

Nonhomogeneous Boundary Conditions. Suppose now that one end of the bar is held at a 
constant temperature T) and the other is maintained at a constant temperature T 2 . 
Then the boundary conditions are 


( 8 ) 


m(0, t) — 7), u(L,t) = T 2 , t > 0. 


The differential equation (1) and the initial condition (3) remain unchanged. 

This problem is only slightly more difficult, because of the nonhomogeneous bound- 
ary conditions, than the one in Section 10.5. We can solve it by reducing it to a problem 
having homogeneous boundary conditions, which can then be solved as in Section 
10.5. The technique for doing this is suggested by the following physical argument. 

After a long time — that is, as t — »■ 00 — we anticipate that a steady temperature 
distribution v(x) will be reached, which is independent of the time t and the initial 
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conditions. Since v(x) must satisfy the equation of heat conduction (1), we have 

v"(x) — 0, 0 < x < L. (9) 

Hence the steady state temperature distribution is a linear function of x. Further, 
v(x) must satisfy the boundary conditions 

i>(0) = T u v(L) = T 2 , (fO) 

which are valid even as t — »■ oo. The solution of Eq. (9) satisfying Eqs. (10) is 

v(x) = (T 2 - 7\)|- + T\. (11) 

Returning to the original problem, Eqs. (1), (3), and (8), we will try to express u(x, t ) 
as the sum of the steady state temperature distribution v(x) and another (transient) 
temperature distribution w(x, t): thus we write 

u(x,t ) = v(x) + w(x,t). (12) 

Since i>(x) is given by Eq. (11), the problem will be solved, provided that we can 
determine w(x,t). The boundary value problem for w(x,t) is found by substituting 
the expression in Eq. (12) for u(x, t) in Eqs. (1), (3), and (8). 

From Eq. (1) we have 

a 2 (v + w) xx = (v + w) t \ 


it follows that 


Wxx = wr, 


since v xx = 0 and v, = 0. Similarly, from Eqs. (12), (8), and (10), 
w(0,t) = u(0,t) — u(0) = T\ — T\ = 0, 
w(L,t) = u{Lj) — v(L ) = T 2 — T 2 — 0. 


(13) 


(14) 


Finally, from Eqs. (12) and (3), 

w(x, 0) = u(x, 0) — v(x) = f(x) — v(x), (15) 

where u(x) is given by Eq. (11). Thus the transient part of the solution to the original 
problem is found by solving the problem consisting of Eqs. (13), (14), and (15). This 
latter problem is precisely the one solved in Section 10.5, provided that/(x) — v(x) 
is now regarded as the initial temperature distribution. Hence 

OO 

u(x,t) = (T 2 - 7))^ + 7i + ^ c„e"' ,27r2 “ 2;/i2 sin (16) 

n = 1 

where 

2/’ L r„ m m x njr x , 

C„ = - J \f(x) — (T 2 — T\)— — TiJ sin —j— dx. (17) 

This is another case in which a more difficult problem is solved by reducing it to a 
simpler problem that has already been solved. The technique of reducing a problem 
with nonhomogeneous boundary conditions to one with homogeneous boundary 
conditions by subtracting the steady state solution has wide application. 
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EXAMPLE 

1 


Consider the heat conduction problem 



Wxx = Uf, 

0 < x < 30, t > 0, 

(18) 

u(0,t) = 20, 

m(30, t) = 50, t > 0, 

(19) 

u(x, 0) = 60 - 

■ 2x, 0 < x < 30. 

(20) 


Find the steady state temperature distribution and the boundary value problem that deter- 
mines the transient distribution. 

The steady state temperature satisfies v"(x) = 0 and the boundary conditions v(0) = 20 and 
u(30) = 50. Thus v(x) = 20 + x. The transient distribution w(x, t) satisfies the heat conduction 
equation 

w xx = w t , (21) 

the homogeneous boundary conditions 

w(0,t) = 0, in (30, t) = 0, (22) 

and the modified initial condition 

w(x, 0) = 60 — 2x — (20 + x) = 40 — 3x. (23) 

Note that this problem is of the form (1), (2), (3) with/pc) = 40 — 3x,a 2 = l.andL = 30. Thus 
the solution is given by Eqs. (4) and (6). 

Figure 10.6.1 shows a plot of the initial temperature distribution 60 — 2x , the final temper- 
ature distribution 20 + x, and the temperature at three intermediate times found by solving 
Eqs. (21) through (23). Note that the intermediate temperature satisfies the boundary condi- 
tions (19) for any t > 0. As t increases, the effect of the boundary conditions gradually moves 
from the ends of the bar toward its center. 



FIGURE 10.6.1 Temperature distributions at several times for the heat conduction problem 
of Example 1. 


Bar with Insulated Ends. A slightly different problem occurs if the ends of the bar are 
insulated so that there is no passage of heat through them. According to Eq. (2) in 
Appendix A, the rate of flow of heat across a cross section is proportional to the rate 
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of change of temperature in the x direction. Thus, in the case of no heat flow, the 
boundary conditions are 

u x (0,t ) = 0, u x (L,t) = 0, t > 0. (24) 


The problem posed by Eqs. (1), (3), and (24) can also be solved by the method of 
separation of variables. If we let 

u(x,t) = X(x)T(t), (25) 


and substitute for u in Eq. (1), then it follows, as in Section 10.5, that 

X" 1 T _ 

~X ~ YY ~ ~ k ’ 


(26) 


where X is a constant. Thus we obtain again the two ordinary differential equations 


X" + XX = 0, (27) 

T' + a 2 kT = 0. (28) 


For any value of A. a product of solutions of Eqs. (27) and (28) is a solution of the 
partial differential equation (1). However, we are interested only in those solutions 
that also satisfy the boundary conditions (24). 

If we substitute for u(x,t ) from Eq. (25) in the boundary condition at x — 0, we 
obtain X'(0 )T{t) — 0. We cannot permit T(t) to be zero for all t, since then u{x,t ) 
would also be zero for all t. Hence we must have 


X'(0) = 0. (29) 

Proceeding in the same way with the boundary condition at x = L, we find that 

X\L) = 0. (30) 

Thus we wish to solve Eq. (27) subject to the boundary conditions (29) and (30). It 
is possible to show that nontrivial solutions of this problem can exist only if X is real. 
One way to show this is indicated in Problem 18; alternatively, we can appeal to a 
more general theory to be discussed in Section 11.2. We will assume that X is real 
and will consider in turn the three cases X < 0, X = 0, and X > 0. 

If X < 0, it is convenient to let X — — /x 2 , where /x is real and positive. Then Eq. (27) 
becomes X" — /x 2 X = 0, and its general solution is 

X (x) = k\ sinh /ix + /o cosh /ix. (31) 

In this case the boundary conditions can be satisfied only by choosing k\ = ki = 0. 
Since this is unacceptable, it follows that X cannot be negative; in other words, the 
problem (27), (29), (30) has no negative eigenvalues. 

If X — 0, then Eq. (27) is X" = 0, and therefore 

X(x) — kix + k 2 . (32) 

The boundary conditions (29) and (30) require that ki = 0 but do not determine k 2 . 
Thus X = 0 is an eigenvalue, corresponding to the eigenfunction X (x) = 1. For 1 = 0 
it follows from Eq. (28) that T(t ) is also a constant, which can be combined with k 2 . 
Hence, for X — 0, we obtain the constant solution u(x, t ) = k 2 . 


10.6 Other Heat Conduction Problems 


625 


Finally, if X > 0, let X = /x 2 , where /x is real and positive. Then Eq. (27) becomes 
X" + /x 2 X — 0, and consequently, 

X (x) = k\ sin /xx + k 2 cos /xx. (33) 

The boundary condition (29) requires that k\ = 0, and the boundary condition (30) 
requires that /x = rut /L for n = 1, 2, 3, . . . but leaves ki arbitrary. Thus the problem 
(27), (29), (30) has an infinite sequence of positive eigenvalues X = n 2 n 2 /L 2 with 
the corresponding eigenfunctions X(x) — cos(rutx/L). For these values of X the 
solutions T(t) of Eq. (28) are proportional to exp(—n 2 jt 2 a 2 t/L 2 ). 

Combining all these results, we have the following fundamental solutions for the 
problem (1), (3), and (24): 


uo(x,t) = 1, 


„ , _„2^2 ry 2,/r2 nitX 

u n {x..t) = e ' cos— — , 


n = 1 , 2 ,..., 


(34) 


where arbitrary constants of proportionality have been dropped. Each of these 
functions satisfies the differential equation (1) and the boundary conditions (24). 
Because both the differential equation and the boundary conditions are linear and 
homogeneous, any finite linear combination of the fundamental solutions satisfies 
them. We will assume that this is true for convergent infinite linear combinations of 
fundamental solutions as well. Thus, to satisfy the initial condition (3), we assume 
that u(x, t) has the form 


Co ^ — > 

(*, t ) = T«o(X, 0 + 2^ C n u n(x, t) 


n = 1 


= c j + J2cne-» W'/^cos’™. 

n = 1 

The coefficients c n are determined by the requirement that 

OO 

_ Co v — mtx 
u(x, 0) = — + 2^ Cn COS — — = ;(x). 


n = 1 


(35) 


(36) 


Thus the unknown coefficients in Eq. (35) must be the coefficients in the Fourier 
cosine series of period 2 L for/. Flence 

c n = y f f(x) cos n I^ X L dx, n — 0,1,2, (37) 

L Jo L 

With this choice of the coefficients cq, ci, C 2 , . . . , the series (35) provides the solution 
to the heat conduction problem for a rod with insulated ends, Eqs. (1), (3), and (24). 

It is worth observing that the solution (35) can also be thought of as the sum 
of a steady state temperature distribution (given by the constant co/2), which is 
independent of time t, and a transient distribution (given by the rest of the infinite 
series) that vanishes in the limit as t approaches infinity. That the steady state is a 
constant is consistent with the expectation that the process of heat conduction will 
gradually smooth out the initial temperature distribution in the bar as long as no heat 


626 


Chapter 1 0. Partial Differential Equations and Fourier Series 


EXAMPLE 

2 


is allowed to enter from, or to escape to, the outside. The physical interpretation of 
the term 


co 

2 



(38) 


is that it is the mean value of the original temperature distribution. 


Find the temperature u(x, t ) in a metal rod of length 25 cm that is insulated on the ends as well 
as on the sides and whose initial temperature distribution is u(x, 0) = x for 0 < x < 25. 

The temperature in the rod satisfies the heat conduction problem (1), (3), (24) with L = 25. 
Thus, from Eq. (35), the solution is 


u(x,t) = j +J2 


-« 2 ir 2 a 2 (/625 


25 ’ 


where the coefficients are determined from Eq. (37). We have 

2 r 25 

co = — / x dx = 25 
25 J o 


and, for n > 1, 


2 f 25 nnx 

— / x cos dx 

25 J 0 25 


= 50(cos«7r — l)/(ror) 2 


| — 100/(«7r) 2 , n odd; 

1 0, n even. 


Thus 


u(x, t) 


25 

2 ~ 


100 


E 


n= 1,3,5,... 


±_ e -nVo,h/62S cos ( nnx /25) 
n 2 


is the solution of the given problem. 


(39) 


(40) 


(41) 


(42) 



FIGURE 10.6.2 Temperature distributions at several times for the heat conduction problem 
of Example 2. 
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For a 2 = 1, Figure 10.6.2 shows plots of the temperature distribution in the bar at several 
times. Again the convergence of the series is rapid so that only a relatively few terms are 
needed to generate the graphs. 


More General Problems. The method of separation of variables can also be used to 
solve heat conduction problems with other boundary conditions than those given 
by Eqs. (8) and Eqs. (24). For example, the left end of the bar might be held at a fixed 
temperature T while the other end is insulated. In this case the boundary conditions 
are 

m( 0, f) = T, u x (L,t) = 0, t > 0. (43) 

The first step in solving this problem is to reduce the given boundary conditions to 
homogeneous ones by subtracting the steady state solution. The resulting problem 
is solved by essentially the same procedure as in the problems previously consid- 
ered. However, the extension of the initial function / outside of the interval [0, L] is 
somewhat different from that in any case considered so far (see Problem 15). 

A more general type of boundary condition occurs when the rate of heat flow 
through the end of the bar is proportional to the temperature. It is shown in Appendix 
A that the boundary conditions in this case are of the form 

u x (0.t) — hiu(0,t) = 0, u x (L,t) + h 2 u(L,t) = 0, t > 0, (44) 

where h\ and h 2 are nonnegative constants. If we apply the method of separation of 
variables to the problem consisting of Eqs. (1), (3), and (44), we find that X{x) must 
be a solution of 

X" + XX = 0, X'(0) - hiXQS) = 0, X\L) + h 2 X(L) = 0, (45) 

where X is the separation constant. Once again it is possible to show that nontrivial 
solutions can exist only for certain nonnegative real values of X , the eigenvalues, but 
these values are not given by a simple formula (see Problem 20). It is also possible 
to show that the corresponding solutions of Eqs. (45), the eigenfunctions, satisfy an 
orthogonality relation and that we can satisfy the initial condition (3) by superposing 
solutions of Eqs. (45). However, the resulting series is not included in the discussion 
of this chapter. There is more general theory that covers such problems, and it is 
outlined in Chapter 11. 


PROBLEMS In each of Problems 1 through 8 find the steady-state solution of the heat conduction equation 
a 2 u xx = u, that satisfies the given set of boundary conditions. 

1. £/(0, t) = 10, i/(50, 0 = 40 2. «(0, 0 = 30, 

3. in (0,t) = 0, u(L,t) = 0 4. in(0,0=0, 

5. «(0, 0 = 0, u x ( L , 0 = 0 6. «(0, f) = T, 

7. in (0,0 — i<(0, 0 = 0, u(L,t) =T 8. «(0, 0 = T, 


m(40,0 = -20 
u(L.t) — T 
u x (L.t) = 0 
u x (L,t) + u(L, 0 = 0 


9. Let an aluminum rod of length 20 cm be initially at the uniform temperature of 25°C. 
Suppose that at time t = 0 the end x = 0 is cooled to 0°C while the end x = 20 is heated 
to 60° C, and both are thereafter maintained at those temperatures. 
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(a) Find the temperature distribution in the rod at any time t. 

(b) Plot the initial temperature distribution, the final (steady-state) temperature distri- 
bution, and the temperature distributions at two representative intermediate times on the 
same set of axes. 

(c) Plot u versus t for x = 5, 10, and 15. 

(d) Determine how much time must elapse before the temperature at x = 5 cm comes 
(and remains) within 1% of its steady-state value. 

10. (a) Let the ends of a copper rod 100 cm long be maintained at 0°C. Suppose that the 
center of the bar is heated to 100°C by an external heat source and that this situation is 
maintained until a steady state results. Find this steady-state temperature distribution. 

(b) At a time t = 0 [after the steady state of part (a) has been reached], let the heat source 
be removed. At the same instant let the end x = 0 be placed in thermal contact with a 
reservoir at 20°C, while the other end remains at 0°C. Find the temperature as a function 
of position and time. 

(c) Plot u versus x for several values of t. Also plot u versus t for several values of x. 

(d) What limiting value does the temperature at the center of the rod approach after a 
long time? How much time must elapse before the center of the rod cools to within 1° of 
its limiting value? 

11. Consider a rod of length 30 for which a 2 = 1. Suppose the initial temperature distribution 
is given by u(x , 0) = x(60 — x)/30 and that the boundary conditions are u( 0, t) = 30 and 
w(30, t) = 0. 

(a) Find the temperature in the rod as a function of position and time. 

(b) Plot u versus x for several values of t. Also plot u versus t for several values of x. 

(c) Plot u versus t for x = 12. Observe that u initially decreases, then increases for a 
while, and finally decreases to approach its steady-state value. Explain physically why this 
behavior occurs at this point. 

12. Consider a uniform rod of length L with an initial temperature given by 

u(x, 0) = sin(7rx/L), 0 < x < L. Assume that both ends of the bar are insulated. 

(a) Find the temperature u(x, t). 

(b) What is the steady-state temperature as t — >■ oo? 

(c) Let a 2 = 1 and L — 40. Plot u versus x for several values of t. Also plot u versus t for 
several values of x. 

(d) Describe briefly how the temperature in the rod changes as time progresses. 

13. Consider a bar of length 40 cm whose initial temperature is given by w(x , 0) = x(60 — x) /30. 
Suppose that or = 1/4 cm 2 /s and that both ends of the bar are insulated. 

(a) Find the temperature u(x,t). 

(b) Plot u versus x for several values of t. Also plot u versus t for several values of x. 

(c) Determine the steady-state temperature in the bar. 

(d) Determine how much time must elapse before the temperature at x = 40 comes within 
1° of its steady-state value. 

14. Consider a bar 30 cm long that is made of a material for which a 2 = 1 and whose ends are 
insulated. Suppose that the initial temperature is zero except for the interval 5 < x < 10, 
where the initial temperature is 25°C. 

(a) Find the temperature u(x,t). 

(b) Plot u versus x for several values of t. Also plot u versus t for several values of x. 

(c) Plot u(4 ,t) and w(ll,f) versus t. Observe that the points x = 4 and x = 11 are sym- 
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metrically located with respect to the initial temperature pulse, yet their temperature plots 
are significantly different. Explain physically why this is so. 

15. Consider a uniform bar of length L having an initial temperature distribution given by 
fix), 0 < x < L. Assume that the temperature at the end x = 0 is held at 0°C, while the 
end x = L is insulated so that no heat passes through it. 

(a) Show that the fundamental solutions of the partial differential equation and boundary 
conditions are 

u n (x, t ) = e -( 2 "- 1 >' ir '“ 2 '/4r. 2 s i n [(2n — l)nx/2 L], n = 1, 2, 3, ... . 

(b) Find a formal series expansion for the temperature u(x,t) 

OO 

u(x,t ) = E c n u„{x, t) 

n = 1 

that also satisfies the initial condition u(x, 0) = f(x). 

Hint: Even though the fundamental solutions involve only the odd sines, it is still possible 
to represent/ by a Fourier series involving only these functions. See Problem 39 of Section 
10.4. 

In the bar of Problem 15 suppose that L = 30, or = 1, and the initial temperature distri- 
bution is f(x) = 30 — x for 0 < x < 30. 

(a) Find the temperature u(x, t). 

(b) Plot u versus x for several values of t. Also plot u versus t for several values of x. 

(c) How does the location x m of the warmest point in the bar change as t increases? Draw 
a graph of x m versus t. 

(d) Plot the maximum temperature in the bar versus t. 

Suppose that the conditions are as in Problems 15 and 16 except that the boundary con- 
dition at x = 0 is z/(0, t) — 40. 

(a) Find the temperature u(x, t). 

(b) Plot u versus x for several values of t. Also plot u versus t for several values of x. 

(c) Compare the plots you obtained in this problem with those from Problem 16. Explain 
how the change in the boundary condition at x = 0 causes the observed differences in the 
behavior of the temperature in the bar. 

18. Consider the problem 

X" + IX = 0, JT (0) = 0, X'(L) = 0. (i) 

Let A. = /r 2 , where /r = v + ia with v and a real. Show that if cr ^ 0, then the only solution 
of Eqs. (i) is the trivial solution X(x) = 0. 

Hint: Use an argument similar to that in Problem 23 of Section 10.1. 

19. The right end of a bar of length a with thermal conductivity k\ and cross-sectional area A\ 
is joined to the left end of a bar of thermal conductivity k 2 and cross-sectional area A 2 . 
The composite bar has a total length L. Suppose that the end x = 0 is held at temperature 
zero, while the end x = L is held at temperature T. Find the steady-state temperature in 
the composite bar, assuming that the temperature and rate of heat flow are continuous at 
x = a. 

Hint: See Eq. (2) of Appendix A. 


16 . 


17. 
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20. Consider the problem 


& Wxx Li/ , 

1/(0, f) = 0, u x (L,t) + yu(L,t ) = 0, 
u(x, 0) =f(x), 


0 < x < L, t > 0; 
t > 0; 

0 < x < L. 


(a) Let u(x, t) = X (x) T (t ) and show that 


X" + XX = 0, X(0) = 0, X'(L) + j/X(L) = 0, 


(i) 


(ii) 


and 


r + ;u* 2 r = o, 


where X is the separation constant. 

(b) Assume that X is real, and show that problem (ii) has no nontrivial solutions if X < 0. 

(c) If X > 0, let X = /x 2 with p > 0. Show that problem (ii) has nontrivial solutions only 
if // is a solution of the equation 


p cos pL + y sin pL = 0. (iii) 

(d) Rewrite Eq. (iii) as tan pL = —ply . Then, by drawing the graphs of y = tan pL and 
y — —pL/yL for p > 0 on the same set of axes, show that Eq. (iii) is satisfied by infinitely 
many positive values of p\ denote these by pi,p2, ...,p„,..., ordered in increasing size. 

(e) Determine the set of fundamental solutions u n {x,t ) corresponding to the values p n 
found in part (d). 

An External Heat Source. Consider the heat conduction problem in a bar that is in thermal 
contact with an external heat source or sink. Then the modified heat conduction equation is 

u, = a 2 u xx + i(x), (i) 


where the term s(x) describes the effect of the external agency; s(x) is positive for a source and 
negative for a sink. Suppose that the boundary conditions are 


m(0, t) = T\, u(L,t ) = T 2 


(ii) 


and the initial condition is 


u(x, 0) = f(x). 


(iii) 


Problems 21 through 23 deal with this kind of problem. 

21. Write u(x,t ) = ii(x) + w(x,t), where v and w are the steady state and transient parts of 
the solution, respectively. State the boundary value problems that v(x) and w(x,t), re- 
spectively, satisfy. Observe that the problem for w is the fundamental heat conduction 
problem discussed in Section 10.5, with a modified initial temperature distribution. 


10.7 The Wave Equation: Vibrations of an Elastic String 


631 


41 22. (a) Suppose that or = 1 and s(x) = k, a constant, in Eq. (i). Find v(x). 

(b) Assume that T\ = 0, Ti = 0,L = 20, A; = 1/5, and that/(x) = 0 for 0 < x < L. Deter- 
mine w(x, t). Then plot u(x, t) versus x for several values of t; on the same axes also plot 
the steady-state part of the solution v(x). 

23. (a) Let a 2 = 1 and s(x) = kx/L , where A; is a constant, in Eq. (i). Find v(x). 

(b) Assume that 7) = 10, T 2 = 30, L = 20, k = 1 /2, and that f(x) = 0 for 0 < x < L. De- 
termine w(x,t). Then plot u(x,t ) versus x for several values of f; on the same axes also 
plot the steady-state part of the solution v(x). 
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A second partial differential equation that occurs frequently in applied mathematics 
is the wave 10 equation. Some form of this equation, or a generalization of it, almost 
inevitably arises in any mathematical analysis of phenomena involving the propaga- 
tion of waves in a continuous medium. For example, the studies of acoustic waves, 
water waves, electromagnetic waves, and seismic waves are all based on this equation. 

Perhaps the easiest situation to visualize occurs in the investigation of mechanical 
vibrations. Suppose that an elastic string of length L is tightly stretched between 
two supports at the same horizontal level, so that the x-axis lies along the string (see 
Figure 10.7.1). 

The elastic string may be thought of as a violin string, a guy wire, or possibly an 
electric power line. Suppose that the string is set in motion (by plucking, for example) 
so that it vibrates in a vertical plane, and let u{x, t) denote the vertical displacement 
experienced by the string at the point x at time t. If damping effects, such as air 
resistance, are neglected, and if the amplitude of the motion is not too large, then 
u(x, t) satisfies the partial differential equation 

Q u xx — iitt ( 1 ) 

in the domain 0 < x < L, t > 0. Equation (1) is known as the one-dimensional wave 
equation and is derived in Appendix B at the end of the chapter. The constant 



10 The solution of the wave equation was one of the major mathematical problems of the mid-eighteenth 
century. The wave equation was first derived and studied by D’Alembert in 1746. It also attracted the 
attention of Euler (1748), Daniel Bernoulli (1753), and Lagrange (1759). Solutions were obtained in 
several different forms, and the merits of, and relations among, these solutions were argued, sometimes 
heatedly, in a series of papers extending over more than 25 years. The major points at issue concerned 
the nature of a function and the kinds of functions that can be represented by trigonometric series. These 
questions were not resolved until the nineteenth century. 
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coefficient a 2 appearing in Eq. (1) is given by 

a 2 = T/p, (2) 

where T is the tension (force) in the string, and p is the mass per unit length of the 
string material. It follows that a has the units of length/time — that is, of velocity. In 
Problem 14 it is shown that a is the velocity of propagation of waves along the string. 

To describe the motion of the string completely, it is necessary also to specify 
suitable initial and boundary conditions for the displacement u(x,t). The ends are 
assumed to remain fixed, and therefore the boundary conditions are 

m(0,0 = 0, u(L,t) = 0, t > 0. (3) 

Since the differential equation (1) is of second order with respect to t, it is plausible 
to prescribe two initial conditions. These are the initial position of the string 

u(x,0)=f(x), 0 <x<L (4) 

and its initial velocity 

u t (x,0) = g(x), 0 < x < L, (5) 

where / and g are given functions. In order for Eqs. (3), (4), and (5) to be consistent, 
it is also necessary to require that 

/( 0) = f(L) = 0, g(0) = g(L) = 0. (6) 

The mathematical problem then is to determine the solution of the wave equation 
(1) that also satisfies the boundary conditions (3) and the initial conditions (4) and 
(5). Like the heat conduction problem of Sections 10.5 and 10.6, this problem is 
an initial value problem in the time variable t and a boundary value problem in the 
space variable x. Alternatively, it can be considered as a boundary value problem 
in the semi-infinite strip 0 < x < L, f>0of the xt-plane (see Figure 10.7.2). One 
condition is imposed at each point on the semi-infinite sides, and two are imposed at 
each point on the finite base. 



FIGURE 10.7.2 Boundary value problem for the wave equation. 
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It is important to realize that Eq. (1) governs a large number of other wave prob- 
lems besides the transverse vibrations of an elastic string. For example, it is only 
necessary to interpret the function u and the constant a appropriately to have prob- 
lems dealing with water waves in an ocean, acoustic or electromagnetic waves in 
the atmosphere, or elastic waves in a solid body. If more than one space dimension 
is significant, then Eq. (1) must be slightly generalized. The two-dimensional wave 
equation is 

a 2 (u xx + Uyy) = u„. (7) 

This equation would arise, for example, if we considered the motion of a thin elastic 
sheet, such as a drumhead. Similarly, in three dimensions the wave equation is 

U ( U xx T llyy T Ur-) — Uft . (^> ) 

In connection with the latter two equations, the boundary and initial conditions must 
also be suitably generalized. 

We now solve some typical boundary value problems involving the one-dimensional 
wave equation. 


Elastic String with Nonzero Initial Displacement. First suppose that the string is disturbed 
from its equilibrium position and then released at time t = 0 with zero velocity to vi- 
brate freely. Then the vertical displacement u(x, t) must satisfy the wave equation (1) 

ci 2 u xx = u tt , 0 < x < L, t > 0; 

the boundary conditions (3) 

u(0, t) = 0, u(L,t) — 0, t > 0; 
and the initial conditions 

u(x,0)=f(x), u t (x, 0) = 0, 0 < x < L, (9) 


where / is a given function describing the configuration of the string at t = 0. 

The method of separation of variables can be used to obtain the solution of Eqs. (1), 
(3), and (9). Assuming that 

u(x,t ) = X{x)T{t) (10) 


and substituting for u in Eq. (1), we obtain 


X" 1 T" _ 

~Y ~ a^Y ~ 


( 11 ) 


where X is a separation constant. Thus we find that X (x) and T ( t ) satisfy the ordinary 
differential equations 


X" + XX = 0, (12) 

T" + a 2 XT = 0. (13) 


Further, by substituting from Eq. (10) for u(x,t ) in the boundary conditions (3), we 
find that X (x) must satisfy the boundary conditions 


X(0) = 0, X(L) = 0. 


(14) 
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Finally, by substituting from Eq. (10) into the second of the initial conditions (9), we 
also find that T ( t ) must satisfy the initial condition 

r'(0) = 0. (15) 

Our next task is to determine X(x ), T(t), and X by solving Eq. (12) subject to the 
boundary conditions (14) and Eq. (13) subject to the initial condition (15). 

The problem of solving the differential equation (12) subject to the boundary 
conditions (14) is precisely the same problem that arose in Section 10.5 in connection 
with the heat conduction equation. Thus we can use the results obtained there and 
at the end of Section 10.1: the problem (12), (14) has nontrivial solutions if and only 
if X is an eigenvalue 

X = h 2 tt 2 /L 2 , n = 1,2,..., (16) 


and X(x) is proportional to the corresponding eigenfunction sin(/z7r.r/L). 
Using the values of X given by Eq. (16) in Eq. (13), we obtain 


Therefore 


T" + 


L 2 


T — 0. 


(17) 


T(t) = k\ cos 


mxat 

L 


+ k 2 sin 


mxat 


(18) 


where k\ and k 2 are arbitrary constants. The initial condition (15) requires that 
k 2 = 0, so T{t) must be proportional to cos(mr at / L) . 

Thus the functions 

mtx mxat 

u„(x,t) = sin —j— cos , n— 1,2,... (19) 


satisfy the partial differential equation (1), the boundary conditions (3), and the 
second initial condition (9). These functions are the fundamental solutions for the 
given problem. 

To satisfy the remaining (nonhomogeneous) initial condition (9), we will consider 
a superposition of the fundamental solutions (19) with properly chosen coefficients. 
Thus we assume that u(x, t) has the form 


OO 

u{x,t) = 

n = 1 


OO 

c n u n (x,t) — ^c„ 

n = 1 


. mtx 
sin — — cos 


nnat 


( 20 ) 


where the constants c„ remain to be chosen. The initial condition u(x, 0) = f(x) 
requires that 


OO 

u(x, 0 ) =Yl Cn 

n = 1 


nnx 

sm— = fix). 


( 21 ) 


Consequently, the coefficients c„ must be the coefficients in the Fourier sine series of 
period 2 L for/; hence 


n — 


2 

L 



. mtx 
sm —j— ax, 


( 22 ) 


c, 


n = 1 , 2 ,.... 
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EXAMPLE 

1 


Thus the formal solution of the problem of Eqs. (1), (3), and (9) is given by Eq. (20) 
with the coefficients calculated from Eq. (22). 

For a fixed value of n the expression sin(«jrx/L) cos(mrnf/L) in Eq. (19) is peri- 
odic in time t with the period 2 L/na\ it therefore represents a vibratory motion of the 
string having this period, or having the frequency nna/L. The quantities rnza/L for 
n = 1,2,... are the natural frequencies of the string — that is, the frequencies at which 
the string will freely vibrate. The factor sin (nnx/L) represents the displacement pat- 
tern occurring in the string when it is executing vibrations of the given frequency. 
Each displacement pattern is called a natural mode of vibration and is periodic in 
the space variable x; the spatial period 2 L/n is called the wavelength of the mode of 
frequency nna/L. Thus the eigenvalues rrn 2 /L 1 of the problem (12), (14) are pro- 
portional to the squares of the natural frequencies, and the eigenfunctions s'mimrx/ L) 
give the natural modes. The first three natural modes are sketched in Figure 10.7.3. 
The total motion of the string, given by the function u(x, t) of Eq. (20), is thus a com- 
bination of the natural modes of vibration and is also a periodic function of time with 
period 2 L/a. 


u 

u 

u 


1 

1 

1 




L 


-1 

L X 

-1 

X 

ipf -1 

L x 


(a) [b) (c) 

FIGURE 10.7.3 First three fundamental modes of vibration of an elastic string. 
(a) Frequency = na/L, wavelength = 2 L; ( b ) frequency = 2 na/L, wavelength = L; 
(c) frequency = 3na/L , wavelength = 2L/3. 


Consider a vibrating string of length L = 30 that satisfies the wave equation 

4 u xx — u„, 0 < x < 30, t > 0. 


(23) 


Assume that the ends of the string are fixed and that the string is set in motion with no initial 
velocity from the initial position 


u(x, 0) =/( x) = 


x/10, 0 < x < 10, 

(30 — x)/20, 10 < x < 30. 


(24) 


Find the displacement u(x , t) of the string and describe its motion through one period. 
The solution is given by Eq. (20) with a = 2 and L = 30; that is, 


. nnx 2nnt 
ll(x, t ) = 2 _^ c n Sin — jy- cos ■ 

n = 1 


30 


(25) 


where c n is calculated from Eq. (22). Substituting from Eq. (24) into Eq. (22), we obtain 


2 f 1 

Cn ~ 30 J 0 


x . nnx 2 

— sin ax - 1 

10 30 30 


L 


30 30 — x . nnx 

sm dx. 

10 20 30 


(26) 


636 


Chapter 1 0. Partial Differential Equations and Fourier Series 


By evaluating the integrals in Eq. (26), we find that 

9 . mr 

c n = — x sin — , ft = 1,2, ... . 

n^Ti- 3 


(27) 


The solution (25), (27) gives the displacement of the string at any point x at any time t. The 
motion is periodic in time with period 30, so it is sufficient to analyze the solution for 0 < t < 30. 

The best way to visualize the solution is by a computer animation showing the dynamic 
behavior of the vibrating string. Here we indicate the motion of the string in Figures 10.7.4, 
10.7.5, and 10.7.6. Plots of u versus x for t = 0, 4, 7.5, 11, and 15 are shown in Figure 10.7.4. 
Observe that the maximum initial displacement is positive and occurs at x = 10,whileatf = 15, 
a half-period later, the maximum displacement is negative and occurs at x = 20. The string 
then retraces its motion and returns to its original configuration at t = 30. Figure 10.7.5 shows 
the behavior of the points x = 10, 15, and 20 by plots of u versus t for these fixed values of x. 
The plots confirm that the motion is indeed periodic with period 30. Observe also that each 
interior point on the string is motionless for one-third of each period. Figure 10.7.6 shows a 
three-dimensional plot of u versus both x and t , from which the overall nature of the solution 
is apparent. Of course, the curves in Figures 10.7.4 and 10.7.5 lie on the surface shown in 
Figure 10.7.6. 




FIGURE 10.7.5 Plots of u versus t for fixed values of x for the string in Example 1. 
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FIGURE 10.7.6 Plot of u versus x and t for the string in Example 1. 


Justification of the Solution. As in the heat conduction problem considered earlier, 
Eq. (20) with the coefficients c„ given by Eq. (22) is only a formal solution of 
Eqs. (1), (3), and (9). To ascertain whether Eq. (20) actually represents the solution 
of the given problem requires some further investigation. As in the heat conduction 
problem, it is tempting to try to show this directly by substituting Eq. (20) for u(x, t ) 
in Eqs. (1), (3), and (9). However, upon formally computing u xx , for example, we 
obtain 

E °° /wr\ 2 . nnx mxat 

c "v77/ sin ^7 cos ~ 

n = 1 

due to the presence of the n 2 factor in the numerator, this series may not converge. 
This would not necessarily mean that the series (20) for u(x, t) is incorrect, but only 
that the series (20) cannot be used to calculate u xx and u tt . A basic difference between 
solutions of the wave equation and those of the heat conduction equation is that 
the latter contain negative exponential terms that approach zero very rapidly with 
increasing n, which ensures the convergence of the series solution and its derivatives. 
In contrast, series solutions of the wave equation contain only oscillatory terms that 
do not decay with increasing n. 

However, there is an alternative way to establish the validity of Eq. (20) indirectly. 
At the same time, we will gain additional information about the structure of the 
solution. First we will show that Eq. (20) is equivalent to 

u(x, t) = j [h(x — at) + h(x + n/)] , (28) 

where h is the function obtained by extending the initial data / into (— L,0) as an 
odd function, and to other values of x as a periodic function of period 2 L. That is. 


h(x) = 


fix), 

-f(-x), 


0 < x < L, 
—L < x < 0; 


(29) 


h(x. + 2 L) — h(x). 
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To establish Eq. (28), note that h has the Fourier series 



(30) 


where c„ is given by Eq. (22). Then, using the trigonometric identities for the sine of 
a sum or difference, we obtain 




and Eq. (28) follows immediately upon adding the last two equations. From Eq. (28) 
we see that u(x,t ) is continuous for 0 < x < L, t > 0, provided that h is continuous 
on the interval (— 00 , 00 ). This requires / to be continuous on the original interval 
[0, L]. Similarly, u is twice continuously differentiable with respect to either variable 
in 0 < x < L, t > 0, provided that h is twice continuously differentiable on (— 00 , 00 ). 
This requires /' and /" to be continuous on [0, L]. Furthermore, since h" is the 
odd extension of/", we must also have /"( 0) = /" ( L ) = 0. Ffowever, since h' is the 
even extension of /', no further conditions are required on /'. Provided that these 
conditions are met, u xx and u„ can be computed from Eq. (28), and it is an elementary 
exercise to show that these derivatives satisfy the wave equation. Some of the details 
of the argument just indicated are given in Problems 19 and 20. 

If some of the continuity requirements stated in the last paragraph are not met, 
then 11 is not differentiable at some points in the semi-infinite strip 0 < x < L, t > 0, 
and thus is a solution of the wave equation only in a somewhat restricted sense. An 
important physical consequence of this observation is that if there are any disconti- 
nuities present in the initial data/, then they will be preserved in the solution u(x, t ) 
for all time. In contrast, in heat conduction problems, initial discontinuities are in- 
stantly smoothed out (Section 10.6). Suppose that the initial displacement / has a 
jump discontinuity at x = xq, 0 < xq < L. Since h is a periodic extension of/, the 
same discontinuity is present in /z(£) at f — x 0 + 2nL and at £ = — x 0 + 2 nL. where 
n is any integer. Thus h (x — at) is discontinuous when x — at = x 0 + 2 nL, or when 
x — at = —xq + 2 nL. For a fixed x in [0, L] the discontinuity that was originally at 
X{\ will reappear in h (x — at) at the times t — (x ± .to — 2 nL)/a. Similarly, h (x + at) 
is discontinuous at the point x at the times t = (—x ± xq + 2 mL) /a, where m is any 
integer. If we refer to Eq. (28), it then follows that the solution u(x, t) is also discon- 
tinuous at the given point x at these times. Since the physical problem is posed for 
t > 0, only those values of m and n that yield positive values of t are of interest. 

General Problem for the Elastic String. Let us modify the problem just considered by sup- 
posing that the string is set in motion from its equilibrium position with a given 
velocity. Then the vertical displacement u(x,t) must satisfy the wave equation (1) 



the boundary conditions (3) 


m( 0, t) = 0, u(L,t) = 0, t > 0; 
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and the initial conditions 


u(x,0) — 0, u t (x,0 ) — g(x), 0 < x < L, (31) 


where g(x ) is the initial velocity at the point a: of the string. 

The solution of this new problem can be obtained by following the procedure 
described above for the problem (1), (3), and (9). Upon separating variables, we 
find that the problem for X(x) is exactly the same as before. Thus, once again, 
X = n 2 n 2 /L 1 and X (x) is proportional to sin(«jrx/L). The differential equation for 
T (t) is again Eq. (17), but the associated initial condition is now 

T( 0) = 0, (32) 


corresponding to the first of the initial conditions (31). The general solution of 
Eq. (17) is given by Eq. (18), but now the initial condition (32) requires that k\ — 0. 
Therefore T (t) is now proportional to sin(njtat/L) and the fundamental solutions 
for the problem (1), (3), and (31) are 

. nnx . htt at 

u n (x, t ) = sin —j— sin , n= 1,2,3, (33) 

Each of the functions u n (x, t) satisfies the wave equation (1), the boundary conditions 
(3), and the first of the initial conditions (31). The main consequence of using the 
initial conditions (31) rather than (9) is that the time-dependent factor in u n (x,t) 
involves a sine rather than a cosine. 

To satisfy the remaining (nonhomogeneous) initial condition, we assume that u (x, t) 
can be expressed as a linear combination of the fundamental solutions (33); that is. 


OO OO 

E v — "\ . . nnx 

k„u„(x , t ) = kn sin —j— sin 

n = 1 n= 1 


nnat 

L 


(34) 


To determine the values of the coefficients k n , we differentiate Eq. (34) with respect 
to t, set t — 0, and use the second initial condition (31); this gives the equation 


u t (x, 0) 


E nna . nnx 

-j~k n sm— =g(x). 

n = 1 


(35) 


Hence the quantities ( nna/L)k n are the coefficients in the Fourier sine series of 
period 2 L for g. Therefore 


ima 

~L 


kn 



n = 1,2, ... . 


(36) 


Thus Eq. (34), with the coefficients given by Eq. (36), constitutes a formal solution 
to the problem of Eqs. (1), (3), and (31). The validity of this formal solution can 
be established by arguments similar to those previously outlined for the solution of 
Eqs. (1), (3), and (9). 

Finally, we turn to the problem consisting of the wave equation (1), the boundary 
conditions (3), and the general initial conditions (4), (5): 


u(x, 0) =/(*), u t (x, 0) = g(x), 0 <x <L. (37) 


where f(x) and g(x) are the given initial position and velocity, respectively, of the 
string. Although this problem can be solved by separating variables, as in the cases 
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discussed previously, it is important to note that it can also be solved simply by adding 
together the two solutions that we obtained above. To show that this is true, let v(x, t ) 
be the solution of the problem (1), (3), and (9), and let w(x, t) be the solution of the 
problem (1), (3), and (31). Thus v{x,t) is given by Eqs. (20) and (22), and w(x,t) is 
given by Eqs. (34) and (36). Now let u(x,t ) = v{x, t) + w(x,t); what problem does 
a(x,t) satisfy? First, observe that 

a 2 u xx - ii t , = ( a 2 v xx - v tt ) + ( a 2 w xx - w„) — 0 + 0 = 0, (38) 

so u(x, t ) satisfies the wave equation (1). Next, we have 

m(0, t) — u(0, t) + u)(0, t) = 0 + 0 = 0, u(L, t ) = v(L, t) + w(L , t) = 0 + 0 = 0, 

(39) 

so u(x, t ) also satisfies the boundary conditions (3). Finally, we have 

u(x, 0) = v(x, 0) + w(x, 0) = fix) + 0 = f{x) (40) 

and 

u t (x, 0) = v,(x,0) + w t (x, 0) = 0 + g(x) = g(x). (41) 

Thus u(x, t) satisfies the general initial conditions (37). 

We can restate the result we have just obtained in the following way. To solve 
the wave equation with the general initial conditions (37), you can solve instead 
the somewhat simpler problems with the initial conditions (9) and (31), respectively, 
and then add together the two solutions. This is another use of the principle of 
superposition. 


PROBLEMS Consider an elastic string of length L whose ends are held fixed. The string is set in motion 
with no initial velocity from an initial position u(x, 0) = f(x). In each of Problems 1 through 
4 carry out the following steps. Let L = 10 and a = 1 in parts (b) through (d). 

(a) Find the displacement u(x, t) for the given initial position/(x). 

(b) Plot u(x, t) versus x for 0 < x < 10 and for several values of t between t = 0 and t — 20. 

(c) Plot u(x, t) versus t for 0 < t < 20 and for several values of x. 

(d) Construct an animation of the solution in time for at least one period. 

(e) Describe the motion of the string in a few sentences. 


4fl !•/(*> 


1 2 x/L, 0 < x < L/ 2, 

|2(L — x)/L, L/2<x<L 


2. f(x ) 
3- fix) 

41 4 •/(*) 


4 x/L, 0 < x < L/4, 

1, L/4 < x < 3L/4 , 

4 (L — x)/L, 3L/4 < x < L 

8 x{L — x) 2 /L? 

1 1, L/2 — 1 < x < L/2 +1 (L > 2), 
1 0, otherwise 


Consider an elastic string of length L whose ends are held fixed. The string is set in motion 
from its equilibrium position with an initial velocity u,(x, 0) = g(x). In each of Problems 5 
through 8 carry out the following steps. Let L = 10 and a = 1 in parts (b) through (d). 
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(a) Find the displacement u(x, t) for the given g(x). 

(b) Plot u(x , t) versus x for 0 < x < 10 and for several values of t between t = 0 and t = 20. 

(c) Plot u( x, t) versus t for 0 < t < 20 and for several values of x. 

(d) Construct an animation of the solution in time for at least one period. 

(e) Describe the motion of the string in a few sentences. 


4fl 5. g(x) 


j 2 x/L, 0 < x < L/2, 

|2(L — x)/L , L/2 < x < L 


6. g(x) 

7. g(x) 

4fl 8 - ?w 


4 x/L, 0 < x < L/4 , 

■ 1, L/4 < x < 3L/4 , 

4(L — x)/L, 3L/4 < x < L 

8x(L — x) 2 /L 3 

1 1, L/2 — 1 < x < L/2 + 1 (L > 2), 

1 0, otherwise 


9. If an elastic string is free at one end, the boundary condition to be satisfied there is that 
u x — 0. Find the displacement u(x,t) in an elastic string of length L, fixed at x = 0 and 
free at x = L, set in motion with no initial velocity from the initial position u(x. , 0) = fix), 
where / is a given function. 

Hint: Show that the fundamental solutions for this problem, satisfying all conditions except 
the nonhomogeneous initial condition, are 


u n (x,t) = sin k n x cos X n at, 


where X n = (2 n — l);r/2L, n = 1,2,.... Compare this problem with Problem 15 of Section 
10.6; pay particular attention to the extension of the initial data out of the original interval 
[0,L], 

10. Consider an elastic string of length L. The end x = 0 is held fixed, while the end x = L 
is free; thus the boundary conditions are u(0,t) = 0 and u x (L,t) = 0. The string is set in 
motion with no initial velocity from the initial position uix, 0) = f(x), where 


1 1, L/2 — 1 < x < L/2 + 1 (L > 2), 

1 0, otherwise. 

(a) Find the displacement u{x,t). 

(b) With L = 10 and a — 1, plot u versus x for 0 < x < 10 and for several values of t. Pay 
particular attention to values of t between 3 and 7. Observe how the initial disturbance is 
reflected at each end of the string. 

(c) With L = 10 and a = 1, plot u versus t for several values of x. 

(d) Construct an animation of the solution in time for at least one period. 

(e) Describe the motion of the string in a few sentences. 

11. Suppose that the string in Problem 10 is started instead from the initial position 
fix) = 8 x(L — x) 2 /L 3 . Follow the instructions in Problem 10 for this new problem. 

12. Dimensionless variables can be introduced into the wave equation a 2 u xx = u„ in the fol- 
lowing manner: 

(a) Let s — x/L and show that the wave equation becomes 


ci u$s — 7 , n i[ . 
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(b) Show that L/a has the dimensions of time and thus can be used as the unit on the 
time scale. Thus, let r = at/L and show that the wave equation then reduces to 

Uss — W TT . 

Problems 13 and 14 indicate the form of the general solution of the wave equation and the 
physical significance of the constant a. 

13. (a) Show that the wave equation 

2 _ 

U U-xx — M-tt 

can be reduced to the form = 0 by the change of variables £ = x — at, ?; = x + at. 

(b) Show that u(x , t) can be written as 

u(x,t ) — (f>(x — at) + i/T-v + at), 

where $ and i/r are arbitrary functions. 

14. (a) Plot the value of <f> (x — at) for t = 0,1/a, 2/a, and t 0 /a if 0 (s) = sin s . Note that for any 
t 0 the graph of y = <p(x — at) is the same as that ofy = <p{x) when t = 0,but displaced a 
distance at in the positive x direction. Thus a represents the velocity at which a disturbance 
moves along the string. 

(b) What is the interpretation of 4>(x + at)2 

15. A steel wire 5 ft in length is stretched by a tensile force of 50 lb. The wire has a weight per 
unit length of 0.026 lb/ft. 

(a) Find the velocity of propagation of transverse waves in the wire. 

(b) Find the natural frequencies of vibration. 

(c) If the tension in the wire is increased, how are the natural frequencies changed? Are 
the natural modes also changed? 

16. Consider the wave equation 

2 _ 

^ Mxx — Utt 

in an infinite one-dimensional medium subject to the initial conditions 
u(x, 0) =/(x), n t (x, 0) = 0, — oo < x < oo. 

(a) Using the form of the solution obtained in Problem 13, show that 4> and t/r must satisfy 

<p(x) + i fr{x) = f{x), 

-<p'(x) + i/r'(x) = 0. 

(b) Solve the equations of part (a) for (j> and i//, and thereby show that 

u(x,t) = j \f(x — at) + fix + at)] . 

This form of the solution was obtained by D’Alembert in 1746. 

Hint: Note that the equation xf'(x) = cf>'(x) is solved by choosing \js{x) = fix) + c. 

(c) Let 

{ 2, — 1 < x < 1, 

0, otherwise. 
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Show that 


f(x — at) 


{ 2, — 1 + at < x < 1 + at, 

0, otherwise. 


Also determine f(x + at). 

(d) Sketch the solution found in part (b) at t = 0, t = l/2a, t = 1/ a, and t = 2/a, obtain- 
ing the results shown in Figure 10.7.7. Observe that an initial displacement produces two 
waves moving in opposite directions away from the original location; each wave consists 
of one-half of the initial displacement. 
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FIGURE 10.7.7 Propagation of initial disturbance in an infinite one-dimensional medium. 


17. Consider the wave equation 


@ Uxx — Utt 


in an infinite one-dimensional medium subject to the initial conditions 


ll(X, 0) = 0, = g(x), 


— OO < X < 00. 
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(a) Using the form of the solution obtained in Problem 13, show that 

4>(x) + i/r(x) = 0, 

-a<p'(x) + af(x) = g(x). 


(b) Use the first equation of part (a) to show that ifr'(x) = —cf>'(x). Then use the second 
equation to show that —2 a<f>'(x) = g(.r) and therefore that 



where xo is arbitrary. Finally, determine i/r(x). 

(c) Show that 



18. By combining the results of Problems 16 and 17, show that the solution of the problem 


u(x,0) = f(x), u t (x, 0) = g(x), —oo < x < oo 


is given by 



Problems 19 and 20 indicate how the formal solution (20), (22) of Eqs. (1), (3), and (9) can be 

shown to constitute the actual solution of that problem. 

19. By using the trigonometric identity sin/1 cos B = ' [sin(/l + B) + sin(A — £)], show that 
the solution (20) of the problem of Eqs. (1), (3), and (9) can be written in the form (28). 

20. Let /z(£) represent the initial displacement in [0, L], extended into (— L,0) as an odd 
function and extended elsewhere as a periodic function of period 2 L. Assuming that h , h' , 
and h" are continuous, show by direct differentiation that u (x, t ) as given in Eq. (28) satisfies 
the wave equation (1) and also the initial conditions (9). Note also that since Eq. (20) 
clearly satisfies the boundary conditions (3), the same is true of Eq. (28). Comparing 
Eq. (28) with the solution of the corresponding problem for the infinite string (Problem 
16), we see that they have the same form, provided that the initial data for the finite string, 
defined originally only on the interval 0 < x < L, are extended in the given manner over 
the entire .r-axis. If this is done, the solution for the infinite string is also applicable to the 
finite one. 

21. The motion of a circular elastic membrane, such as a drumhead, is governed by the two- 
dimensional wave equation in polar coordinates 


u rr + (1 /r)u r + (1 /r 2 )u eB = a 2 u„. 


Assuming that u(r,9,t) = R(r)@(6)T(t ), find ordinary differential equations satisfied by 
R(r), @(6>), and T(t). 

22. The total energy E(t) of the vibrating string is given as a function of time by 
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the first term is the kinetic energy due to the motion of the string, and the second term is 
the potential energy created by the displacement of the string away from its equilibrium 
position. 

For the displacement u(x,t ) given by Eq. (20) — that is, for the solution of the string 
problem with zero initial velocity — show that 

2 rj-t OO 

= (ii) 

n = 1 

Note that the right side of Eq. (ii) does not depend on t. Thus the total energy £ is a 
constant and therefore is conserved during the motion of the string. 

Hint: Use Parseval’s equation (Problem 37 of Section 10.4 and Problem 17 of Section 
10.3), and recall that a 2 = T/p. 

23. Dispersive Waves. Consider the modified wave equation 

a~ 2 u tt + y 2 u = u xx , 0 < x < L, t > 0 (i) 


with the boundary conditions 

«(0,f)=0 , u(L,t) = 0, f>0 

and the initial conditions 

u(x, 0) = f(x), u t (x,0) = 0, 0 < x < L. 

(a) Show that the solution can be written as 


u(x,t) = E C„< 


~TF + r at sm_ ZT’ 


where 


— - . nnx 

c n = — I fix) sin —j-- ax. 


2 r L 

° n = lJ 0 

(b) By using trigonometric identities, rewrite the solution as 

A OO 

1 r . mi nit i 

u(x , t) = — 2 c n |^ sm j ( x + a n t) + sin — (x — n„0J • 


(ii) 

(hi) 


Determine a n , the speed of wave propagation. 

(c) Observe that a n , found in part (b), depends on n. This means that components of 
different wave lengths (or frequencies) are propagated at different speeds, resulting in 
a distortion of the original wave form over time. This phenomenon is called dispersion. 
Find the condition under which a n is independent of n — that is, there is no dispersion. 

24. Consider the situation in Problem 23 with a 2 = 1, L = 10, and 


fix) = 


x — 4, 
6 — x, 
0 , 


4 <x <5, 

5 <x <6, 
otherwise. 


(a) Determine the coefficients c n in the solution of Problem 23(a). 


646 


Chapter 1 0. Partial Differential Equations and Fourier Series 


(b) Plot 

N 

E . nnx 

c n sin for 0 < x < 10, 

"10 “ - 

n = 1 

choosing N large enough so that the plot accurately displays the graph of/(.r). Use this 
value of N for the remaining plots called for in this problem. 

(c) Let y = 0. Plot u(x, t) versus x for t = 60. 

(d) Let y = 1/8. Plot u(x, t ) versus x for t = 20, 40, and 60. 

(e) Let y = 1/4. Plot u(x, t) versus x for t = 20, 40, and 60. 


10.8 Laplace’s Equation 


One of the most important of all partial differential equations occurring in applied 
mathematics is that associated with the name of Laplace 11 : in two dimensions 

U XX Uyy — 0, (1) 

and in three dimensions 

ll XX T Uyy "}~ U -- — 0. (2) 

For example, in a two-dimensional heat conduction problem, the temperature 
u(x,y,t) must satisfy the differential equation 

a 2 (U xx + Uyy) = u t , (3) 

where a 2 is the thermal diffusivity. If a steady state exists, u is a function of x andy only, 
and the time derivative vanishes; in this case Eq. (3) reduces to Eq. (1). Similarly, for 
the steady-state heat conduction problem in three dimensions, the temperature must 
satisfy the three-dimensional form of Laplace’s equation. Equations (1) and (2) also 
occur in other branches of mathematical physics. In the consideration of electrostatic 
fields, the electric potential function in a dielectric medium containing no electric 
charges must satisfy either Eq. (1) or Eq. (2), depending on the number of space 
dimensions involved. Similarly, the potential function of a particle in free space acted 
on only by gravitational forces satisfies the same equations. Consequently, Laplace’s 
equation is often referred to as the potential equation. Another example arises in 
the study of the steady (time-independent), two-dimensional, inviscid, irrotational 
motion of an incompressible fluid. This study centers on two functions, known as 
the velocity potential function and the stream function, both of which satisfy Eq. (1). 
In elasticity, the displacements that occur when a perfectly elastic bar is twisted are 
described in terms of the so-called warping function, which also satisfies Eq. (1). 


n Laplace’s equation is named for Pierre-Simon de Laplace, who, beginning in 1782, studied its solutions 
extensively while investigating the gravitational attraction of arbitrary bodies in space. However, the 
equation first appeared in 1752 in a paper by Euler on hydrodynamics. 
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Since there is no time dependence in any of the problems mentioned previously, 
there are no initial conditions to be satisfied by the solutions of Eq. (1) or (2). They 
must, however, satisfy certain boundary conditions on the bounding curve or surface 
of the region in which the differential equation is to be solved. Since Laplace’s equa- 
tion is of second order, it might be plausible to expect that two boundary conditions 
would be required to determine the solution completely. This, however, is not the 
case. Recall that in the heat conduction problem for the finite bar (Sections 10.5 
and 10.6) it was necessary to prescribe one condition at each end of the bar — that 
is, one condition at each point of the boundary. If we generalize this observation 
to multidimensional problems, it is then natural to prescribe one condition on the 
function u at each point on the boundary of the region in which a solution of Eq. (1) 
or (2) is sought. The most common boundary condition occurs when the value of 
u is specified at each boundary point; in terms of the heat conduction problem, this 
corresponds to prescribing the temperature on the boundary. In some problems the 
value of the derivative, or rate of change, of u in the direction normal to the bound- 
ary is specified instead; the condition on the boundary of a thermally insulated body, 
for example, is of this type. It is entirely possible for more complicated boundary 
conditions to occur; for example, u might be prescribed on part of the boundary and 
its normal derivative specified on the remainder. The problem of finding a solution 
of Laplace’s equation that takes on given boundary values is known as a Dirichlet 
problem, in honor of R G. L. Dirichlet. 12 In contrast, if the values of the normal 
derivative are prescribed on the boundary, the problem is said to be a Neumann 
problem, in honor of K. G. Neumann. 13 The Dirichlet and Neumann problems are 
also known as the first and second boundary value problems of potential theory, 
respectively. 

Physically, it is plausible to expect that the types of boundary conditions just men- 
tioned will be sufficient to determine the solution completely. Indeed, it is possible 
to establish the existence and uniqueness of the solution of Laplace’s equation under 
the boundary conditions mentioned, provided that the shape of the boundary and 
the functions appearing in the boundary conditions satisfy certain very mild require- 
ments. However, the proofs of these theorems, and even their accurate statement, 
are beyond the scope of the present book. Our only concern will be solving some 
typical problems by means of separation of variables and Fourier series. 

While the problems chosen as examples have interesting physical interpretations 
(in terms of electrostatic potentials or steady state temperature distributions, for 
instance), our purpose here is primarily to point out some of the features that may 
occur during their mathematical solution. It is also worth noting again that more 
complicated problems can sometimes be solved by expressing the solution as the 
sum of solutions of several simpler problems (see Problems 3 and 4). 


12 Peter Gustav Lejeune Dirichlet (1805-1859) was a professor at Berlin and, after the death of Gauss, at 
Gottingen. In 1829 he provided the first set of conditions sufficient to guarantee the convergence of a 
Fourier series. The definition of function that is usually used today in elementary calculus is essentially 
the one given by Dirichlet in 1837. Although he is best known for his work in analysis and differential 
equations, Dirichlet was also one of the leading number theorists of the nineteenth century. 

13 Karl Gottfried Neumann ( 1832-1925), professor at Leipzig, made contributions to differential equations, 
integral equations, and complex variables. 
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Dirichlet Problem for a Rectangle. Consider the mathematical problem of finding the 
function u satisfying Laplace’s equation (1) 

U xx “h llyy = 0, 

in the rectangle 0 < x < a, 0 < y < b, and also satisfying the boundary conditions 
u(x, 0) = 0, u(x,b ) = 0, 0 < x < a. 

(4) 

u(0,y) = 0, u{a,y) = f(y ), 0 < y < b, 

where / is a given function on 0 < y < b (see Figure 10.8.1). 



FIGURE 10.8.1 Dirichlet problem for a rectangle. 


To solve this problem, we wish to construct a fundamental set of solutions satisfying 
the partial differential equation and the homogeneous boundary conditions; then we 
will superpose these solutions so as to satisfy the remaining boundary condition. Let 
us assume that 

u(x,y) = X(x)Y(y) (5) 


and substitute for u in Eq. (1). This yields 


X" 

~Y 


Y" 


= K 


where X is the separation constant. Thus we obtain the two ordinary differential 
equations 


X" — XX = 0, (6) 

Y" + XY = 0. (7) 


If we now substitute for u from Eq. (5) in each of the homogeneous boundary con- 
ditions, we find that 

X(0) = 0 (8) 

and 


Y(0) = 0, Y(b) = 0. (9) 

We will first determine the solution of the differential equation (7) subject to 
the boundary conditions (9). However, this problem is essentially identical to one 
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encountered previously in Sections 10.1, 10.5, and 10.7. We conclude that there are 
nontrivial solutions if and only if X is an eigenvalue, namely, 

X — {rm/b) 2 , n = l,2,..., (10) 

and Y (y) is proportional to the corresponding eigenfunction sin(njty/b). Next, we 
substitute from Eq. (10) for X in Eq. (6) and solve this equation subject to the bound- 
ary condition (8). It is convenient to write the general solution of Eq. (6) as 

X(x ) = k\ cosh(wrx/b) + ki smh(nnx/b), (11) 


and the boundary condition (8) then requires that k\ = 0. Therefore X(x) must be 
proportional to sinh(wrx/b). Thus we obtain the fundamental solutions 

. mix . mry „ ^ 

u n (x,y) = smh — — sin — — , n= 1,2, (12) 

These functions satisfy the differential equation (1) and all the homogeneous bound- 
ary conditions for each value of n. 

To satisfy the remaining nonhomogeneous boundary condition at.v = a, we assume, 
as usual, that we can represent the solution u(x,y) in the form 

OO OO 

nnx nny 

c n u n (x,y ) = 2_^ Cn smh -j- sin (13) 

n—1 n = 1 

The coefficients c„ are determined by the boundary condition 

OO 

E . rina . nny 

c n smh — sin — = f(y). (14) 

n—l 


Therefore the quantities c n sin\\(nna/b) must be the coefficients in the Fourier sine 
series of period 2b for / and are given by 


c n 


sinh 


nna 



(15) 


Thus the solution of the partial differential equation (1) satisfying the boundary 
conditions (4) is given by Eq. (13) with the coefficients c„ computed from Eq. (15). 

From Eqs. (13) and (15) we see that the solution contains the factor 
smh(mtx/b)/ smh(mta/b). To estimate this quantity for large n , we can use the 
approximation sinh £ = £ /2, and thereby obtain 


sinh(n7Tx/b) 
sinh(wr a/b) 


\ exp(njrx/b ) 
\ exp(mrfl/b) 


exp[— mt{a — x)/b]. 


Thus this factor has the character of a negative exponential; consequently, the series 
(13) converges quite rapidly unless a — x is very small. 
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EXAMPLE 

1 


To illustrate these results, let a — 3, b = 2, and 


m = 



0 < y < 1, 

1 < y < 2. 


By evaluating c„ from Eq. (15), we find that 


8sin(njr/2) 
n 2 n 2 sinh(3«7r/2) 


(16) 


(17) 


Then u(x,y) is given by Eq. (13). Keeping 20 terms in the series, we can plot u versus x and y, 
as shown in Figure 10.8.2. Alternatively, we can construct a contour plot showing level curves 
of u(x,y); Figure 10.8.3 is such a plot, with an increment of 0.1 between adjacent curves. 




FIGURE 10.8.3 Level curves of u(x,y) for Example 1. 
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Dirichlet Problem for a Circle. Consider the problem of solving Laplace’s equation in a 
circular region r < a subject to the boundary condition 

u(a,6)=f(d), (18) 

where / is a given function on 0 < 9 <2: x (see Figure 10.8.4). In polar coordinates 
Laplace’s equation has the form 

Urr H Mr H ^Ugg = 0. (19) 

r r L 

To complete the statement of the problem, we note that for ufj , 9) to be single-valued, 
it is necessary that u be periodic in 6 with period 2 j r. Moreover, we state explicitly 
that u{r, 9) must be bounded for r < a, since this will become important later. 



FIGURE 10.8.4 Dirichlet problem for a circle. 


To apply the method of separation of variables to this problem, we assume that 

u(r.O) = R(rme), (20) 


and substitute for u in the differential equation (19). This yields 


or 


f?"©+-f? , @+4f?©" = 0, 

r r A 


n R" R ' 0 " 

r l b r — = = X, 

R R 0 


(21) 


where X is the separation constant. Thus we obtain the two ordinary differential 
equations 

rR" + rR' -XR = 0, (22) 

©" + X® = 0. (23) 


In this problem there are no homogeneous boundary conditions; recall, however, 
that solutions must be bounded and also periodic in 9 with period 2tx. It is possible 
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to show (Problem 9) that the periodicity condition requires X to be real. We will 
consider in turn the cases in which X is negative, zero, and positive. 

If X < 0, let X = — /x 2 , where /x > 0. Then Eq. (23) becomes ©" — p 2 © = 0, and 
consequently, 

0(0) = ci + c 2 e~ lie . (24) 

Thus ©(0) can be periodic only if cj = C 2 = 0, and we conclude that X cannot be 
negative. 

If X — 0, then Eq. (23) becomes ©" = 0, and thus 

©(0) = ci + C 2 O. (25) 

For 0(0) to be periodic we must have C 2 = 0, so that 0(0) is a constant. Further, for 
X = 0, Eq. (22) becomes 

r 2 R " + rR' = 0. (26) 

This equation is of the Euler type and has the solution 

R(r) — k\ + k 2 In r. (27) 

The logarithmic term cannot be accepted if u(r,6 ) is to remain bounded as r —*■ 0; 
hence k 2 = 0. Thus, corresponding to X — 0, we conclude that u(r,9 ) must be a 
constant — that is, proportional to the solution 

u 0 (r,9) = 1. (28) 

Finally, if X > 0, we let X = p 2 , where /x > 0. Then Eqs. (22) and (23) become 

r 2 R" + rR' - p 2 R = 0 (29) 

and 

©" + p 2 @ = 0, (30) 

respectively. Equation (29) is an Euler equation and has the solution 

R(r) = ktr* + fer-'*, (31) 

while Eq. (30) has the solution 

0(0) = ci sin/r0 + C 2 cos/x0. (32) 

In order for © to be periodic with period 27r, it is necessary for /x to be a positive 
integer n. With p — n it follows that the solution r~ jl in Eq. (31) must be discarded 
since it becomes unbounded as r —*■ 0. Consequently, ki_ — 0 and the appropriate 
solutions of Eq. (19) are 

u„(r, 0) = r”cosn0, v n (r,9) — r" sinn0, n — 1,2 (33) 

These functions, together with uo(r,9) = 1, form a set of fundamental solutions for 
the present problem. 

In the usual way we now assume that u can be expressed as a linear combination 
of the fundamental solutions; that is, 

OO 

u(r,6 ) = — + ^ r n (c n cos n9 + k n sin«0). 

n= 1 


( 34 ) 
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The boundary condition (18) then requires that 

OO 

u(a,6 ) = ^ cos nO + k n sinnd) —f(0) (35) 

2 n=t 


for 0 < 0 <2 n. The function / may be extended outside this interval so that it is 
periodic with period 2n and therefore has a Fourier series of the form (35). Since 
the extended function has period 2: r, we may compute its Fourier coefficients by 
integrating over any period of the function. In particular, it is convenient to use the 
original interval (0,2jt); then 


1 f l7T 

a n c n = I f(0)cosndd6, 
x Jo 

1 f l7T 

a n k n = — / f(6)smn9dd, 
x Jo 


n = 0,1,2, ; 

(36) 

77 = 1,2,. . . . 

(37) 


With this choice of the coefficients, Eq. (34) represents the solution of the boundary 
value problem of Eqs. (18) and (19). Note that in this problem we needed both sine 
and cosine terms in the solution. This is because the boundary data were given on 
0 < 0 < 2jt and have period 2tt. Asa consequence, the full Fourier series is required, 
rather than sine or cosine terms alone. 


PROBLEMS 

1. (a) Find the solution u(x,y ) of Laplace’s equation in the rectangle 0 < jc < a, 0 < y < £>, 
that satisfies the boundary conditions 

w(0,y) = 0, t/(a,y) = 0, 0 < y < b, 

u(x, 0) = 0, u(x,b) = g(x), 0 < x < a. 

(b) Find the solution if 


{ x, 0 < x < all , 

a — x, a/2 < x < a. 

(c) For a = 3 and b = 1 plot u versus x for several values of y and also plot u versus y for 
several values of x. 

(d) Plot u versus both x and y in three dimensions. Also draw a contour plot showing 
several level curves of u{x,y) in the xy-plane. 

2. Find the solution u(x.y) of Laplace’s equation in the rectangle 0 < x < a,0 < y < b, that 
satisfies the boundary conditions 


u(a.y) = 0, 0 < y < b, 

u(x, b) = 0, 


u(0,y) = 0, 
u(x, 0) = h(x), 


0 < x < a. 
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3. (a) Find the solution u{x,y) of Laplace’s equation in the rectangle 0 < x < a,0 < y < b, 
that satisfies the boundary conditions 

u(0,y ) = 0, u(a,y) = fiy), 0 < y < b, 
u(x, 0) = h(x), u(x,b) = 0, 0 < x < a. 

Hint: Consider the possibility of adding the solutions of two problems, one with homo- 
geneous boundary conditions except for u(a,y) = /(y), and the other with homogeneous 
boundary conditions except for u(x, 0) = h(x). 

(b) Find the solution if h(x) = ( x/a ) 2 and f(y) = 1 — (y/b). 

(c) Let a = 2 and b = 2. Plot the solution in several ways: u versus x, u versus y, u versus 
both x and y, and a contour plot. 

4. Show how to find the solution u(x,y) of Laplace’s equation in the rectangle 0 < x < a, 
0 < y < b, that satisfies the boundary conditions 

u(0,y) = kiy), u(a,y) = f(y), 0 < y < b, 

u(x, 0) = h(x), u(x,b) = g(x), 0 < x < a. 

Hint: See Problem 3. 

5. Find the solution u(r, 9) of Laplace’s equation 

Urr + (1 /r)U r + (l/r 2 )Ugg = 0 

outside the circle r = a, that satisfies the boundary condition 

u(a, 9) = f(6), 0 < 9 < 2n, 

on the circle. Assume that u(r , 9) is single-valued and bounded for r > a. 

6. (a) Find the solution u(r,9) of Laplace’s equation in the semicircular region r < a, 
0 < 9 < 7T, that satisfies the boundary conditions 

u(r,0) = 0, u(r,n) = 0, 0 < r < a, 

u(a,9)=f(6), 0 < 9 < n. 

Assume that u is single-valued and bounded in the given region. 

(b) Find the solution if f(9) = 9 in — 9). 

(c) Let a — 2 and plot the solution in several ways: u versus r, u versus 9, u versus both r 
and 9 , and a contour plot. 

7. Find the solution u{r, 9) of Laplace’s equation in the circular sector 0 < r < a, 
0 <9 < a, that satisfies the boundary conditions 

u(r, 0) = 0, u(r, a) = 0, 0 < r < a, 

u(a, 9) = f(6), 0 <9 <ot. 

Assume that u is single-valued and bounded in the sector. 

8. (a) Find the solution u(x,y) of Laplace’s equation in the semi-infinite strip 0 < x < a, 
y > 0, that satisfies the boundary conditions 

m(0, y) — 0, u(a,y ) = 0, y > 0, 

u(x, 0) = f(x), 0 < x < a 

and the additional condition that u(x,y) — > 0 as y — *■ oo. 
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(b) Find the solution if/(x) = x(a — x). 

(c) Let a = 5. Find the smallest value of yo for which u(x,y ) < 0.1 for all y > yo- 

9. Show that Eq. (23) has periodic solutions only if k is real. 

Hint: Let k — — ft 2 , where /x = v + ia with v and a real. 

10. Consider the problem of finding a solution w(x,y) of Laplace’s equation in the rectangle 
0 < x < a.O < y < b, that satisfies the boundary conditions 

Ux(0,y) = 0, u x (a,y) = f(y), 0 < y < b, 

u y (x, 0) = 0, u y (x,b) =0, 0 < x < a. 

This is an example of a Neumann problem. 

(a) Show that Laplace’s equation and the homogeneous boundary conditions determine 
the fundamental set of solutions 


uo(x.y) = c 0 , 

u n (x,y ) = c„ cosh(nnx/b) cos(nny/b), n = 1,2,3,.... 

(b) By superposing the fundamental solutions of part (a), formally determine a function 
it satisfying the nonhomogeneous boundary condition u x (a,y) = /(y). Note that when 
u x (a,y) is calculated, the constant term in it(x,y ) is eliminated, and there is no condition 
from which to determine cq. Furthermore, it must be possible to express / by means of a 
Fourier cosine series of period 2b, which does not have a constant term. This means that 

f(y) dy = 0 

is a necessary condition for the given problem to be solvable. Finally, note that Co remains 
arbitrary, and hence the solution is determined only up to this additive constant. This is a 
property of all Neumann problems. 

11. Find a solution u{r, 6) of Laplace’s equation inside the circle r = a that satisfies the bound- 
ary condition on the circle 



u r (a,0 ) = g(8), 0 < 8 < 2 n. 


Note that this is a Neumann problem and that its solution is determined only up to an 
arbitrary additive constant. State a necessary condition on g{6) for this problem to be 
solvable by the method of separation of variables (see Problem 10). 

12. (a) Find the solution u(x,y) of Laplace’s equation in the rectangle 0 < x < a, 
0 < y < b, that satisfies the boundary conditions 

u(0,y) = 0, u(a,y) = 0, 0 < y < b, 

u y (x, 0) = 0, u(x, b ) = g(x), 0 < x < a. 

Note that this is neither a Dirichlet nor a Neumann problem, but a mixed problem in 
which u is prescribed on part of the boundary and its normal derivative on the rest. 

(b) Find the solution if 

0 < x < a/2, 
a/2 < x < a. 



(c) Let a = 3 and b = 1 . By drawing suitable plots, compare this solution with the solution 
of Problem 1. 
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13. (a) Find the solution u(x,y ) of Laplace’s equation in the rectangle 0 < x < a, 

0 < y < b, that satisfies the boundary conditions 

u(0,y) = 0, u(a,y)=f(y ), 0 < y < b, 

u(x, 0) = 0, u y (x,b) =0, 0 < x < a. 

Hint: Eventually, it will be necessary to expand f(y) in a series that makes use of the 
functions sin(jry/2fr), sin(3try/2Z>), sm(5ny/2b), . . . (see Problem 39 of Section 10.4). 

(b) Find the solution if f{y) = y(2b — y). 

(c) Let a = 3 and b = 2; plot the solution in several ways. 

14. (a) Find the solution u(x,y ) of Laplace’s equation in the rectangle 0 < x < a, 

0 < y < b, that satisfies the boundary conditions 

u x (fl,y) = 0, u x (a,y) = 0, 0 < y < b, 

u(x, 0) = 0, u(x,b) = g(x), 0 < x < a. 

(b) Find the solution if g(x) = 1 + x 2 (x — a) 2 . 

(c) Let a = 3 and b = 2; plot the solution in several ways. 

15. By writing Laplace’s equation in cylindrical coordinates r, 9, and z and then assuming that 
the solution is axially symmetric (no dependence on 9), we obtain the equation 

Urr 4” ( 1 / r)llj- 4“ u,, = 0 . 

Assuming that u(r, z) = R(r)Z(z), show that R and Z satisfy the equations 
rR" + R' + X 2 rR = 0, Z" - )cZ = 0. 

The equation for R is Bessel’s equation of order zero with independent variable kr. 

if fy) 16. Flow in an Aquifer. Consider the flow of water in a porous medium, such as sand, in 
an aquifer. The flow is driven by the hydraulic head, a measure of the potential energy 
of the water above the aquifer. Let R : 0 < x < a, 0<z<febea vertical section of an 
aquifer. In a uniform, homogeneous medium the hydraulic head u(x, z) satisfies Laplace’s 
equation 

Uxx + u zz = 0 in R. (i) 

If water cannot flow through the sides and bottom of R, then the boundary conditions 
there are 

Mx(0,z) = 0, u x (a,z) = 0, 0<z<fc (ii) 

n z (x, 0) = 0, 0 < x < a. (iii) 

Finally, suppose that the boundary condition at z = b is 

u(x, b) = b + ax, 0 < x < a, (iv) 

where a is the slope of the water table. 

(a) Solve the given boundary value problem for u(x, z). 

(b) Let a = 1000, b — 500, and a = 0.1. Draw a contour plot of the solution in R\ that is, 
plot some level curves of u(x, z). 

(c) Water flows along paths in R that are orthogonal to the level curves of u(x,z) in the 
direction of decreasing a. Plot some of the flow paths. 
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Derivation of the Heat Conduction Equation. In this section we derive the differ- 
APPENDIX ential equation that, to a first approximation at least, governs the conduction of heat 

A in solids. It is important to understand that the mathematical analysis of a physical 

situation or process such as this ultimately rests on a foundation of empirical knowl- 
edge of the phenomenon involved. The mathematician must have a place to start, so 
to speak, and this place is furnished by experience. Consider a uniform rod insulated 
on the lateral surfaces so that heat can flow only in the axial direction. It has been 
demonstrated many times that if two parallel cross sections of the same area A and 
different temperatures 7) and T 2 , respectively, are separated by a small distance d, 
an amount of heat per unit time will pass from the warmer section to the cooler one. 
Moreover, this amount of heat is proportional to the area A and to the temperature 
difference | T 2 — T \ |, and is inversely proportional to the separation distance d. Thus 

Amount of heat per unit time = kA\ T 2 — T\\/d, (1) 

where the positive proportionality factor k is called the thermal conductivity and de- 
pends primarily on the material 14 of the rod. The relation (1) is often called Fourier’s 
law of heat conduction. We repeat that Eq. (1) is an empirical, not a theoretical, 
result and that it can be, and has often been, verified by careful experiment. It is the 
basis of the mathematical theory of heat conduction. 

Now consider a straight rod of uniform cross section and homogeneous material, 
oriented so that the x-axis lies along the axis of the rod (see Figure 10.A.1). Let x = 0 
and x — L designate the ends of the bar. 


n 



H = -kAu x 



H = kAu x 


x = Xq x = Xq + Ax 

FIGURE 10.A.1 Conduction of heat in an element of a rod. 


We will assume that the sides of the bar are perfectly insulated so that there is no 
passage of heat through them. We will also assume that the temperature u depends 
only on the axial position x and the time t , and not on the lateral coordinates y and 
z. In other words, we assume that the temperature remains constant on any cross 
section of the bar. This assumption is usually satisfactory when the lateral dimensions 
of the rod are small compared to its length. 

The differential equation governing the temperature in the bar is an expression of 
a fundamental physical balance; the rate at which heat flows into any portion of the 
bar is equal to the rate at which heat is absorbed in that portion of the bar. The terms 
in the equation are called the flux (flow) term and the absorption term, respectively. 

We will first calculate the flux term. Consider an element of the bar lying between 
the cross sections x — xq and x = xq + Ax, where xq is arbitrary and Ax is small. The 


14 Actually, k also depends on the temperature, but if the temperature range is not too great, it is satisfactory 
to assume that k is independent of temperature. 
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instantaneous rate of heat transfer H(x o, t) from left to right across the cross section 
x = xq is given by 


H(x 0 , t) = — lim kA 

d^O 


u(x o + d/2, t) — u(xq — d/2, t) 
d 


= -KAu x (xo,t). (2) 

The minus sign appears in this equation because there will be a positive flow of heat 
from left to right only if the temperature is greater to the left of x — xq than to the 
right; in this case u x (xo,t ) is negative. In a similar manner, the rate at which heat 
passes from left to right through the cross section x = x 0 + Ax is given by 

H(x o + Ax, t) = —kAu x (xo + Ax, t). (3) 

The net rate at which heat flows into the segment of the bar between x = xo and 
x — xo + Ax is thus given by 

Q = H(x o, t) - H(x o + Ax, t) = kA[u x (xq + Ax, t) - u x {x 0 , f)], (4) 

and the amount of heat entering this bar element in time At is 

QAt = kA[u x (xq + Ax, t) — u x (x o, f)] At. (5) 


Let us now calculate the absorption term. The average change in temperature A u, 
in the time interval At, is proportional to the amount of heat QAt introduced and 
inversely proportional to the mass Ain of the element. Thus 

Au = * QAt = gA * , (6) 

s Am spAAx 

where the constant of proportionality 5 is known as the specific heat of the material 
of the bar, and p is its density. 15 The average temperature change Au in the bar 
element under consideration is the actual temperature change at some intermediate 
point x = xo + 9 Ax, where 0 < 6 < 1. Thus Eq. (6) can be written as 

QAt 

u(x o + 9 Ax, t + At) — u(x o + 9 Ax, t) = , (7) 

spAAx 


or as 


Q At = [u(x o + 9 Ax, t + At) — u(x o + 9 Ax, t)]spA Ax. 


( 8 ) 


To balance the flux and absorption terms, we equate the two expressions for Q At: 


kA[u x (x o + Ax, t) - u x (x 0 ,t)]At 

= spA[u{xQ + 9 Ax, t + At) — u(x . o + 9 Ax, /)] Ax. (9) 

Upon dividing Eq. (9) by Ax At and then letting Ax — »■ 0 and At —*■ 0, we obtain the 
heat conduction or diffusion equation 

a 2 u xx = u t . (10) 


1:> The dependence of the density and specific heat on temperature is relatively small and will be neglected. 
Thus both p and s will be considered as constants. 
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The quantity a 2 defined by 

or = k/ps (11) 

is called the thermal diffusivity and is a parameter that depends only on the material 
of the bar. The units of a 2 are (length) 2 /time. Typical values of a 2 are given in 
Table 10.5.1. 

Several relatively simple conditions may be imposed at the ends of the bar. For 
example, the temperature at an end may be maintained at some constant value T . 
This might be accomplished by placing the end of the bar in thermal contact with 
some reservoir of sufficient size so that any heat that flows between the bar and the 
reservoir does not appreciably alter the temperature of the reservoir. At an end 
where this is done the boundary condition is 

u = T. (12) 

Another simple boundary condition occurs if the end is insulated so that no heat 
passes through it. Recalling the expression (2) for the amount of heat crossing any 
cross section of the bar, we conclude that the condition of insulation is that this 
quantity vanish. Thus 

u x = 0 (13) 

is the boundary condition at an insulated end. 

A more general type of boundary condition occurs if the rate of flow of heat through 
an end of the bar is proportional to the temperature there. Let us consider the end 
x = 0, where the rate of flow of heat from left to right is given by — kAu x (0, f); see 
Eq. (2). Hence the rate of heat flow out of the bar (from right to left) at x = 0 is 
kAu x { 0, t). If this quantity is proportional to the temperature u( 0, f), then we obtain 
the boundary condition 


u x (0,t) — h\u(0,t) = 0, t > 0, (14) 

where hi is a nonnegative constant of proportionality. Note that h\ = 0 corresponds 
to an insulated end and that h\ -* oo corresponds to an end held at zero temperature. 

If heat flow is taking place at the right end of the bar (x = L), then in a similar way 
we obtain the boundary condition 


u x {Lj) + h2u(L,t) = 0, t > 0, (15) 

where again h 2 is a nonnegative constant of proportionality. 

Finally, to determine completely the flow of heat in the bar, it is necessary to state 
the temperature distribution at one fixed instant, usually taken as the initial time 
t = 0. This initial condition is of the form 

u(x,0) = f(x), 0 < x < L. (16) 

The problem then is to determine the solution of the differential equation (10) 
subject to one of the boundary conditions (12) to (15) at each end and to the initial 
condition (16) at t = 0. 
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Several generalizations of the heat equation (10) also occur in practice. First, the 
bar material may be nonuniform and the cross section may not be constant along the 
length of the bar. In this case, the parameters K,p,s, and A may depend on the axial 
variable x. Going back to Eq. (2), we see that the rate of heat transfer from left to 
right across the cross section at x — xq is now given by 

H(x 0 ,t ) = -k(x 0 )A{x 0 )u x (x 0 J) (17) 

with a similar expression for H(xq + Ax, t). If we introduce these quantities into 
Eq. (4) and eventually into Eq. (9), and proceed as before, we obtain the partial 
differential equation 

(kAu x ) x = spAu t . (18) 

We will usually write Eq. (18) in the form 

r(x)u t — | p(x)u x ] x , (19) 

where p(x) = k (x)A(x) and r(x) = s(x)p(x)A{x). Note that both of these quantities 
are intrinsically positive. 

A second generalization occurs if there are other ways in which heat enters or 
leaves the bar. Suppose that there is a source that adds heat to the bar at a rate 
G(x, t, u) per unit time per unit length, where G(x, t, u ) > 0. In this case we must add 
the term G(x, t, u) Ax At to the left side of Eq. (9), and this leads to the differential 
equation 

r{x)u t = [p(x)u x ] x + G(x, t, u). (20) 

If G(x, t, u) < 0, then we speak of a sink that removes heat from the bar at the rate 
G(x,t,u ) per unit time per unit length. To make the problem tractable, we must 
restrict the form of the function G. In particular, we assume that G is linear in u and 
that the coefficient of u does not depend on t. Thus we write 

G(x,t,u) = F(x,t) — q{x)u. (21) 

The minus sign in Eq. (21) has been introduced so that certain equations that appear 
later will have their customary forms. Substituting from Eq. (21) into Eq. (20), we 
obtain 

r(x)u, = [p(x)u x ] x - q{x)u + F(x, t). (22) 

This equation is sometimes called the generalized heat conduction equation. Bound- 
ary value problems for Eq. (22) will be discussed to some extent in Chapter 11. 

Finally, if instead of a one-dimensional bar, we consider a body with more than 
one significant space dimension, then the temperature is a function of two or three 
space coordinates rather than of x alone. Considerations similar to those leading to 
Eq. (10) can be employed to derive the heat conduction equation in two dimensions 

a 2 (U XX + Uyy) = u t , (23) 

or in three dimensions 

a 2 (u xx + Uyy + u zz ) — u t . (24) 

The boundary conditions corresponding to Eqs. (12) and (13) for multidimensional 
problems correspond to a prescribed temperature distribution on the boundary, or to 
an insulated boundary. Similarly, the initial temperature distribution will in general 
be a function of x and y for Eq. (23) and a function of x, y, and z for Eq. (24). 
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Derivation of the Wave Equation. In this appendix we derive the wave equation in 
APPENDIX one space dimension as it applies to the transverse vibrations of an elastic string, or 

3 cable; the elastic string may be thought of as a violin string, a guy wire, or possibly 

an electric power line. The same equation, however, with the variables properly 
interpreted, occurs in many other wave problems having only one significant space 
variable. 

Consider a perfectly flexible elastic string stretched tightly between supports fixed 
at the same horizontal level (see Figure lO.B.ln). Let the x-axis lie along the string 
with the endpoints located at x = 0 and x = L. If the string is set in motion at some 
initial time t = 0 (by plucking, for example) and is thereafter left undisturbed, it will 
vibrate freely in a vertical plane, provided that damping effects, such as air resistance, 
are neglected. To determine the differential equation governing this motion, we will 
consider the forces acting on a small element of the string of length Ax lying between 
the points x andx + Ax (see Figure lO.B.lh). We assume that the motion of the string 
is small and that, as a consequence, each point on the string moves solely in a vertical 
line. We denote by u(x, t ) the vertical displacement of the point x at the time t. Let 
the tension in the string, which always acts in the tangential direction, be denoted by 
T (x, f), and let p denote the mass per unit length of the string. 



(b) (c) 

FIGURE 10.B.1 (a) An elastic string under tension, (b) An element of the displaced string, 

(c) Resolution of the tension T into components. 


Newton’s law, as it applies to the element Ax of the string, states that the net 
external force, due to the tension at the ends of the element, must be equal to the 
product of the mass of the element and the acceleration of its mass center. Since 


662 


Chapter 1 0. Partial Differential Equations and Fourier Series 


there is no horizontal acceleration, the horizontal components must satisfy 

T(x+ Ax, f)cos(d + AO) — T(x, f)cos0 = 0. (1) 

If we denote the horizontal component of the tension (see Figure lO.B.lc) by H, then 
Eq. (1) states that H is independent of x. 

On the other hand, the vertical components satisfy 

T (x + Ax, t) sin(0 + Ad) — T (x, t) sin 6 = p Ax u tt ( x, t), (2) 

where x is the coordinate of the center of mass of the element of the string under 
consideration. Clearly, x lies in the interval x < x < x + Ax. The weight of the string, 
which acts vertically downward, is assumed to be negligible and has been neglected 
in Eq. (2). 

If the vertical component of T is denoted by V, then Eq. (2) can be written as 

V(x+ Ax,t) - V(x,t) 

= pu„(x,t). 

Ax 

Passing to the limit as Ax — »■ 0 gives 

V x (x,t) = pu„(x,t). (3) 

To express Eq. (3) entirely in terms of u, we note that 

V(x,t) = H(t) tand = H(t)u x (x,t). 


Flence Eq. (3) becomes 


(Hu x ) x = pu„. 


or, since H is independent of x, 

Hu xx = pu„. (4) 

For small motions of the string it is permissible to replace H = T cos 6 by T. Then 
Eq. (4) takes its customary form 

cru xx = u„, (5) 


where 

a 2 = T/p. (6) 

We will assume further that a 2 is a constant, although this is not required in our 
derivation, even for small motions. Equation (5) is called the wave equation for one 
space dimension. Since T has the dimension of force, and p that of mass/length, it 
follows that the constant a has the dimension of velocity. It is possible to identify 
a as the velocity with which a small disturbance (wave) moves along the string. 
According to Eq. (6), the wave velocity a varies directly with the tension in the string, 
but inversely with the density of the string material. These facts are in agreement 
with experience. 

As in the case of the heat conduction equation, there are various generalizations of 
the wave equation (5). One important equation is known as the telegraph equation 
and has the form 

ii tt + cu, + ku — a 2 u xx + F(x, t), (7) 

where c and k are nonnegative constants. The terms cu,, ku, and F(x, t) arise from a 
viscous damping force, an elastic restoring force, and an external force, respectively. 
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Boundary Value 
Problems and 
Sturm— Liouville 
Theory 

As a result of separating variables in a partial differential equation in Chapter 10, we 
repeatedly encountered the differential equation 

X" + XX = 0, 0 <x <L 

with the boundary conditions 

X(0) = 0, X(L) = 0. 

This boundary value problem is the prototype of a large class of problems that are 
important in applied mathematics. These problems are known as Sturm-Liouville 
boundary value problems. In this chapter we discuss the major properties of Sturm- 
Liouville problems and their solutions; in the process we are able to generalize some- 
what the method of separation of variables for partial differential equations. 


11.1 The Occurrence of Two-Point Boundary Value Problems 


In Chapter 10 we described the method of separation of variables as a means of 
solving certain problems involving partial differential equations. The heat conduction 
problem consisting of the partial differential equation 


& Uxx — m : 

0 < x < L, 

t > 0, 

(1) 

subject to the boundary conditions 




u(0,t) = 0, 

u{L,t) — 0, 

t > 0 

(2) 
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and the initial condition 


u(x,0)=f(x), 0 <x<L (3) 


is typical of the problems considered there. A crucial part of the process of solv- 
ing such problems is to find the eigenvalues and eigenfunctions of the differential 
equation 


X" + XX = 0, 

0 < x < L 

( 4 ) 

with the boundary conditions 



X(0) = 0 , 

X (L) = 0 

( 5 ) 

or perhaps 

X'(0) = 0, 

X’(L) = 0. 

(6) 


The sine or cosine functions that result from solving Eq. (4) subject to the boundary 
conditions (5) or (6) are used to expand the initial temperature distribution f(x) in a 
Fourier series. 

In this chapter we extend and generalize the results of Chapter 10. Our main goal 
is to show how the method of separation of variables can be used to solve problems 
somewhat more general than that of Eqs. (1), (2), and (3). We are interested in three 
types of generalizations. 

First, we wish to consider more general partial differential equations — for example, 
the equation 

r(x)u t = [p(x)u x ] x - q(x)u + F(x, t). (7) 

This equation can arise, as indicated in Appendix A of Chapter 10, in the study of heat 
conduction in a bar of variable material properties in the presence of heat sources. If 
p and r are constants, and if the source terms qu and F are zero, then Eq. (7) reduces 
to Eq. (1). The partial differential equation (7) also occurs in the investigation of 
other phenomena of a diffusive character. 

A second generalization is to allow more general boundary conditions. In partic- 
ular, we wish to consider boundary conditions of the form 

Mjf(0, t) — h\u(0, t) — 0, u x {L , t) + h 2 u(L, t) = 0. (8) 

Such conditions occur when the rate of heat flow through an end of the bar is pro- 
portional to the temperature there. Usually, h\ and hi are nonnegative constants, 
but in some cases they may be negative or depend on t. The boundary conditions (2) 
are obtained in the limit as h\ — > oo and hi — > oo. The other important limiting case, 
hi = h 2 = 0, gives the boundary conditions for insulated ends. 

The final generalization that we discuss in this chapter concerns the geometry of 
the region in which the problem is posed. The results of Chapter 10 are adequate 
only for a rather restricted class of problems, chiefly those in which the region of 
interest is rectangular or, in a few cases, circular. Later in this chapter we consider 
certain problems posed in a few other geometrical regions. 

Let us consider the equation 


r(x)u, = [p(x)u x ] x - q(x)u 


( 9 ) 
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obtained by setting the term F(x, t) in Eq. (7) equal to zero. To separate the variables, 
we assume that 

u{x, t) = X(x)T{t), (10) 

and substitute for u in Eq. (9). We obtain 

r(x)XT’ = [p(x)X']'T- q(x)XT (11) 

or, upon dividing by r(x)XT, 

r \p(x)XJ q(x) 

T r(x)X r(x) ' “ 

We have denoted the separation constant by — A in anticipation of the fact that usually 
it will turn out to be real and negative. From Eq. (12) we obtain the following two 
ordinary differential equations for X and T : 

[p{x)X'\ — q{x)X + Xr(x)X = 0, (13) 

T' + XT = 0. (14) 

If we substitute from Eq. (10) for u in Eqs. (8) and assume that h\ and h 2 are constants, 
then we obtain the boundary conditions 

X’({)) - /ijX(O) = 0, X\L) + h 2 X(L) = 0. (15) 

To proceed further, we need to solve Eq. (13) subject to the boundary conditions 
(15). Although this is a more general linear homogeneous two-point boundary value 
problem than the problem consisting of the differential equation (4) and the bound- 
ary conditions (5) or (6), the solutions behave in very much the same way. For every 
value of X, the problem (13), (15) has the trivial solution X{x) = 0. For certain values 
of X, called eigenvalues, there are also other, nontrivial, solutions called eigenfunc- 
tions. These eigenfunctions form the basis for series solutions of a variety of problems 
in partial differential equations, such as the generalized heat conduction equation (9) 
subject to the boundary conditions (8) and the initial condition (3). 

In this chapter we discuss some of the properties of solutions of two-point boundary 
value problems for second order linear equations. Sometimes we consider the general 
linear homogeneous equation 

P(x)y" + Q(x)y' + R{x)y = 0, (16) 

investigated in Chapter 3 . However, for most purposes it is better to discuss equations 
in which the first and second derivative terms are related as in Eq. (13). It is always 
possible to transform the general equation (16) so that the derivative terms appear 
as in Eq. (13) (see Problem 11). 

Boundary value problems with higher order differential equations can also occur; 
in them the number of boundary conditions must equal the order of the differential 
equation. As a rule, the order of the differential equation is even, and half the 
boundary conditions are given at each end of the interval. It is also possible for a 
single boundary condition to involve values of the solution and/or its derivatives at 
both boundary points; for example, 


y(0) — y(L) = 0. 


(17) 
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The following example involves one boundary condition of the form (15) and is 
therefore more complicated than the problems in Section 10.1. 


Find the eigenvalues and the corresponding eigenfunctions of the boundary value problem 

EXAMPLE 

1 y" + Ay = 0, (18) 

y(0) = o, /(i) + y(i) = o. (19) 

One place where this problem occurs is in the heat conduction problem in a bar of unit length. 
The boundary condition at x = 0 corresponds to a zero temperature there. The boundary 
condition at x = 1 corresponds to a rate of heat flow that is proportional to the temperature 
there, and units are chosen so that the constant of proportionality is 1 (see Appendix A of 
Chapter 10). 

The solution of the differential equation may have one of several forms, depending on X, so 
it is necessary to consider several cases. First, if X = 0, the general solution of the differential 
equation is 

y = c\x + C 2 . (20) 

The two boundary conditions require that 

C 2 = 0, 2ci + c 2 = 0, (21) 

respectively. The only solution of Eqs. (21) is C\ = c 2 = 0, so the boundary value problem has 
no nontrivial solution in this case. Flence X = 0 is not an eigenvalue. 

If X > 0, then the general solution of the differential equation (18) is 

y = ci sin y/x x + c 2 cos y/x x, (22) 

where y/x > 0. The boundary condition at x = 0 requires that c 2 = 0; from the boundary 
condition at x = 1 we then obtain the equation 

Cj (sin y/x + y/x cos y/x) = 0. 

For a nontrivial solution y we must have c 2 f 0, and thus X must satisfy 

sin y/x + y/x cos y/x = 0. (23) 

Note that if X is such that cos y/x = 0, then sin y/x / 0, and Eq. (23) is not satisfied. Hence we 
may assume that cos y/x ^ 0; dividing Eq. (23) by cos yfx. we obtain 

y/x = — tanVX. (24) 

The solutions of Eq. (24) can be determined numerically. They can also be found approximately 
by sketching the graphs of f(y/X) = y/x and g(y/X) = — tan y/x for y/x > 0 on the same set of 
axes, and identifying the points of intersection of the two curves (see Figure 11.1.1). The point 
yfx = 0 is specifically excluded from this argument because the solution (22) is valid only for 
yjx 0. Despite the fact that the curves intersect there, X = 0 is not an eigenvalue, as we have 
already shown. The first three positive solutions of Eq. (24) are y[X[ = 2.029, yfx^ = 4.913, 
and y/X ^ = 7.979. As can be seen from Figure 11.1.1, the other roots are given with reasonable 

accuracy by y[X n = (2 n — l)n/2 for n = 4, 5 the precision of this estimate improving as n 

increases. Hence the eigenvalues are 


Aj = 4.116, X 2 = 24.14, 

X„ = (2 n — l) 2 7r 2 /4 for n = 4, 5, 


A3 = 63.66, 


(25) 
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Finally, since C 2 = 0, the eigenfunction corresponding to the eigenvalue A„ is 

<p n (x.X„) = k„ sin-/^*; n = 1,2,..., (26) 

where the constant k n remains arbitrary. 



FIGURE 11.1.1 Graphical solution of = — tan 


Next consider A < 0. In this case it is convenient to let A = — n so that /x > 0. Then Eq. (14) 
becomes 


y" -ny = o, (27) 

and its general solution is 

y = cisinh v //xx + C 2 Coshy(iIx, (28) 

where ^JJi > 0. Proceeding as in the previous case, we find that /x must satisfy the equation 

= — tanh V /x . (29) 

From Figure 11.1.2 it is clear that the graphs oif{^fix) = V //Zandg(, v //I) = — tanh intersect 
only at the origin. Hence there are no positive values of JJi that satisfy Eq. (29), and hence 
the boundary value problem (18), (19) has no negative eigenvalues. 

Finally, it is necessary to consider the possibility that A may be complex. It is possible to 
show by direct calculation that the problem (18), (19) has no complex eigenvalues. However, 
in Section 11.2 we consider in more detail a large class of problems that includes this example. 
One of the things we show there is that every problem in this class has only real eigenvalues. 
Therefore we omit the discussion of the nonexistence of complex eigenvalues here. Thus we 
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PROBLEMS 


u 

1 


-1 


U = fi'fc) = v'u 


1 2 
u =g{'JjI) = -tanhA^u 




FIGURE 11.1.2 Graphical solution of JJi — — tanh Jji. 


conclude that all the eigenvalues and eigenfunctions of the problem (18), (19) are given by 
Eqs. (25) and (26). 


In each of Problems 1 through 6 state whether the given boundary value problem is homoge- 
neous or nonhomogeneous. 

1. y" + 4 y = 0, y(— 1) = 0, y( 1) = 0 

2. [(1 + x 2 )y'Y + 4y = 0, y(0) = 0, y(l) = 1 

3. y" + 4 y = sinx, y(0) = 0, y( 1) = 0 

4. + x 2 y = Xy, y'(0) - y( 0) = 0, y'( 1) + y( 1) = 0 

5. -[(1 + x 2 )y'V =Xy + l, y(- 1) = 0, y(l) = 0 

6. -y" = Ml + x 2 )y, y(0) = 0, y'(l) + 3y(l) = 0 


In each of Problems 7 through 10 assume that all eigenvalues are real. 

(a) Determine the form of the eigenfunctions and the determinantal equation satisfied by the 
nonzero eigenvalues. 

(b) Determine whether X = 0 is an eigenvalue. 

( c) Find approximate values for M and X 2 , the nonzero eigenvalues of smallest absolute value. 

(d) Estimate X„ for large values of n. 


7. 

8 . 

9. 

10 . 

11 . 


y" + M = 0, y(0) = 0, y(rc) + y'(jr) = 0 

y" + Xy = 0, y'(0) = 0, y(l) + y'(l) = 0 

y" + Xy = 0, y(0) - y'(0) = 0, y(l) + y'(l) = 0 

y" - Xy = 0, y(0) + y'(0) = 0, y(l) = 0 

Consider the general linear homogeneous second order equation 


P(x)y" + Q(x)y' + R(x)y = 0. 


(i) 


We seek an integrating factor fi(x) such that, upon multiplying Eq. (i) by /x(x), we can 
write the resulting equation in the form 


(ii) 


[/x(x)P(x)y']' + n(x)R(x)y = 0. 
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(a) By equating coefficients of y' in Eqs. (i) and (ii), show that p. must be a solution of 

/V = (G - (in) 

(b) Solve Eq. (iii) and thereby show that 

, , i r q<?) . r . 

ll(x) = W) cxp lw> ds - (1V) 

Compare this result with that of Problem 41 in Section 3.2. 

In each of Problems 12 through 15 use the method of Problem 11 to transform the given 
equation into the form [p(x)y']' + q(x)y = 0. 

12. y" — 2 xy' + Xy = 0, Hermite equation 

13. x 2 y" + xy' + (x 2 — v 2 )y = 0, Bessel equation 

14. xy" + (1 — x)y' + Xy = 0, Laguerre equation 

15. (1 — x 2 )y" — xy' + a 2 y = 0, Chebyshev equation 

16. The equation 

u„ + cu t + ku = a 2 u xx + F{x , f), (i) 

where a 2 > 0, c > 0, and k > 0 are constants, is known as the telegraph equation. It arises 
in the study of an elastic string under tension (see Appendix B of Chapter 10). Equation 
(i) also occurs in other applications. Assuming that F(x,t) = 0, let u(x,t) = X(x)T(t), 
separate the variables in Eq. (i), and derive ordinary differential equations for X and T. 

17. Consider the boundary value problem 

y" - 2 y' + (1 + X)y = 0, y(0) = 0, y(l) = 0. 

(a) Introduce a new dependent variable ii by the relation y = s(x)u. Determine j(x) so 
that the differential equation for u has no u' term. 

(b) Solve the boundary value problem for u and thereby determine the eigenvalues and 
eigenfunctions of the original problem. Assume that all eigenvalues are real. 

(c) Also solve the given problem directly (without introducing u). 

18. Consider the boundary value problem 

y " + 4 y' + (4 + 9 X)y = 0, y(0) = 0, y'(L) = 0. 

(a) Determine, at least approximately, the real eigenvalues and the corresponding eigen- 
functions by proceeding as in Problem 17(a, b). 

(b) Also solve the given problem directly (without introducing a new variable). 

Hint: In part (a) be sure to pay attention to the boundary conditions as well as the differ- 
ential equation. 

The differential equations in Problems 19 and 20 differ from those in previous problems in 
that the parameter X multiplies the / term as well as the y term. In each of these problems 
determine the real eigenvalues and the corresponding eigenfunctions. 

19. /' + / + X(y' + y) = 0, / (0) = 0, y( 1) = 0 

20. x 2 y" - X(xy' - y) = 0, y( 1) = 0, y(2) - y'( 2) = 0 

21. Consider the problem 


y" + Xy = 0, 


2y(0)+/(0) = 0, y(l) = 0. 
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(a) Find the determinantal equation satisfied by the positive eigenvalues. 

(b) Show that there is an infinite sequence of such eigenvalues. 

(c) Find A.i and X 2 . Then show that X n = [(2 n + l)jr/2] 2 for large n. 

(d) Find the determinantal equation satisfied by the negative eigenvalues. 

(e) Show that there is exactly one negative eigenvalue and find its value. 

22. Consider the problem 

y" + Xy = 0, ay(0) + y'(0) = 0, y{l) = 0, 


where o' is a given constant. 

(a) Show that for all values of a there is an infinite sequence of positive eigenvalues. 

(b) If a < 1, show that all (real) eigenvalues are positive. Show that the smallest eigen- 
value approaches zero as a approaches 1 from below. 

(c) Show that X = 0 is an eigenvalue only if a = 1. 

(d) If at > 1 , show that there is exactly one negative eigenvalue and that this eigenvalue 
decreases as a increases. 


23. Consider the problem 

y" + Ay = 0, y(0) = 0, y'(L) = 0. 

Show that if tj> m and cf>„ are eigenfunctions corresponding to the eigenvalues X m and X„, 
respectively, with X m ^ X n , then 



<f> m (x)(j> n (x) dx = 0. 


Hint: Note that 

= 0, (j) n + X n (j) n = 0. 


Multiply the first of these equations by 4 > n , the second by <f > m , and integrate from 0 to L, 
using integration by parts. Finally, subtract one equation from the other. 

24. In this problem we consider a higher order eigenvalue problem. The analysis of transverse 
vibrations of a uniform elastic bar is based on the differential equation 

y < 4 > -xy = 0, 

where y is the transverse displacement and X = mw 2 /El; m is the mass per unit length of 
the rod, E is Young’s modulus, I is the moment of inertia of the cross section about an 
axis through the centroid perpendicular to the plane of vibration, and w is the frequency 
of vibration. Thus for a bar whose material and geometrical properties are given, the 
eigenvalues determine the natural frequencies of vibration. Boundary conditions at each 
end are usually one of the following types: 


y = y' = 0, clamped end, 
y = y" = 0, simply supported or hinged end, 
y" = y'" = 0, free end. 

For each of the following three cases find the form of the eigenfunctions and the equation 
satisfied by the eigenvalues of this fourth order boundary value problem. Determine X\ 
and X 2 , the two eigenvalues of smallest magnitude. Assume that the eigenvalues are real 
and positive. 
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(a) >-(0) = /'(()) = 0, y{L) = y"{L) = 0 

(b) >-(0) = y"(0) = 0, y(L) = y'(L) = 0 

(c) y(0) = y(0) = 0, y" (L) = y'" (L) = 0 (cantilevered bar) 

25. This problem illustrates that the eigenvalue parameter sometimes appears in the boundary 
conditions as well as in the differential equation. Consider the longitudinal vibrations of 
a uniform straight elastic bar of length L. ft can be shown that the axial displacement 
u(x,t) satisfies the partial differential equation 

( E/p)u xx = lit,', 0 < x < L, t > 0, (i) 

where E is Young’s modulus and p is the mass per unit volume. If the end x = 0 is fixed, 
then the boundary condition there is 


M(0,r)=0, t > 0. (ii) 

Suppose that the end x = L is rigidly attached to a mass m but is otherwise unrestrained. 
We can obtain the boundary condition here by writing Newton’s law for the mass. From 
the theory of elasticity it can be shown that the force exerted by the bar on the mass is 
given by —EAu x (L,t). Hence the boundary condition is 


EAu x (L,t) + mu„(L,t ) = 0, t > 0. (iii) 

(a) Assume that u(x,t) = X{x)T{t), and show that X(x) and T(t) satisfy the differential 
equations 


X" + XX = 0, (iv) 

T" + k(E/p)T = 0. (v) 

(b) Show that the boundary conditions are 

Y(0) = 0, X'(L) - yXLX(L) = 0, (vi) 

where y = m/ pAL is a dimensionless parameter that gives the ratio of the end mass to 
the mass of the bar. 

Hint: Use the differential equation for T (?) in simplifying the boundary condition at x = L. 

(c) Determine the form of the eigenfunctions and the equation satisfied by the real eigen- 
values of Eqs. (iv) and (vi). 

(d) Find the first two eigenvalues X\ and X 2 if y = 0.5. 


11.2 Sturm-Liouville Boundary Value Problems 


We now consider two-point boundary value problems of the type obtained in Section 
11.1 by separating the variables in a heat conduction problem for a bar of variable 
material properties and with a source term proportional to the temperature. This 
kind of problem also occurs in many other applications. 
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These boundary value problems are commonly associated with the names of Sturm 
and Liouville. 1 They consist of a differential equation of the form 

IX*)/]' - q(x)y + kr(x)y = 0 (1) 

on the interval 0 < x < 1, together with the boundary conditions 

«ry(0) + a 2 /(0) = 0, /Siy(l) + X/(l) = 0 (2) 

at the endpoints. It is often convenient to introduce the linear homogeneous differ- 
ential operator L defined by 


L[y] = ~[p(x)y']' + q{x)y. 
Then the differential equation (1) can be written as 

L[y] = Xr(x)y. 


( 3 ) 

( 4 ) 


We assume that the functions p.p',q , and r are continuous on the interval 0 < x < 1 
and, further, that p(x) > 0 and r{x ) > 0 at all points in 0 < x < 1. These assumptions 
are necessary to render the theory as simple as possible while retaining considerable 
generality. It turns out that these conditions are satisfied in many significant prob- 
lems in mathematical physics. For example, the equation y" + Xy = 0, which arose 
repeatedly in the preceding chapter, is of the form (1) with p{x) = 1 , q(x) = 0, and 
r{x) — 1. The boundary conditions (2) are said to be separated; that is, each involves 
only one of the boundary points. These are the most general separated boundary 
conditions that are possible for a second order differential equation. 

Before proceeding to establish some of the properties of the Sturm-Liouville prob- 
lem (1), (2), it is necessary to derive an identity, known as Lagrange’s identity, which 
is basic to the study of linear boundary value problems. Let u and v be functions 
having continuous second derivatives on the interval 0 < x < 1. Then 2 

/ L[u]v dx = / [— (pu')'v + quv] dx. 

Jo Jo 

Integrating the first term on the right side twice by parts, we obtain 


f 


L[u]v dx = —p(x)u\x)v(x) + p(x)u(x)v r (x) 


LA- 

0 Jo 


u(pv'Y + uqv \ dx 


= — p(x)[u'(x)v(x) — ll(x)v\x)] 


+ / uL[v] dx. 
0 Jo 


1 Charles-Frangois Sturm (1803-1855) and Joseph Liouville (1809-1882), in a series of papers in 1836 
and 1837, set forth many properties of the class of boundary value problems associated with their names, 
including the results stated in Theorems 11.2.1 to 11.2.4. Sturm is also famous for a theorem on the number 
of real zeros of a polynomial and, in addition, did extensive work in physics and mechanics. Besides his 
own research in analysis, algebra, and number theory, Liouville was the founder, and for 39 years the 
editor, of the influential Journal de mathematiques pares et appliquees. One of his most important results 
was the proof (in 1844) of the existence of transcendental numbers. 

2 For brevity we sometimes use the notation fg f dx rather than fg f(x) dx in this chapter. 
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Hence, upon transposing the integral on the right side, we have 


f {L[u\ 
Jo 


v — uL[ i>]} dx = — p(x) [u'(x)v(x) — u(x) i/(x)] 


(5) 


which is Lagrange’s identity. 

Now let us suppose that the functions u and v in Eq. (5) also satisfy the boundary 
conditions (2). Then, if we assume that <*2 0 and fo / 0, the right side of Eq. (5) 

becomes 


-p(x) \u' (x)v{x) — u(x)v\x)] 

= —p{ 1) \u'{ l)a(l) - n(l)i/(l)] + p( 0) [i/(0)u(0) - h(0)i/(0)] 


= -pa) 
= 0. 


. P2 P2 


+ P(0) 


— — z/(0) v(0) H -zr(0)z2(0) 

a 2 «2 


The same result holds if either 012 or ^2 is zero; the proof in this case is even simpler 
and is left for you. Thus, if the differential operator L is defined by Eq. (3), and if 
the functions u and v satisfy the boundary conditions (2), then Lagrange’s identity 
reduces to 



uL[v]\ dx = 0. 


( 6 ) 


Let us now write Eq. (6) in a slightly different way. In Eq. (4) of Section 10.2 we 
introduced the inner product (u, v ) of two real-valued functions u and v on a given 
interval; using the interval 0 < x < 1, we have 


(u, v ) 



u{x)v(x) dx. 


In this notation Eq. (6) becomes 


(L[u],v) - (u,L[ v]) = 0. 


(7) 

( 8 ) 


In proving Theorem 11.2.1 below, it is necessary to deal with complex-valued func- 
tions. By analogy with the definition in Section 7.2 for vectors, we define the inner 
product of two complex-valued functions on 0 < x < 1 as 


(u, v ) 



u(x) v(x) dx , 


(9) 


where v is the complex conjugate of v. Clearly, Eq. (9) coincides with Eq. (7) if u(x) 
and v(x) are real. It is important to know that Eq. (8) remains valid under the stated 
conditions if u and v are complex-valued functions and if the inner product (9) is 


used. To see this, one can start with the quantity / L[u]v dx and retrace the steps 

Jo 

leading to Eq. (6) , making use of the fact that p(x), q(x),a \ , «2, fi \ , and ^2 are all real 
quantities (see Problem 22). 
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Theorem 11.2.1 


We now consider some of the implications of Eq. (8) for the Sturm-Liouville 
boundary value problem (1), (2). We assume without proof 3 that this problem ac- 
tually has eigenvalues and eigenfunctions. In Theorems 11.2.1 to 11.2.4 below, we 
state several of their important, but relatively elementary, properties. Each of these 
properties is illustrated by the basic Sturm-Liouville problem 

y" + Ly = 0, y(0) = 0, y(l) = 0, (10) 

whose eigenvalues are X n — n 2 jt 2 , with the corresponding eigenfunctions 
<p n (x) = sin njrx. 


All the eigenvalues of the Sturm-Liouville problem (1), (2) are real. 


To prove this theorem, let us suppose that X is a (possibly complex) eigenvalue 
of the problem (1), (2) and that 0 is a corresponding eigenfunction, also possibly 
complex-valued. Let us write X = /i + iv and (p(x) = U(x) + iV{x), where /x, v, U(x), 
and V (x) are real. Then, if we let u = (p and also v — cp in Eq. (8), we have 

(L[0],0) = (0,L[</>]). (11) 


However, we know that L[<p ] = Xrcp, so Eq. (11) becomes 

(kr<t>,<j>) = {(p.Xrcp). (12) 

Writing out Eq. (12) in full, using the definition (9) of the inner product, we obtain 

[ Xr(x)(p(x)(p(x) dx = f (p(x)Xr(x)(/)(x) dx. (13) 

Jo Jo 

Since r(x) is real, Eq. (13) reduces to 

(X — X) ( r(x)<p{x)<p(x) dx = 0, 

Jo 

or 

(k — X) f r(x)[U 2 (x) + V 2 (x)] dx = 0. (14) 

Jo 

The integrand in Eq. (14) is nonnegative and not identically zero. Since the inte- 
grand is also continuous, it follows that the integral is positive. Therefore the factor 
X — X — 2 iv must be zero. Hence v = 0 and X is real, so the theorem is proved. 

An important consequence of Theorem 11.2.1 is that in finding eigenvalues and 
eigenfunctions of a Sturm-Liouville boundary value problem, we need look only for 
real eigenvalues. Recall that this is what we did in Chapter 10. It is also possible to 
show that the eigenfunctions of the boundary value problem (1), (2) are real. A proof 
is sketched in Problem 23. 


3 The proof may be found, for example, in the references by Sagan (Chapter 5) or Birkhoff and Rota 
(Chapter 10). 
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Theorem 11.2.2 


Theorem 11.2.3 


If <p\ and 02 are two eigenfunctions of the Sturm-Liouville problem (1), (2) corre- 
sponding to eigenvalues Ai and A 2 , respectively, and if Ai ^ A 2 , then 


L 


1 


r(x)0i(x)02(x) dx = 0. 


(15) 


This theorem expresses the property of orthogonality of the eigenfunctions with 
respect to the weight function r. To prove the theorem, we note that <p\ and 02 satisfy 
the differential equations 

L[0l] = A. 17-01 (16) 


and 

L[0 2 ] = A 2 r0 2 , (17) 

respectively. If we let u = 0 1; v = 0 2 , and substitute for L[u] and L[v] in Eq. (8), we 
obtain 

(A. 1 7-0 1 ,0 2 ) - (01, A 2 r02) = 0, 


or, using Eq. (9), 


A,i / t-(x)0i(x)0 2 (x) dx — A 2 / 0i(x)t-(x)0 2 (x) dx — 0. 
Jo " Jo 

Because /, 2 , r(x), and 0 2 (x) are real, this equation becomes 
(A.! - A 2 ) / 7-(x)0i(x)02(.r) dx = 0. 




(18) 


Since by hypothesis An ^ A 2 , it follows that 0 1 and 0 2 must satisfy Eq. (15), and the 
theorem is proved. 


The eigenvalues of the Sturm-Liouville problem (1), (2) are all simple; that is, to 
each eigenvalue there corresponds only one linearly independent eigenfunction. 
Further, the eigenvalues form an infinite sequence and can be ordered according 
to increasing magnitude so that 

Ai < A 2 < A3 < • • • < A.„ < • • • . 

Moreover, A„ — > 00 as n — 00. 


The proof of this theorem is somewhat more advanced than those of the two 
previous theorems and will be omitted. However, a proof that the eigenvalues are 
simple is outlined in Problem 20. 

Again we note that all the properties stated in Theorems 11.2.1 to 11.2.3 are ex- 
emplified by the eigenvalues A„ = irn 2 and eigenfunctions 0„(x) = sin nnx of the 
example problem (10). Clearly, the eigenvalues are real. The eigenfunctions satisfy 
the orthogonality relation 

/ 0 m (x)0„ (x) dx = I sin mrcx sin hjtx dx = 0, 

Jo Jo 


m ^ n, 


(19) 
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which was established in Section 10.2 by direct integration. Further, the eigenval- 
ues can be ordered so that A; < X 2 < • • • , and k„ — > oo as n —>■ oo. Finally, to each 
eigenvalue there corresponds a single linearly independent eigenfunction. 

We will now assume that the eigenvalues of the Sturm-Liouville problem (1), (2) 
are ordered as indicated in Theorem 11.2.3. Associated with the eigenvalue X n is 
a corresponding eigenfunction </>„, determined up to a multiplicative constant. It is 
often convenient to choose the arbitrary constant multiplying each eigenfunction so 
as to satisfy the condition 

l 

r(x)(tr n (x) dx — 1, n = 1,2, (20) 

Equation (20) is called a normalization condition, and eigenfunctions satisfying this 
condition are said to be normalized. Indeed, in this case, the eigenfunctions are said 
to form an orthonormal set (with respect to the weight function r) since they already 
satisfy the orthogonality relation (15). It is sometimes useful to combine Eqs. (15) 
and (20) into a single equation. To this end we introduce the symbol S mn , known as 
the Kronecker 4 delta and defined by 


S 


mn 


0, if m ^ n, 

1, if m = n. 


( 21 ) 


Making use of the Kronecker delta, we can write Eqs. (15) and (20) as 


f 


r(x)<p m (x)<p n (x) dx = S mn . 


( 22 ) 


EXAMPLE 

1 


Determine the normalized eigenfunctions of the problem (10): 

y" + Xy = 0, y(0) = 0, y( 1) = 0. 

The eigenvalues of this problem are Ai = n 2 , X 2 = 4jr 2 , . . . ,X„ = n 2 n 2 , . . . , and the corre- 
sponding eigenfunctions are k\ sin jrx, k 2 sin 2nx, . . . , k n sin nnx, . . . , respectively. In this case 
the weight function is r(x ) = 1. To satisfy Eq. (20), we must choose k n so that 

/ (k n sinrarx) 2 dx = 1 (23) 

Jo 

for each value of n. Since 

k 2 I sin 2 nnx dx = k 2 I () — 4 cos2n7rx) dx = \k 2 n , 

Jo Jo 

Eq. (23) is satisfied if k n is chosen to be ~J2 for each value of n. Hence the normalized 
eigenfunctions of the given boundary value problem are 

<p n (x) = V2smnnx, n = 1,2,3, (24) 


4 Leopold Kronecker (1823-1891), a student of Dirichlet, was associated with the University of Berlin for 
most of his life, although (since he was independently wealthy) he held a faculty position only from 1883 
onward. He worked in number theory, elliptic functions, algebra, and their interconnections. 
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EXAMPLE 

2 


Determine the normalized eigenfunctions of the problem 


y" + Xy = 0, y(0) = 0, / (1) + y( 1) = 0. 

In Example 1 of Section 11.1 we found that the eigenvalues X„ satisfy the equation 

sin yf\n + sf\n COS = 0 
and that the corresponding eigenfunctions are 

= k n sin-/X^x, 

where k„ is arbitrary. We can determine k n from the normalization condition (20). 
r(x) = 1 in this problem, we have 



where in the last step we have used Eq. (26). Hence, to normalize the eigenfunctions <j> n , we 
must choose 


(25) 

(26) 

(27) 

Since 


k n — 


1/2 


, 1 + COS 2 ■sfx'n 

The normalized eigenfunctions of the given problem are 

V2 sin s/Ki x 


<p„(x) = 


(1 + cos 2 VK ) 1 ' 2 ' 


n = 1,2,.... 


(28) 


(29) 


We now turn to the question of expressing a given function / as a series of eigen- 
functions of the Sturm-Liouville problem (1), (2). We have already seen examples 
of such expansions in Sections 10.2 to 10.4. For example, it was shown there that if 
/ is continuous and has a piecewise continuous derivative on 0 < x < 1, and satisfies 
the boundary conditions /(0) =/( 1) = 0, then / can be expanded in a Fourier sine 
series of the form 


OO 

/(*) = E b„smnjtx. (30) 

«=i 

The functions smtnrx,?! — 1,2, ... , are precisely the eigenfunctions of the boundary 
value problem (10). The coefficients b n are given by 

b n = 2 1 f{x) sinmrxr/x 

Jo 


( 31 ) 
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Theorem 11.2.4 


and the series (30) converges for each x in 0 < x < 1. In a similar way / can be 
expanded in a Fourier cosine series using the eigenfunctions cos mix, n = 0, 1, 2, , 
of the boundary value problem y" + Xy = 0, y'(0) = 0, y'( 1) = 0. 

Now suppose that a given function /, satisfying suitable conditions, can be ex- 
panded in an infinite series of eigenfunctions of the more general Sturm-Liouville 
problem (1), (2). If this can be done, then we have 

OO 

m = ^n(x), (32) 

n = 1 

where the functions 0„(x) satisfy Eqs. (1), (2) and also the orthogonality condition 
(22). To compute the coefficients in the series (32), we multiply Eq. (32) by r(x)cp m (x), 
where m is a fixed positive integer, and integrate from x = 0 to x = 1. Assuming that 
the series can be integrated term by term, we obtain 

1 OO 1 OO 

/ r(x)f(x)<p m (x)dx = y2c n r(x)(j) m (x)<l> n (x)dx = i y'c n 8 mn . (33) 

Jo „= i Jo „= i 

Hence, using the definition of 8 mn , we have 

— I r(x)f(x)4) m (x) dx = (f , rcp m ), m = 1,2,.... (34) 

Jo 

The coefficients in the series (32) have thus been formally determined. Equation (34) 
has the same structure as the Euler-Fourier formulas for the coefficients in a Fourier 
series, and the eigenfunction series (32) also has convergence properties similar to 
those of Fourier series. The following theorem is analogous to Theorem 10.3.1. 


Let 0i , be the normalized eigenfunctions of the Sturm-Liouville prob- 

lem (1), (2): 


[p(x)y']' - q(x)y + Xr(x)y = 0, 

aiy(0) + a 2 y (0) = 0, fryfl) + /? 2 / (1) = 0. 

Let / and f be piecewise continuous on 0 < x < 1. Then the series (32) whose 
coefficients c m are given by Eq. (34) converges to [f(x+) +f(x — )]/2 at each point 
in the open interval 0 < x < 1. 


If / satisfies further conditions, then a stronger conclusion can be established. 
Suppose that, in addition to the hypotheses of Theorem 11.2.4, the function / is 
continuous on 0 < x < 1. If a 2 = 0 in the first of Eqs. (2) [so that 0„(O) = 0], then 
assume that /( 0) = 0. Similarly, if p 2 = 0 in the second of Eqs. (2), assume that 
/( 1) = 0. Otherwise no boundary conditions need be prescribed for /. Then the 
series (32) converges to/(x) at each point in the closed interval 0 < x < 1. 
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EXAMPLE 

3 


Expand the function 

fix) = x, 0 < x < 1 (35) 

in terms of the normalized eigenfunctions 4>„{x) of the problem (25). 

In Example 2 we found the normalized eigenfunctions to be 

<p„(x) = k n sin^/^n x, (36) 

where k n is given by Eq. (28) and X„ satisfies Eq. (26). To find the expansion for / in terms of 
the eigenfunctions </>„, we write 


/(*) = Y.CnMx), (37) 

n = 1 


where the coefficients are given by Eq. (34). Thus 

c n = / f(x)<j) n (x) dx = k n / xsin ypk n xdx. 
Jo Jo 


sin^A^ cos 2sinv^« 

— k- n 


Integrating by parts, we obtain 

Cn = k ’" X n VK ) 

where we have used Eq. (26) in the last step. Upon substituting for k„ from Eq. (28), we obtain 

2-J2 sin VKi 


c n — 


X„(l + cos 2 VKi) 1 ' 1 ' 


(38) 


Thus 


n = 1 


sin VKi sin x 
A„(l + cos 2 VK) 


(39) 


Observe that although the right side of Eq. (39) is a series of sines, it is not included in the 
discussion of Fourier sine series in Section 10.4. 


Self-Adjoint Problems. Sturm-Liouville boundary value problems are of great impor- 
tance in their own right, but they can also be viewed as belonging to a much more 
extensive class of problems that have many of the same properties. For example, 
there are many similarities between Sturm-Liouville problems and the algebraic 
system 

Ax = Xx, (40) 

where the n x n matrix A is real symmetric or Hermitian. Comparing the results 
mentioned in Section 7.3 with those of this section, we note that in both cases the 
eigenvalues are real and the eigenfunctions or eigenvectors form an orthogonal set. 
Further, the eigenfunctions or eigenvectors can be used as the basis for expressing an 
essentially arbitrary function or vector, respectively, as a sum. The most important 
difference is that a matrix has only a finite number of eigenvalues and eigenvec- 
tors, while a Sturm-Liouville problem has infinitely many. It is interesting and of 
fundamental importance in mathematics that these seemingly different problems — 
the matrix problem (40) and the Sturm-Liouville problem (1), (2) — which arise in dif- 
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ferent ways, are actually parts of a single underlying theory. This theory is usually 
referred to as linear operator theory and is part of the subject of functional analysis. 

We now point out some ways in which Sturm-Liouville problems can be gen- 
eralized, while still preserving the main results of Theorems 11.2.1 to 11.2.4 — the 
existence of a sequence of real eigenvalues tending to infinity, the orthogonality of 
the eigenfunctions, and the possibility of expressing an arbitrary function as a series 
of eigenfunctions. These generalizations depend on the continued validity of the 
crucial relation (8). 

Let us consider the boundary value problem consisting of the differential equation 
L[y] = kr(x)y, 0 < x < 1, (41) 


where 


L[y] = P„(x)^ H f P\ (x) + P 0 (x)y, (42) 

and n linear homogeneous boundary conditions at the endpoints. If Eq. ( 8) is valid for 
every pair of sufficiently differentiable functions that satisfy the boundary conditions, 
then the given problem is said to be self-adjoint. It is important to observe that Eq. (8) 
involves restrictions on both the differential equation and the boundary conditions. 
The differential operator L must be such that the same operator appears in both terms 
of Eq (8). This requires L to be of even order. Further, a second order operator must 
have the form (3), a fourth order operator must have the form 

L[y] = [p(x)y"]" - [q(x)y'Y + s(x)y, (43) 

and higher order operators must have an analogous structure. In addition, the bound- 
ary conditions must be such as to eliminate the boundary terms that arise during the 
integration by parts used in deriving Eq. (8). For example, in a second order problem 
this is true for the separated boundary conditions (2) and also in certain other cases, 
one of which is given in Example 4 below. 

Let us suppose that we have a self-adjoint boundary value problem for Eq. (41), 
where L[y ] is given now by Eq. (43). We assume that p, q, r, and s are continuous on 
0 < x < 1 and that the derivatives of p and q indicated in Eq. (43) are also continuous. 
If in addition p(x) > 0 and r(x) > 0 for 0 < x < 1, then there is an infinite sequence 
of real eigenvalues tending to +oo, the eigenfunctions are orthogonal with respect 
to the weight function r , and an arbitrary function can be expressed as a series of 
eigenfunctions. However, the eigenvalues may not be simple in these more general 
problems. 

We turn now to the relation between Sturm-Liouville problems and Fourier series. 
We have noted previously that Fourier sine and cosine series can be obtained by using 
the eigenfunctions of certain Sturm-Liouville problems involving the differential 
equation y" + Xy = 0. This raises the question of whether we can obtain a full Fourier 
series, including both sine and cosine terms, by choosing a suitable set of boundary 
conditions. The answer is provided by the following example, which also serves to 
illustrate the occurrence of nonseparated boundary conditions. 
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EXAMPLE 

4 


PROBLEMS 


Find the eigenvalues and eigenfunctions of the boundary value problem 


y" + Xy = 0, (44) 

y(-L) - y(L) = 0, y'i-L) - y'(L) = 0. (45) 

This is not a Sturm-Liouville problem because the boundary conditions are not separated. 
The boundary conditions (45) are called periodic boundary conditions since they require that 
y and y' assume the same values at x = L as at x = — L. Nevertheless, it is straightforward to 
show that the problem (44), (45) is self-adjoint. A simple calculation establishes that Xo = 0 is 
an eigenvalue and that the corresponding eigenfunction is 4> 0 (x) = 1. Further, there are addi- 
tional eigenvalues ki = (n/L) 2 ,\ 2 = (2jr/L) 2 , . . . ,X n = ( nn/L ) 2 , . . . . To each of these nonzero 
eigenvalues there correspond two linearly independent eigenfunctions; for example, corre- 
sponding to X n are the two eigenfunctions <p n (x) = cos(mtx/L) and ij/ n (x) = sm(nnx/L). This 
illustrates that the eigenvalues may not be simple when the boundary conditions are not sepa- 
rated. Further, if we seek to expand a given function/ of period 2 L in a series of eigenfunctions 
of the problem (44), (45), we obtain the series 


fix) = 


«o 

~2 


+ 


OO 

cos 

n = 1 


nnx 


+ b n sin 


nnx\ 

~L)' 


which is just the Fourier series for /. 


We will not give further consideration to problems that have nonseparated bound- 
ary conditions, nor will we deal with problems of higher than second order, except 
in a few problems. There is, however, one other kind of generalization that we do 
wish to discuss. That is the case in which the coefficients p, q, and r in Eq. (1) do 
not quite satisfy the rather strict continuity and positivity requirements laid down 
at the beginning of this section. Such problems are called singular Sturm-Liouville 
problems and are the subject of Section 11.4. 


In each of Problems 1 through 5 determine the normalized eigenfunctions of the given problem. 

1 . y" + Xy = 0, y(0) = 0, /(!) = 0 


2. y" + Xy = 0, y'(0) = 0, 

3. y" + Xy = 0, y'(0) = 0, 

4. y" + Xy = 0, y'(0) = 0, 

5. y» - 2/ + (1 + X)y = 0, 


y( 1) = o 
yd) = o 

/( 1) + y( 1) = 0; see Section 11.1, Problem 8. 
y(0) = 0, y(l) = 0; see Section 11.1, Problem 17. 


In each of Problems 6 through 9 find the coefficients in the eigenfunction expansion 

CO 

y a n <p n (x) of the given function, using the normalized eigenfunctions of Problem 1. 

«=i 


6. fix) = 1, 


8. fix) = 


0 <x <1 
0<x< \ 
\ <x <1 


7. f{x) = x, 0 < x < 1 


9- fix) = 



0<x< 1 
\<x<l 


684 


Chapter 11. Boundary Value Problems 


In each of Problems 10 through 13 find the coefficients in the eigenfunction expansion 

CO 

Y a n (p n (x) of the given function, using the normalized eigenfunctions of Problem 4. 

n = 1 

10. fix) = 1, 0 < x < 1 11. fix) = x, 0 < x < 1 

f 1, 0 < x < \ 

12. f{x) = 1 — x, 0 < x < 1 13. fix) = \ 

[0, ±<x<l 

In each of Problems 14 through 18 determine whether the given boundary value problem is 
self-adjoint. 

14. y" + / + 2y = 0, y(0) = 0, yi 1) = 0 

15. (1 + x 2 )y" + 2xy' + y = 0, y'(0) = 0, y(l) + 2/(1) = 0 

16. y" + y = Xy , y(0) - y'( 1) = 0, y'(0) - y(l) = 0 

17. (1 + x 2 )y" + 2xy' + y = Xil + x 2 )y, y(0) - y'(l) = 0, y'(0) + 2y(l) = 0 

18. y" + Xy = 0, y(0) = 0, yin) + yfn) — 0 

19. Show that if the functions u and v satisfy Eqs. (2), and either a 2 = 0 or f} 2 = 0, or both, 
then 

,i 

pix) I u'ix)vix) — uix)v'ix )1 = 0. 

L J lo 

20. In this problem we outline a proof of the first part of Theorem 11.2.3: that the eigenvalues 
of the Sturm-Liouville problem (1), (2) are simple. The proof is by contradiction. 

(a) Suppose that a given eigenvalue X is not simple. Then there exist two corresponding 
eigenfunctions <f > i and c/> 2 that are linearly independent, that is, not multiples of each other. 

(b) Compute the Wronskian Wiij>i,<f> 2 )ix) and use the boundary conditions (2) to show 
that W( 0 i, 02 )(O) = 0. 

(c) Use Theorem 3.2.6 to reach a contradiction, which establishes that the eigenvalues 
must be simple, as asserted in Theorem 11.2.3. 

21. Consider the Sturm-Liouville problem 

-\p(x)y'f + qix)y = Xrix)y, 
aiy(0) + a 2 y\0) = 0, fiyil) + fry' (1) = 0, 

where p, q , and r satisfy the conditions stated in the text. 

(a) Show that if X is an eigenvalue and <j> a corresponding eigenfunction, then 

xf rf 2 dx = f ip<f>' 2 + qcjr) dx + ^-p(l)0 2 (l) - — p(O)0 2 (O), 

Jo Jo P2 <* 2 

provided that a 2 y 0 and f 2 y 0. How must this result be modified if or 2 = 0 or f 2 = 0? 

(b) Show that if qix) > 0 and if fi/ f 2 and —a\/a 2 are nonnegative, then the eigenvalue X 
is nonnegative. 

(c) Under the conditions of part (b) show that the eigenvalue X is strictly positive unless 
a\ = Pi = 0 and qix) = 0 for each x in 0 < x < 1. 

22. Derive Eq. (8) using the inner product (9) and assuming that u and v are complex-valued 
functions. 

Hint: Consider the quantity / L\u\v dx, split u and v into real and imaginary parts, and 

Jo 

proceed as in the text. 
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23. In this problem we outline a proof that the eigenfunctions of the Sturm-Liouville problem 
(1), (2) are real. 

(a) Let A be an eigenvalue and <f> a corresponding eigenfunction. Let <p (x) = U{x) + iV (x), 
and show that U and V are also eigenfunctions corresponding to A. 

(b) Using Theorem 11.2.3, or the result of Problem 20, show that U and V are linearly 
dependent. 

(c) Show that 0 must be real, apart from an arbitrary multiplicative constant that may be 
complex. 

24. Consider the problem 


x 2 y" = X(xy' -y), y( 1) = 0, y( 2) = 0. 

Note that A appears as a coefficient of y' as well as of y itself. It is possible to extend 
the definition of self-adjointness to this type of problem and to show that this particular 
problem is not self-adjoint. Show that the problem has eigenvalues but that none of them 
is real. This illustrates that in general nonself-adjoint problems may have eigenvalues that 
are not real. 

Buckling of an Elastic Column. An investigation of the buckling of a uniform elastic column 
of length L by an axial load P (Figure 11.2.1a) leads to the differential equation 

y (4) + A y" = 0, 0 < x < L. (i) 

The parameter A is equal to P /El, where E is Young’s modulus and I is the moment of inertia 
of the cross section about an axis through the centroid perpendicular to the xy-plane. The 
boundary conditions at x = 0 and x = L depend on how the ends of the column are supported. 
Typical boundary conditions are 

y = y' = 0, clamped end; 
y = y" = 0, simply supported (hinged) end. 

The bar shown in Figure 11.2.1a is simply supported at x = 0 and clamped at x = L. It is 
desired to determine the eigenvalues and eigenfunctions of Eq. (i) subject to suitable boundary 
conditions. In particular, the smallest eigenvalue Ai gives the load at which the column buckles, 
or can assume a curved equilibrium position, as shown in Figure 11.2.1ft. The corresponding 
eigenfunction describes the configuration of the buckled column. Note that the differential 
equation (i) does not fall within the theory discussed in this section. It is possible to show, 
however, that in each of the cases given here all the eigenvalues are real and positive. Problems 
25 and 26 deal with column buckling problems. 


y 


y 






p 



S' ^ i . 

i 

L 

X 

L x 


x = 0 x = L 

(a) (ft) 

FIGURE 11.2.1 (a) A column under compression, (ft) Shape of the buckled column. 
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25. For each of the following boundary conditions find the smallest eigenvalue (the buckling 
load) of y (4) + ky" = 0, and also find the corresponding eigenfunction (the shape of the 
buckled column). 

(a) y(0) = y"(Q) = 0, y(L) = y"(L) = 0 

(b) y(0) = /'( 0) = 0, y(L) = y'(L) = 0 

(c) y(0) = /(0) = 0, y(L) = y'(L) = 0 

26. In some buckling problems the eigenvalue parameter appears in the boundary conditions 
as well as in the differential equation. One such case occurs when one end of the column 
is clamped and the other end is free. In this case the differential equation y <4) + ky " = 0 
must be solved subject to the boundary conditions 

y(0) = 0, y' (0) = 0, y"(L ) = 0, y"'(L) + ky'(L) = 0. 

Find the smallest eigenvalue and the corresponding eigenfunction. 

27. Solutes in an aquifer are transported by two separate mechanisms. The process by which 
a solute is transported by the bulk motion of the flowing groundwater is called advection. 
In addition, the solute is spread by small-scale fluctuations in the groundwater velocity 
along the tortuous flow paths within individual pores, a process called mechanical disper- 
sion. The one-dimensional form of the advection-dispersion equation for a nonreactive 
dissolved solute in a saturated, homogeneous, isotropic porous medium under steady, 
uniform flow is 

c, + vc x = Dc xx , 0 < x < L, t > 0, (i) 

where c(x, t) is the concentration of the solute, v is the average linear groundwater velocity, 
D is the coefficient of hydrodynamic dispersion, and L is the length of the aquifer. Suppose 
that the boundary conditions are 

c(0, t) = 0, c x (L,t) = 0, t > 0 (ii) 

and that the initial condition is 

c(x, 0)=/(x), 0 < x < L, (iii) 

where f(x) is the given initial concentration of the solute. 

(a) Assume that c(x,t) — X(x)T(t), use the method of separation of variables, and find 
the equations satisfied by X(x) and T(t), respectively. Show that the problem for X (x) 
can be written in the Sturm-Liouville form 

[p(x)X']' + kr(x)X = 0, 0 < x < L. (iv) 

A(0) = 0, X'(L ) = 0, (v) 

where p(x) = r{x) = exp(— vx/D). Hence the eigenvalues are real and the eigenfunctions 
are orthogonal with respect to the weight function r(x). 

(b) Let p? = k — (v 2 /4 D 2 ). Show that the eigenfunctions are 

X n (x) = e vxl2D sin p n x, (vi) 


where p n satisfies the equation 


tan pL = —2Dp/v. (vii) 

(c) Show graphically that Eq. (vii) has an infinite sequence of positive roots and that 
p n = (2 n — l)n/2L for large n. 
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(d) Show that 


L 



r(x)X"(x) dx 


o 


(e) Find a formal solution of the problem (i), (ii), (iii) in terms of a series of the eigen- 
functions X n (x). 

(f) Let v = 1,D — 0.5, L = 10, and/(x) = S(x — 3), where S is the Dirac delta 5 function. 
Using the solution found in part (e), plot c(x, t) versus x for several values of f, such as 
t = 0.5,l,3,6,andl0. Also plot c(x,t) versus? for several values of x. Note that the number 
of terms that are needed to obtain an accurate plot depends strongly on the values of t 
and .v. 

(g) Describe in a few words how the solution evolves as time advances. 


28. A nonreactive tracer at concentration c 0 is continuously introduced into a steady flow at 
the upstream end of a column of length L packed with a homogeneous granular medium. 
Assuming that the tracer concentration in the column is initially zero, the boundary value 
problem that models this process is 


c, + vc x — Dc xx , 0 < x < L, t > 0, 
c(0, t) — Co, c x (L,t) — 0, t > 0, 
c(x, 0) = 0, 0 < x < L, 


where c(x , t), v, and D are as in Problem 27. 

(a) Assume that c(x,t) = c 0 + u(x,t) and find the boundary value problem satisfied by 
u(x, t). 

(b) Proceeding as in Problem 27, find u(x, t ) in terms of an eigenfunction expansion. 

(c) Letu = 1,D = 0.5, c 0 = l,andL = 10. Plot c(x, t) versus x for several values of f, and 
also plot c(x, t) versus t for several values of x. 

(d) Describe in a few words how the solution evolves with time. For example, about how 
long does it take for the steady-state solution to be essentially attained? 


11.3 Nonhomogeneous Boundary Value Problems 


In this section we discuss how to solve nonhomogeneous boundary value problems 
for both ordinary and partial differential equations. Most of our attention is directed 
toward problems in which the differential equation alone is nonhomogeneous, while 
the boundary conditions are homogeneous. We assume that the solution can be 
expanded in a series of eigenfunctions of a related homogeneous problem, and then 
we determine the coefficients in this series so that the nonhomogeneous problem 
is satisfied. We first describe this method as it applies to boundary value problems 
for second order linear ordinary differential equations. Later we illustrate its use 
for partial differential equations by solving a heat conduction problem in a bar with 
variable material properties and in the presence of source terms. 


5 See Section 6.5, especially Eq. (16) of that section. 
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Nonhomogeneous Sturm-Liouville Problems. Consider the boundary value problem con- 
sisting of the nonhomogeneous differential equation 

L[y] = + q{x)y = pr(x)y + /(*), (1) 

where p is a given constant and / is a given function on 0 < x < 1, and the boundary 
conditions 

«ry( 0 ) + a 2 /( 0 ) = 0 , fay(l) + fe/CD = 0 . ( 2 ) 


As in Section 11.2, we assume that p , p' , q , and r are continuous on 0 < x < 1 and 
that p{x) > 0 and r(x) > 0 there. We will solve the problem (1), (2) by making use 
of the eigenfunctions of the corresponding homogeneous problem consisting of the 
differential equation 

L[y] = Xr{x)y (3) 

and the boundary conditions (2). Let Xi < X 2 < • ■ ■ < < • • • be the eigenvalues of 

this problem, and let <pi , . be the corresponding normalized eigenfunc- 
tions. 

We now assume that the solution y = cpix) of the nonhomogeneous problem (1), 
(2) can be expressed as a series of the form 

OO 

fix) = ^b n <p„(x). (4) 

n = 1 


From Eq. (34) of Section 11.2 we know that 


b 


n 



r(x)(j)(x)<p n (x) dx. 


n= 1,2, .... 


( 5 ) 


Flowever, since we do not know (p(x), we cannot use Eq. (5) to calculate b n . Instead, 
we will try to determine b n so that the problem (1), (2) is satisfied and then use 
Eq. (4) to find cp ( x ). Note first that cp as given by Eq. (4) always satisfies the boundary 
conditions (2) since each (p n does. 

Now consider the differential equation that <p must satisfy. This is just Eq. (1) with 
y replaced by (p: 

L[(p](x ) = fir{x)<p(x) + f(x). (6) 


We substitute the series (4) into the differential equation (6) and attempt to determine 
b n so that the differential equation is satisfied. The term on the left side of Eq. (6) 
becomes 


LWix) = L 


OO 


-n=l 


00 

(x) = ^2b n L[(/) n ](x) 

n = 1 


00 

= ^ b n X. n r(x)4> n (x), (7) 

22=1 

where we have assumed that we can interchange the operations of summation and 
differentiation. 

Note that the function r appears in Eq. (7) and also in the term ju'(x)(j)(x) inEq. (6). 
This suggests that we rewrite the nonhomogeneous term in Eq. (6) as r(x)\f(x)/r(x)] 
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so that r(x ) also appears as a multiplier in this term. If the function f /r satisfies the 
conditions of Theorem 11.2.4, then 

fix) °° 

'— = T 1 c n<Pn(x), (8) 

r(x) ^ 

n= 1 

where, using Eq. (5) with </> replaced by f/r, 

C n = f r(x)’^-(/)n(x) dx — [ f(x)<p n {x) dx, n= 1,2,.... (9) 

Jo r(x) Jo 

Upon substituting for cp(x), L[(p](x), and f(x) in Eq. (6) from Eqs. (4), (7), and (8), 
respectively, we find that 

OO OO OO 

^ b n \ n r(x)<p n (x) = iir(x) ^ b n 4> n (x) + r(x) ^ c„0„(x). 

n= 1 n= 1 n= 1 

After collecting terms and canceling the common nonzero factor r(x), we have 

OO 

[(X n - n)b n - C n \(p„(x) = 0. (10) 

72=1 

If Eq. (10) is to hold for each x in the interval 0 < x < 1, then the coefficient of r/;,, (x) 
must be zero for each n; see Problem 14 for a proof of this fact. Hence 

0-n - d)bn - C n = o, n= 1 , 2 ,.... ( 11 ) 


We must now distinguish two main cases, one of which also has two subcases. 

First suppose that fi ^ X„ for n = 1, 2, 3, ... ; that is, p is not equal to any eigenvalue 
of the corresponding homogeneous problem. Then 

b n = - — , n — 1,2,3, ... , (12) 

X n ~ li 

and 


y = (j>(x) = — </>„(x). (13) 

Equation (13), with c„ given by Eq. (9), is a formal solution of the nonhomogeneous 
boundary value problem (1), (2). Our argument does not prove that the series (13) 
converges. However, any solution of the boundary value problem (1), (2) clearly 
satisfies the conditions of Theorem 11.2.4; indeed, it satisfies the more stringent con- 
ditions given in the paragraph following that theorem. Thus it is reasonable to expect 
that the series (13) does converge at each point, and this fact can be established, pro- 
vided, for example, that / is continuous. 

Now suppose that q is equal to one of the eigenvalues of the corresponding ho- 
mogeneous problem, say, q = then the situation is quite different. In this event, 
for n = m , Eq. (11) has the form 0 • b m — c m — 0. Again we must consider two cases. 

If q = k m and c m / 0, then there is no value of b m that satisfies Eq, (11), and 
therefore the nonhomogeneous problem (1), (2) has no solution. 

If /x = X m and c m = 0, then Eq. (11) is satisfied regardless of the value of b m \ in 
other words, b m remains arbitrary. In this case the boundary value problem (1), (2) 
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Theorem 11.3.1 


Theorem 11.3.2 


EXAMPLE 

1 


does have a solution, but it is not unique, since it contains an arbitrary multiple of 
the eigenfunction (p m . 

Since c m is given by Eq. (9), the condition c m = 0 means that 

f(x)<p m (x) dx = 0. (14) 

Thus, if n — X m , the nonhomogeneous boundary value problem (1), (2) can be solved 
only if / is orthogonal to the eigenfunction corresponding to the eigenvalue X m . 

The results we have formally obtained are summarized in the following theorem. 


The nonhomogeneous boundary value problem (1), (2) has a unique solution for 
each continuous / whenever /x is different from all the eigenvalues of the corre- 
sponding homogeneous problem; the solution is given by Eq. (13), and the series 
converges for each x in 0 < x < 1. If /x is equal to an eigenvalue X m of the corre- 
sponding homogeneous problem, then the nonhomogeneous boundary value prob- 
lem has no solution unless / is orthogonal to </> m , that is, unless the condition (14) 
holds. In that case, the solution is not unique and contains an arbitrary multiple of 
<Pm(x). 


The main part of Theorem 11.3.1 is sometimes stated in the following way: 


For a given value of /x, either the nonhomogeneous problem (1), (2) has a unique 
solution for each continuous / (if /x is not equal to any eigenvalue X m of the corre- 
sponding homogeneous problem), or else the homogeneous problem (3), (2) has a 
nontrivial solution (the eigenfunction corresponding to X m ). 


This latter form of the theorem is known as the Fredholm 6 alternative theorem. 
This is one of the basic theorems of mathematical analysis and occurs in many differ- 
ent contexts. You may be familiar with it in connection with sets of linear algebraic 
equations where the vanishing or nonvanishing of the determinant of coefficients 
replaces the statements about /x and X m . See the discussion in Section 7.3. 


Solve the boundary value problem 


y" + 2y = -x, 

y(0) = 0, y(l)+/(l) = 0. 


(15) 

( 16 ) 


6 The Swedish mathematician Erik Ivar Fredholm (1866-1927), professor at the University of Stockholm, 
established the modern theory of integral equations in a fundamental paper in 1903. Fredholm's work 
emphasized the similarities between integral equations and systems of linear algebraic equations. There 
are also many interrelations between differential and integral equations; for example, see Section 2.8 and 
Problem 22 of Section 6.6. 
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This particular problem can be solved directly in an elementary way and has the solution 

sini/2x x 

^ sin s/2 + s/2 cos s/2 2 

The method of solution described below illustrates the use of eigenfunction expansions, a 
method that can be employed in many problems not accessible by elementary procedures. 
We begin by rewriting Eq. (15) as 

-y" = 2y + x (18) 

so that it will have the same form as Eq. (1). We seek the solution of the given problem as a 
series of normalized eigenfunctions <p„ of the corresponding homogeneous problem 

y" + Xy = 0, 3/(0) = 0, y(l) +/(1) = 0. (19) 


These eigenfunctions were found in Example 2 of Section 11.2 and are 

<j> n (x) = k n sin-y/k^x, 

where 


k n — 


1/2 


, 1 + cos 2 s/K / 

and X n satisfies 

sin s[K + sfK cos s[K = 0. 

Recall that in Example 1 of Section 11.1 we found that 

Xi £4.116, X 2 = 24.14, 

X3 = 63.66, X n = (2n — l) 2 7r 2 /4 for n = 4, 5, 


( 20 ) 


( 21 ) 


( 22 ) 


We assume that y is given by Eq. (4) 

OO 

y = 'Y^ l b n <t>„(x), 

n = 1 


and it follows that the coefficients b n are found from Eq. (12) 


b 


n — 


C n 


K- 2 ’ 


where the c„ are the expansion coefficients of the nonhomogeneous term/(.r) = x in Eq. (18) 
in terms of the eigenfunctions 4>„. These coefficients were found in Example 3 of Section 11.2 
and are 


2s/2 sin s/Ki 
k„(l + cos 2 s/K/ 12 ' 


(23) 


Putting everything together, we finally obtain the solution 


OO 



sin s/K, 

2)(1 + cos 2 s/K) 


sin s/Xn x. 


(24) 


Although Eqs. (17) and (24) are quite different in appearance, they are actually two different 
expressions for the same function. This follows from the uniqueness part of Theorem 11.3.1 
or 11.3.2 since X = 2 is not an eigenvalue of the homogeneous problem (19). Alternatively, 
you can show the equivalence of Eqs. (17) and (24) by expanding the right side of Eq. (17) 
in terms of the eigenfunctions <j>„(x). For this problem it is fairly obvious that Eq. (17) is a 
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more convenient expression for the solution than Eq. (24). However, we emphasize again that 
in other problems we may not be able to obtain the solution except by series (or numerical) 
methods. 


Nonhomogeneous Heat Conduction Problems. To show how eigenfunction expansions can 
be used to solve nonhomogeneous problems for partial differential equations, let us 
consider the generalized heat conduction equation 

r(x)u t = [p(x)u x ] x - q(x)u + F(x, t) (25) 

with the boundary conditions 

u x (0, t) — hiu(0, t) = 0, u x (l, t) + t) — 0 (26) 

and the initial condition 

u(x,0) = /(*). (27) 

This problem was previously discussed in Appendix A of Chapter 10 and in Section 
11.1. In the latter section we let u(x,t ) = X(x)T(t) in the homogeneous equation 
obtained by setting F{x,t) = 0, and showed that X{x) must be a solution of the 
boundary value problem 

— [p(x)X’y + q(x)X — Xr(x)X, (28) 

X'(0) - hiX(0) = 0, X'(l) + h 2 X( 1) = 0. (29) 

If we assume that p , q. and r satisfy the proper continuity requirements and that p (x) 
and r(x) are always positive, the problem (28), (29) is a Sturm-Liouville problem as 
discussed in Section 11.2. Thus we obtain a sequence of eigenvalues A.i < X 2 < ■ ■ ■ < 

X n < • • • and corresponding normalized eigenfunctions <pi (x ) , <p 2 (x) , . . . ,cp n (x), 

We will solve the given nonhomogeneous boundary value problem (25) to (27) by 
assuming that u(x, t ) can be expressed as a series of eigenfunctions 

OO 

u(x,t ) = b n {t)<p n (x ), (30) 

n = 1 

and then showing how to determine the coefficients b n {t). The procedure is basically 
the same as that used in the problem (1), (2) considered earlier, although it is more 
complicated in certain respects. For instance, the coefficients b„ must now depend 
on t, because otherwise u would be a function of x only. Note that the boundary 
conditions (26) are automatically satisfied by an expression of the form (30) because 
each <p n (x) satisfies the boundary conditions (29). 

Next we substitute from Eq. (30) for u in Eq. (25). From the first two terms on the 
right side of Eq. (25) we formally obtain 


[p(x)u x ] x - q(x)u = 


9 

dx 


OO 

p(x)'Y / bn(t)<P' n ( x ) 

n = 1 


oo 

q(x) Y bn(t)4>n(x) 
n = 1 


oo 

n = 1 


( 31 ) 
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Since [p(x)(p' n (x)]' — q(x)cp n (x) — —X n r(x)(f> n (x), we obtain finally 

OO 

[p(x)u x ] x - q{x)u = -r{x) Y b n (t)\ n <p n (x). 


(32) 


n= 1 


Now consider the term on the left side of Eq. (25). We have 


9 


(x)u, = r(jc) — Y b„(t)<p n {x) 


n = 1 


= r(x) Y b' n (t)(p n (x). 


(33) 


n = 1 


We must also express the nonhomogeneous term in Eq. (25) as a series of eigen- 
functions. Once again, it is convenient to look at the ratio F(x, t) /r(x) and to write 


Fix, t ) 
r(x) 

where the coefficients are given by 


= Y yn^nix). 


(34) 


n = 1 


F(X, t ) 

Yn(t)= r{x) (pn(x)dx 


=f< 

= f Fix , t)(f)„ 
Jo 


ix) dx, n = 1 , 2 ,.... 


(35) 


Since Fix, t) is given, we can consider the functions y n (t) to be known. 

Gathering all these results together, we substitute from Eqs. (32), (33), and (34) in 
Eq. (25), and find that 

OO OO OO 

rix ) Y b' n (f)<Pn(x) = - rix ) Y b n {t)\„<p n ix ) + r(x) Y Ynit)<f>nix). (36) 


n= 1 


n=l 


n = 1 


To simplify Eq. (36), we cancel the common nonzero factor rix ) from all terms and 
write everything in one summation: 

OO 

Y \ b 'n W + k nbn(t) ~ YniO](pnix) = 0. (37) 

n = 1 


Once again, if Eq. (37) is to hold for all x in 0 < x < 1, it is necessary for the quantity 
in square brackets to be zero for each n (again see Problem 14). Hence h„(f) is a 
solution of the first order linear ordinary differential equation 

b' n (t) + X n bn(t) = y n (t), n = l,2,..., (38) 

where y„(t) is given by Eq. (35). To determine b„it) completely, we must have an 
initial condition 


MO) = B n 


77 = 1 , 2 , . . . 


(39) 
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for Eq. (38). This we obtain from the initial condition (27). Setting t = 0 in Eq. (30) 
and using Eq. (27), we have 


u(x, 0) = ^ b n (0)<j> n (x) = ^2 B n <j> n {x) = f{x). (40) 

n = 1 n = 1 

Thus the initial values B n are the coefficients in the eigenfunction expansion for fix). 
Therefore 

B„— f r(x)fix)<p n (x)dx, n= 1,2,.... (41) 

Jo 

Note that everything on the right side of Eq. (41) is known, so we can consider B n as 
known. 

The initial value problem (38), (39) is solved by the methods of Section 2.1. The 
integrating factor is pit) = exp (A.„f), and it follows that 

b n it) = B n e~ knt + f e~ K(t ~ s) y n is)ds, n = 1,2 (42) 

Jo 

The details of this calculation are left to you. Note that the first term on the right side 
of Eq. (42) depends on the function / through the coefficients B„ , while the second 
depends on the nonhomogeneous term F through the coefficients y„(s). 

Thus an explicit solution of the boundary value problem (25) to (27) is given by 
Eq. (30) 

OO 

u(x,t) = Y2b n (t)4>„(x), 

n= 1 

where the coefficients h„(f) are determined from Eq. (42). The quantities B n and 
y n (s) in Eq. (42) are found in turn from Eqs. (41) and (35), respectively. 

Summarizing, to use this method to solve a boundary value problem such as that 
given by Eqs. (25) to (27), we must 

1. Find the eigenvalues X n and the normalized eigenfunctions of the homogeneous prob- 
lem (28), (29). 

2. Calculate the coefficients B n and y„it) from Eqs. (41) and (35), respectively. 

3. Evaluate the integral in Eq. (42) to determine b n it). 

4. Sum the infinite series (30). 

Since any or all of these steps may be difficult, the entire process can be quite 
formidable. One redeeming feature is that often the series (30) converges rapidly, in 
which case only a very few terms may be needed to obtain an adequate approximation 
to the solution. 


Find the solution of the heat conduction problem 

u t — u xx + xe~‘, 

«( 0, t) = 0, u x { 1 , t) + m(1, t) = 0, 

«(.r,0) = 0. 


(43) 

(44) 

(45) 
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Again we use the normalized eigenfunctions <p n of the problem (19) and assume that u is given 
by Eq. (30) 

OO 

u(x,t) = y^ £ b n (t)(/) n (x). 

n = 1 

The coefficients b n are determined from the differential equation 

b' n + k n b n = Yn(t), (46) 

where X n is the nth eigenvalue of problem (19) and y„(f) is the nth expansion coefficient of the 
nonhomogeneous term xe~‘ in terms of the eigenfunctions 4> n . Thus we have 

y„(f) = / x e~‘ 4> n (x) dx = e~‘ I xcf> n (x) dx 

Jo Jo 


= c n e~‘, (47) 

where c n = x<p„(x) dx is given by Eq. (23). The initial condition for Eq. (46) is 
Jo 

b n (0) = 0 (48) 

since the initial temperature distribution (45) is zero everywhere. The solution of the initial 
value problem (46), (48) is 

r‘ e (A„-l)t _ i 

b n (t) = e~ x "‘ / e x " s c n e~ s ds = c n e~ k "‘ — — 

JO 1 


= (49) 

Thus the solution of the heat conduction problem (43) to (45) is given by 

, ^ (sin^AnHe - ' -e -A "')sinV^* 

u(x,t) = 4> — — . (50) 

/„a„-l)(l + cos 2 V^) 

The solution given by Eq. (50) is exact but complicated. To judge whether a satisfactory 
approximation to the solution can be obtained by using only a few terms in this series, we must 
estimate its speed of convergence. First we split the right side of Eq. (50) into two parts: 


OO 

u(x,t) = 4e~‘ Y 

n=l 


sinv^ sinv^* 

k n (k n — 1)(1 + COS 2 -s/^n) 


4E 


e sin VKi sin *J\~ n x 
k n (k n - 1)0 + cos 2 Va„) ' 


(51) 


Recall from Example 1 in Section 11.1 that the eigenvalues X n are very nearly proportional to 
n 2 . In the first series on the right side of Eq. (51) the trigonometric factors are all bounded 

OO OO 

as n — > oo', therefore this series converges similarly to the series K? or 11 Hence at 

n = 1 n = 1 

most two or three terms are required for us to obtain an excellent approximation to this part 
of the solution. The second series contains the additional factor e~ 2n ‘, so its convergence is 
even more rapid for t > 0; all terms after the first are almost surely negligible. 


Further Discussion. Eigenfunction expansions can be used to solve a much greater va- 
riety of problems than the preceding discussion and examples may suggest. For 
example, time-independent nonhomogeneus boundary conditions can be handled 
much as in Section 10.6. To reduce the problem to one with homogeneous boundary 
conditions, subtract from u a function v that is chosen to satisfy the given boundary 
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conditions. Then the difference w = u — v satisfies a problem with homogeneous 
boundary conditions, but with a modified forcing term and initial condition. This 
problem can be solved by the procedure described in this section. 

One potential difficulty in using eigenfunction expansions is that the normalized 
eigenfunctions of the corresponding homogeneous problem must be found. For a 
differential equation with variable coefficients this may be difficult, if not impossible. 
In such a case it is sometimes possible to use other functions, such as eigenfunctions 
of a simpler problem, that satisfy the same boundary conditions. For instance, if the 
boundary conditions are 


m(0,0 = 0, u(l,t) — 0, (52) 

then it may be convenient to replace the functions cp n (x) in Eq. (30) by sin nnx. These 
functions at least satisfy the correct boundary conditions, although in general they 
are not solutions of the corresponding homogeneous differential equation. Next we 
expand the nonhomogeneous term F(x, t) in a series of the form (34), again with 
<p n (x) replaced by sin htcx, and then substitute for both u and F in Eq. (25). Upon 
collecting the coefficients of sinnjrx for each n, we have an infinite set of linear first 
order differential equations from which to determine b\{t), b 2 (t ), .... The essential 
difference between this case and the one considered earlier is that now the equations 
for the functions b n (f) are coupled. Thus they cannot be solved one by one, as before, 
but must be dealt with simultaneously. In practice, the infinite system is replaced 
by an approximating finite system, from which approximations to a finite number of 
coefficients are calculated. 

Boundary value problems for equations of higher than second order can also often 
be solved by eigenfunction expansions. In some cases the procedure parallels almost 
exactly that for second order problems. However, a variety of complications can also 
arise. 

Finally, we emphasize that the discussion in this section has been purely formal. 
Separate and sometimes elaborate arguments must be used to establish convergence 
of eigenfunction expansions or to justify some of the steps used, such as term-by-term 
differentiation of eigenfunction series. 

There are also other, altogether different, methods for solving nonhomogeneous 
boundary value problems. One of these leads to a solution expressed as a definite 
integral rather than as an infinite series. This approach involves certain functions 
known as Green’s functions and, for ordinary differential equations, is the subject of 
Problems 28 through 36. 


PROBLEMS 


In each of Problems 1 through 5 solve the given problem by means of an eigenfunction ex- 
pansion. 


1. 

y" 

+ 

■U 

ii 

— . X, 

y(0) = 

0, 

yd) = o 

2. 

y" 

+ 

ii 

— . X, 

y(0) = 

0, 

y'(l) = 0; see S 

3. 

y" 

+ 

ii 

-X, 

/(0) = 

= 0, 

/(l) = 0; see 1 

4. 

y" 

+ 

ii 

-X, 

y'(0) = 

= 0, 

y'(l) +y(l) = 0; 

5. 

y" 

+ 

ii 

-1 + 11 

-2x1, 


y(0) = 0, y(l) = 


see Section 11.2, Problem 11. 
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In each of Problems 6 through 9 determine a formal eigenfunction series expansion for the 
solution of the given problem. Assume that/ satisfies the conditions of Theorem 11.3.1. State 
the values of /x for which the solution exists. 


6. 

y" + i^y = 

-fix), 

o 

II 

o 

y'(i) = o 

7. 

y" + w = 

-fix). 

y'(0) = o, 

y( 1) = o 

8. 

y" + m.v = 

-fix). 

y'(0) = o, 

y'i i) = o 

9. 

y" + m.v = 

-fix), 

y'(0) = o, 

y'(l) + y(i) = o 


In each of Problems 10 through 13 determine whether there is any value of the constant a for 
which the problem has a solution. Find the solution for each such value. 

10. y" + n 2 y = a + x, _y(0) = 0, y(l) = 0 

11. y" + 4jr 2 _y = a + x, y(0) = 0, y{ 1) = 0 

12. y" + n 2 y = a, /(0) = 0, y'(l) = 0 

13. y" + n 2 y = a — cosnx, y(0) = 0, y(l) = 0 

14. Let <pi , . . . , 4> n , . . . be the normalized eigenfunctions of the differential equation (3) subject 

OO 

to the boundary conditions (2). If c n <p„(x) converges to/(x), where /(jt) = 0 for each x 

n = 1 

in 0 < x < 1, show that c n = 0 for each n. 

Hint: Multiply by r(x)<j) m {x), integrate, and use the orthogonality property of the eigen- 
functions. 

15. Let L be a second order linear differential operator. Show that the solution y = <p(x) of 
the problem 

L[y\ = f(x), 

ffiyfO) + a 2 /(0) = a, Piyil) + PitfO) = b 
can be written as y = u + v, where it = cf>i(x) and v = 4>i(x) are solutions of the problems 

L[n] = 0, 

tyi!<(0) + oiiu'i 0) = a, /iu(l) + / 2 H, ( 1 ) — b 

and 

L[u] — fix), 

«i w(0) + a 2 v'(0) = 0, Piv(l) + fav'il) = 0, 

respectively. 

16. Show that the problem 

y" + 7t 2 y = 7i 2 x, y(0) = 1, y( 1) = 0 

has the solution 

y = C\ sin nx + cos nx + x. 

Also show that this solution cannot be obtained by splitting the problem as suggested in 
Problem 15, since neither of the two subsidiary problems can be solved in this case. 

17. Consider the problem 


/ + P(x)y' + q(x)y = 0, 


y(0) = a, yH) = b. 
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Let y = u + v, where v is any twice differentiable function satisfying the boundary con- 
ditions (but not necessarily the differential equation). Show that u is a solution of the 
problem 

u" + p(x)u' + q(x)u = g(x), z/(0) = 0, zz(l) = 0, 

where g(x ) = —\v" + p(x) v' + q(x)v] and is known once v is chosen. Thus nonhomo- 
geneities can be transferred from the boundary conditions to the differential equation. 
Find a function v for this problem. 

18. Using the method of Problem 17, transform the problem 

y" + 2y = 2 — 4x, y( 0) = 1, y(l) + /(l) = -2 

into a new problem in which the boundary conditions are homogeneous. Solve the latter 
problem by reference to Example 1 of the text. 

In each of Problems 19 through 22 use eigenfunction expansions to find the solution of the 
given boundary value problem. 

19. u, = u xx — x, u(0,t) = 0 , u x (l,t) — 0, u(x,0) — sin(7rx/2); 

see Problem 2. 

20. u, = Uxx + e - ', u x ( 0, t) = 0, n x (1 , t) + u(l, t ) = 0, iz(x, 0) = 1 — x; 
see Section 11.2, Problems 10 and 12. 

21. u, = u xx + 1 — |1 — 2x|, u(0, t) = 0, m( 1,/) = 0, t/(x,0) = 0; 

see Problem 5. 

22. u, = Uxx + e~‘(l — x), z/(0,r) = 0 , u x (l.t) = 0, m(x, 0) = 0; 

see Section 11.2, Problems 6 and 7. 

23. Consider the boundary value problem 

r{x)u, = \p{x)Ux\ x - q(x)u + F(x), 
u(0 ,t) = Ti, u(l,t) = T 2 , m(x, 0) = / (x) . 

(a) Let t>(x) be a solution of the problem 

Ip(x)ij')' - q(x)v = —F(x), u(0) = T\, u(l) = T 2 . 

If w(x,t ) = u(x,t) — v(x), find the boundary value problem satisfied by w. Note that this 
problem can be solved by the method of this section. 

(b) Generalize the procedure of part (a) to the case where u satisfies the boundary con- 
ditions 

u x (0,t) — h\u{Q, t) = T\, Mi(l,0 +h 2 u(l,t) = T 2 . 


In each of Problems 24 and 25 use the method indicated in Problem 23 to solve the given 
boundary value problem. 

24. u, = Uxx ~ 2, 25. u, = Uxx ~ nl cos nx, 

i/(0, t) = l, m(1,0 = 0, u x (().t) = 0, h(1, 0 = 1, 

iz(x, 0) = x 2 — 2x + 2 zz(x, 0) = cos(37rx/2) — cos jrx 

26. The method of eigenfunction expansions is often useful for nonhomogeneous problems 
related to the wave equation or its generalizations. Consider the problem 


r(x)u tl — [p(x)Ux\ x — q{x)u + F(x,t), 

(i) 

u x (0, 0 — h] zz(0, 0 = 0, 

z/ v (l,0 + h 2 u(l,t) = 0, 

(ii) 

u(x, 0) =/(x), 

W/(x,0) = g(x). 

(hi) 
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This problem can arise in connection with generalizations of the telegraph equation (Prob- 
lem 16 in Section 11.1) or the longitudinal vibrations of an elastic bar (Problem 25 in 
Section 11.1). 

(a) Let u(x,t ) = X(x)T{t) in the homogeneous equation corresponding to Eq. (i), and 
show that X{x) satisfies Eqs. (28) and (29) of the text. Let X n and <f>„(x ) denote the 
eigenvalues and normalized eigenfunctions of this problem. 

OO 

(b) Assume that u(x,t) — b n (t)<j>„(x ), and show that b n (t) must satisfy the initial value 

n = 1 

problem 

K (f ) + K b n (0 = Yn (0 , b n (0) = a n , b' n (0) = /3 n , 

where a n , /S„, and y„(f) are the expansion coefficients for f(x), g(x), and F(x,t)/r(x) in 
terms of the eigenfunctions ij>i(x), ... .<f> n {x). 

27. In this problem we explore a little further the analogy between Sturm-Liouville bound- 
ary value problems and Hermitian matrices. Let A be an n x n Hermitian matrix with 
eigenvalues Ai, . . . , X„ and corresponding orthogonal eigenvectors £ (1) ,...,§ (n) . 

Consider the nonhomogeneous system of equations 

Ax — fix = b, (i) 


where /r is a given real number and b is a given vector. We will point out a way of solving 
Eq. (i) that is analogous to the method presented in the text for solving Eqs. (1) and (2). 

(a) Show that b = b&®, where ft, = (b,§ (1) ). 

i = 1 


(b) Assume that x = ^ a^ tn and show that for Eq. (i) to be satisfied, it is necessary that 

(=i 

a, = bi/(ki — fi). Thus 


x = E 


(M t0 ) 

h - M S 


(ii) 


provided that fi is not one of the eigenvalues of A A, for i = 1 Compare this 
result with Eq. (13). 

Green’s 7 Functions. Consider the nonhomogeneous system of algebraic equations 


Ax — /rx = b. 


(i) 


where A is an n x n Hermitian matrix, /r is a given real number, and b is a given vector. Instead 
of using an eigenvector expansion as in Problem 27, we can solve Eq. (i) by computing the 
inverse matrix (A — which exists if /r is not an eigenvalue of A. Then 

x = (A — /rl) _1 b. (ii) 


7 Green's functions are named after George Green (1793-1841) of England. He was almost entirely self- 
taught in mathematics and made significant contributions to electricity and magnetism, fluid mechanics, 
and partial differential equations. His most important work was an essay on electricity and magnetism 
that was published privately in 1828. In this paper Green was the first to recognize the importance of 
potential functions. He introduced the functions now known as Green’s functions as a means of solving 
boundary value problems and developed the integral transformation theorems, of which Green’s theorem 
in the plane is a particular case. However, these results did not become widely known until Green’s essay 
was republished in the 1850s through the efforts of William Thomson (Lord Kelvin). 
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Problems 28 through 36 indicate a way of solving nonhomogeneous boundary value problems 
that is analogous to using the inverse matrix for a system of linear algebraic equations. The 
Green’s function plays a part similar to the inverse of the matrix of coefficients. This method 
leads to solutions expressed as definite integrals rather than as infinite series. Except in 
Problem 35, we will assume that fj. = 0 for simplicity. 


28. (a) Show by the method of variation of parameters that the general solution of the dif- 
ferential equation 


-y"=m 


can be written in the form 

y = (p(x) = Cl + C2X —I (x — s)f(s) ds, 

Jo 


where ci and C 2 are arbitrary constants. 

(b) Let y = <p(x) also be required to satisfy the boundary conditions y( 0) = 0, y(l) = 0. 
Show that in this case 

Ci = 0, c 2 = ( (1 — s)f(s) ds. 

Jo 

(c) Show that, under the conditions of parts (a) and (b), <j>(x) can be written in the form 

<p(x) = J s(l — x)f(s) ds + J x(l — s)f(s) ds. 

(d) Defining 


G(x, s) 


{s(l — x), 0 < 5 < X, 
x(l — s). x < s < 1, 


show that the solution can be written as 

4>(x) = f G(x,s)f(s)ds. 
Jo 


The function G(x , .v) appearing under the integral sign is a Green’s function. The usefulness 
of a Green’s function solution rests on the fact that the Green’s function is independent of 
the nonhomogeneous term in the differential equation. Thus, once the Green’s function 
is determined, the solution of the boundary value problem for any nonhomogeneous 
term /(x) is obtained by a single integration. Note further that no determination of 
arbitrary constants is required, since <p (x) as given by the Green’s function integral formula 
automatically satisfies the boundary conditions. 


29. By a procedure similar to that in Problem 28 show that the solution of the boundary value 
problem 

-O'" + y) = fix), v(0) = 0, y(l) = 0 


is 


where 


y ~ <P(x) = 


f G(x,s)f(s ) 
Jo 


ds , 


G(x,s) = 


sins sin(l — x) 
sinl 

sinx sin(l — s) 


, 0 < s < x, 

, x < s < 1. 


sinl 
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30. It is possible to show that the Sturm-Liouville problem 

L\y\ = -\p(x)y f Y + q(x)y = fix), 


a iy (0) + a 2 y\ 0) = 0, fty(l) + fcy'O) = 0 


(ii) 


has a Green’s function solution 



J o 


(hi) 


provided that X = 0 is not an eigenvalue of L[y ] = Xy subject to the boundary conditions 
(ii). Further, G(x,s) is given by 


■yi{s)y 2 {x)lp{x)W{ yi ,y 2 ){x), 0 < s < x, 

y\(x)y 2 (s) /p(x)W(y u y 2 )(x), x < s < 1, 


(iv) 


G(x,s) — 


where y\ is a solution of L[y] = 0 satisfying the boundary condition at x = 0, y 2 is a solution 
of L[y ] = 0 satisfying the boundary condition at x — 1, and Wiyi,y 2 ) is the Wronskian of 
yi and y 2 . 

(a) Verify that the Green’s function obtained in Problem 28 is given by formula (iv). 

(b) Verify that the Green’s function obtained in Problem 29 is given by formula (iv). 

(c) Show that p(x)W(yi,y 2 )(x) is a constant by showing that its derivative is zero. 

(d) Using Eq. (iv) and the result of part (c), show that G(x, s) = G(s,x). 

(e) Verify that y = <j> (x) from Eq. (iii) with G(x, s) given by Eq. (iv) satisfies the differential 
equation (i) and the boundary conditions (ii). 


In each of Problems 3 1 through 34 solve the given boundary value problem by determining the 
appropriate Green’s function and expressing the solution as a definite integral. Use Eqs. (i) 
to (iv) of Problem 30. 

31. —y"=f(x), y'( 0) = 0, y(l) = 0 

32. -y" = f(x), y(0) = 0, y( 1) + /( 1) = 0 

33. -iy" + y) = fix), /( 0) = 0, yi 1) = 0 

34. —y" = fix), y(0) = 0, y'(l) = 0 

35. Consider the boundary value problem 


(i) 

(ii) 


L[y] = -\pix)y'f + qix)y = jirix)y + fix), 

otiyiO) + a 2 y'i 0) = 0, fiyil) + /S 2 y (1) = 0. 


According to the text, the solution y = fix) is given by Eq. (13), where c„ is defined by 
Eq. (9), provided that p. is not an eigenvalue of the corresponding homogeneous problem. 
In this case it can also be shown that the solution is given by a Green’s function integral 
of the form 



(iii) 


Note that in this problem the Green’s function also depends on the parameter fi. 
(a) Show that if these two expressions for fix) are to be equivalent, then 
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where A, and ipi are the eigenvalues and eigenfunctions, respectively, of Eqs. (3), (2) of the 
text. Again we see from Eq. (iv) that fi cannot be equal to any eigenvalue X t . 

(b) Derive Eq. (iv) directly by assuming that G(.r,s, /r) has the eigenfunction expansion 

OO 

G{x,s,p) = (v) 

;=i 

Determine n,(x,/x) by multiplying Eq. (v) by r(s)<pj(s) and integrating with respect to s 
from s = 0 to s = 1. 

Hint: Show first that X t and 4> t satisfy the equation 

4>i(x) = (Xi — /r) / G(x,s, n)r(s)4>j(s) ds. (vi) 

Jo 

36. Consider the boundary value problem 

— d 2 y/ds 2 = 8(s — x), y(0) = 0, y(l) = 0, 

where s is the independent variable, s = x is a definite point in the interval 0 < s < 1, and 
8 is the Dirac delta function (see Section 6.5). Show that the solution of this problem is 
the Green’s function G(x,s) obtained in Problem 28. 

In solving the given problem, note that 5 (s — x) = 0 in the intervals 0 < s < x and 
x < s < 1. Note further that — dy/dsexperiencesajumpofmagnitudelasjpasses through 
the value x. 

This problem illustrates a general property, namely, that the Green’s function G(x, s) can 
be identified as the response at the point s to a unit impulse at the point x. A more general 
nonhomogeneous term / on 0 < x < 1 can be regarded as a continuous distribution of 
impulses with magnitude /(x) at the point x. The solution of a nonhomogeneous boundary 
value problem in terms of a Green’s function integral can then be interpreted as the result 
of superposing the responses to the set of impulses represented by the nonhomogeneous 
term/(x). 


11.4 Singular Sturm-Liouville Problems 

In the preceding sections of this chapter we considered Sturm-Liouville boundary 
value problems: the differential equation 

L\y] = ~[p(x)y']' + q(x)y = Xr(x)y, 0 < x < 1, (1) 

together with boundary conditions of the form 

aiy(0) + a 2 y'( 0) = 0, (2) 

PiyO) + Shy’ ( 1 ) = 0 . ( 3 ) 

Until now, we have always assumed that the problem is regular; that is, p is differen- 
tiable, q and r are continuous, and p(x) > 0 and r(x) > 0 at all points in the closed 
interval. However, there are also equations of physical interest in which some of 
these conditions are not satisfied. 
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For example, suppose that we wish to study Bessel's equation of order v on the 
interval 0 < x < 1. This equation is sometimes written in the form 8 

v 2 

-{xy')' + —y = Xxy (4) 

so that p(x) = x, q(x ) = v 2 /x, and r(x) = x. Thus p( 0) = 0, r(0) = 0, and q(x) is un- 
bounded and hence discontinuous as x -» 0. Flowever, the conditions imposed on 
regular Sturm-Liouville problems are met elsewhere in the interval. 

Similarly, for Legendre’s equation we have 

-[(1 - x 2 )y']’ = Xy, —1 <x<1, (5) 

where X = a(a + 1 ),p(x) — 1 — x 2 , q( x) = 0, and r(x) = 1. Here the required condi- 
tions on p, q, and r are satisfied in the interval 0 < x < 1 except at x = 1, where p is 
zero. 

We use the term singular Sturm-Liouville problem to refer to a certain class of 
boundary value problems for the differential equation ( 1) in which the functions p. q, 
and r satisfy the conditions stated earlier on the open interval 0 < x < 1, but at least 
one of these functions fails to satisfy them at one or both of the boundary points. 
We also prescribe suitable separated boundary conditions of a kind to be described 
in more detail later in this section. Singular problems also occur if the interval is 
unbounded, for example, 0 < x < oo. We do not consider this latter kind of singular 
problem in this book. 

As an example of a singular problem on a finite interval, consider the equation 

xy" + / + Xxy — 0, (6) 


or 

~(xy')' = Xxy , 


( 7 ) 


on the interval 0 < x < 1, and suppose that X > 0. This equation arises in the study of 
free vibrations of a circular elastic membrane and is discussed further in Section 11.5. 
If we introduce the new independent variable t defined by t = ~JXx, then 

dy dy d 2 y d 2 y 

dx dt ’ dx 2 dt 2 

Hence Eq. (6) becomes 


t 

Vx 


d 2 y /— dy t 
X —pr + \fx — — b X—=y = 0, 
dt 2 dt ^fx 


or, if we cancel the common factor «jx in each term, 



dy 

dt 


+ ty = 0. 


( 8 ) 


Equation (8) is Bessel’s equation of order zero (see Section 5.7). The general solution 
of Eq. (8) for t > 0 is 

y = ciJo(t) + c 2 Eo(0; 


s The substitution t = vTx reduces Eq. (4) to the standard form t 2 y" + ty' + (f 2 — v 2 )y = 0. 
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hence the general solution of Eq. (7) for x > 0 is 

y = ciJoi-fkx) + c 2 Y 0 (VXx), (9) 


where Jq and Yo denote the Bessel functions of the first and second kinds of order 
zero. From Eqs. (7) and (13) of Section 5.7 we have 


OO 

J(j(^/Xx) = 1 + y ' 

m=l 


(_l) m X m x 2m 

2 2 m ( m \) 2 


x > 0, 


( 10 ) 


Y 0 (VXx) 


2 
7 r 



OO 

Jq(s/xx) + y ' 

m = 1 


(— l) m+l H m X m x 2m 

2 2 m ( m \) 2 


x > 0, 

(ID 


where H m = 1 + (1/2) H + (1/m) and y = lim (H m — In m). The graphs of 

m—>o o 

y = J 0 (x) and y = Yo(;t) are given in Figure 5.7.2. 

Suppose that we seek a solution of Eq. (7) that also satisfies the boundary condi- 
tions 


y(0) = o, (12) 

y(. i) = o, (13) 

which are typical of those we have met in other problems in this chapter. Since 
/ 0 (0) = 1 and Y 0 (x) — »■ — oo as x 0, the condition y(0) = 0 can be satisfied only by 
choosing cj = c 2 = 0 in Eq. (9). Thus the boundary value problem (7), (12), (13) has 
only the trivial solution. 

One interpretation of this result is that the boundary condition (12) at x — 0 is 
too restrictive for the differential equation (7). This illustrates the general situation, 
namely, that at a singular boundary point it is necessary to consider a modified type 
of boundary condition. In the present problem, suppose that we require only that 
the solution (9) and its derivative remain bounded. In other words, we take as the 
boundary condition at x = 0 the requirement 

y, y’ bounded as x -> 0. (14) 

This condition can be satisfied by choosing c 2 = 0 in Eq. (9), so as to eliminate the 
unbounded solution Yo. The second boundary condition, _y(l) = 0, then yields 

J 0 (VX) = 0. (15) 

It is possible to show 9 that Eq. (15) has an infinite set of discrete positive roots, 
which yield the eigenvalues 0 < ki < k 2 <■■ ■ < X n < ■ ■ ■ of the given problem. The 
corresponding eigenfunctions are 

4> n (x) =/o(v / ^*), (16) 


9 The function Jo is well tabulated; the roots of Eq. (15) can be found in various tables, for example, those 
in Jahnke and Emde or Abramowitz and Stegun. You can also use a computer algebra system to compute 
them quickly. The first three roots of Eq. (15) are y/k = 2.405, 5.520, and 8.654, respectively, to four 
significant figures; = (n — 1/4 )jr for large n. 
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determined only up to a multiplicative constant. The boundary value problem (7), 
(13), and (14) is an example of a singular Sturm-Liouville problem. This example 
illustrates that if the boundary conditions are relaxed in an appropriate way, then a 
singular Sturm-Liouville problem may have an infinite sequence of eigenvalues and 
eigenfunctions, just as a regular Sturm-Liouville problem does. 

Because of their importance in applications, it is worthwhile to investigate singular 
boundary value problems a little further. There are two main questions that are of 
concern: 


1. Precisely what type of boundary conditions can be allowed in a singular Sturm-Liouville 
problem? 

2. To what extent do the eigenvalues and eigenfunctions of a singular problem share the 
properties of eigenvalues and eigenfunctions of regular Sturm-Liouville problems? In 
particular, are the eigenvalues real, are the eigenfunctions orthogonal, and can a given 
function be expanded as a series of eigenfunctions? 


Both these questions can be answered by a study of the identity 

-t 

{L[u] v — uL[v]} dx = 0, 


r 

Jo 


(17) 


which played an essential part in the development of the theory of regular Sturm- 
Liouville problems. We therefore investigate the conditions under which this relation 
holds for singular problems, where the integral in Eq. (17) may now have to be ex- 
amined as an improper integral. To be definite, we consider the differential equation 
(1) and assume that x = 0 is a singular boundary point but that x = 1 is not. The 
boundary condition (3) is imposed at the nonsingular boundary point x = 1, but we 
leave unspecified, for the moment, the boundary condition at x = 0. Indeed, our 
principal objective is to determine what kinds of boundary conditions are allowable 
at a singular boundary point if Eq. (17) is to hold. 

Since the boundary value problem under investigation is singular at x = 0, we 


v dx. as in 


choose e > 0 and consider the integral J L[u]v dx, instead of J L[u] 

Section 11.2. Afterwards we let e approach zero. Assuming that u and v have at least 
two continuous derivatives one < x < 1 , and integrating twice by parts, we find that 

f 1 1 

J [L[u]v — uL[v]} dx = —p(x)[u'(x)v(x) — u(x)v'(x)] . (18) 

The boundary term at x = 1 is again eliminated if both it and v satisfy the boundary 
condition (3), and thus 


f 


Taking the limit as e 

»i 


{L[u]v — uL[v]\ dx — p(e) [«'(e) v(e) — w(e)i/(e)] . 
0 yields 


/ 


[L[u]v — uL[v]} dx — limp(e) \u!{e) v(e) — n(e)i/(e)] . 


(19) 


(20) 


Hence Eq. (17) holds if and only if, in addition to the assumptions stated previously, 
limp(e) [u r (e)v(e) — u(e) i/(e)] = 0 (21) 
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for every pair of functions u and v in the class under consideration. Equation (21) 
is therefore the criterion that determines what boundary conditions are allowable at 
x = 0 if that point is a singular boundary point. A similar condition applies atx = 1 
if that boundary point is singular, namely, 

limp(l — e) [m'( 1 — e)u(l — e) — «( 1 — e)v f (l — e)] = 0. (22) 

In summary, as in Section 11.2, a singular boundary value problem for Eq. (1) is 
said to be self-adjoint if Eq. (17) is valid, possibly as an improper integral, for each 
pair of functions u and v with the following properties: they are twice continuously 
differentiable on the open interval 0 < x < 1, they satisfy a boundary condition of 
the form (2) at each regular boundary point, and they satisfy a boundary condition 
sufficient to ensure Eq. (21) if x = 0 is a singular boundary point, or Eq. (22) if 
x — 1 is a singular boundary point. If at least one boundary point is singular, then 
the differential equation (1), together with two boundary conditions of the type just 
described, are said to form a singular Sturm-Liouville problem. 

For example, for Eq. (7) we have p(x) = x. If both u and v satisfy the boundary 
condition ( 14) at x = 0, it is clear that Eq. (21) will hold. Hence the singular boundary 
value problem, consisting of the differential equation (7) , the boundary condition (14) 
at x = 0, and any boundary condition of the form (3) at x = 1, is self-adjoint. 

The most striking difference between regular and singular Sturm-Liouville prob- 
lems is that in a singular problem the eigenvalues may not be discrete. That is, the 
problem may have nontrivial solutions for every value of k, or for every value of k in 
some interval. In such a case the problem is said to have a continuous spectrum. It 
may happen that a singular problem has a mixture of discrete eigenvalues and also 
a continuous spectrum. Finally, it is possible that only a discrete set of eigenvalues 
exists, just as in the regular case discussed in Section 11.2. For example, this is true 
of the problem consisting of Eqs. (7), (13), and (14). In general, it may be difficult to 
determine which case actually occurs in a given problem. 

A systematic discussion of singular Sturm-Liouville problems is quite sophisti- 
cated 10 indeed, requiring a substantial extension of the methods presented in this 
book. We restrict ourselves to some examples related to physical applications; in 
each of these examples it is known that there is an infinite set of discrete eigenvalues. 

If a singular Sturm-Liouville problem does have only a discrete set of eigenvalues 
and eigenfunctions, then Eq. (17) can be used, just as in Section 11.2, to prove that the 
eigenvalues of such a problem are real and that the eigenfunctions are orthogonal 
with respect to the weight function r. The expansion of a given function in terms of 
a series of eigenfunctions then follows as in Section 11.2. 

Such expansions are useful, as in the regular case, for solving nonhomogeneous 
boundary value problems. The procedure is very similar to that described in Sec- 
tion 11.3. Some examples for ordinary differential equations are indicated in Prob- 
lems 1 to 4, and some problems for partial differential equations appear in Sec- 
tion 11.5. 


10 See, for example. Chapter 5 of the book by Yosida listed in the references at the end of this chapter. 
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For instance, the eigenfunctions <p„ (x) = J 0 (s/X^ x) of the singular Sturm-Liouville 
problem 

— (xy'f = Xxy, 0 < x < 1, 

y, y' bounded as x -» 0, y( 1) = 0 

satisfy the orthogonality relation 

/ x<p m {x)(j) n (x) dx = 0, m^n (23) 

Jo 

with respect to the weight function r(x) = x. Then, if / is a given function, we assume 
that 

OO 

fix) = ^2 CnJois/XnX). (24) 

n= 1 

Multiplying Eq. (24) by xJoi^fX^x) and integrating term by term from x = 0 to x = 1 
yield 

/ xf(x)Jp(- \fxfnX) dx = y, c n 1 xJo(-2K n x)Jo(y/K l x)dx. (25) 

Jo n=1 Jo 

Because of the orthogonality condition (23), the right side of Eq. (25) collapses to a 
single term; hence 

/ xf (x)J 0 *) <Jx 

Cm = JjL -y , (26) 

/ xjQ( y fxf l x)dx 

Jo 

which determines the coefficients in the series (24). 

The convergence of the series (24) is established by an extension of Theorem 11.2.4 
to cover this case. This theorem can also be shown to hold for other sets of Bessel 
functions, which are solutions of appropriate boundary value problems, for Legen- 
dre polynomials, and for solutions of a number of other singular Sturm-Liouville 
problems of considerable interest. 

It must be emphasized that the singular problems mentioned here are not nec- 
essarily typical. In general, singular boundary value problems are characterized by 
continuous spectra, rather than by discrete sets of eigenvalues. The corresponding 
sets of eigenfunctions are therefore not denumerable, and series expansions of the 
type described in Theorem 11.2.4 do not exist. They are replaced by appropriate 
integral representations. 

PROBLEMS 

1. Find a formal solution of the nonhomogeneous boundary value problem 


-(xy'Y = [ixy+f(x), 
y, y' bounded as x — > 0, v(l) = 0, 
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where / is a given continuous function on 0 < x < 1 , and p is not an eigenvalue of the 
corresponding homogeneous problem. 

Hint: Use a series expansion similar to those in Section 11.3. 

2. Consider the boundary value problem 


-(*/)' = kx y, 

y, y' bounded asx — > 0, /( 1) = 0. 


(a) Show that 2 l 0 = 0 is an eigenvalue of this problem corresponding to the eigenfunction 
<pa{x) = 1. If X. > 0, show formally that the eigenfunctions are given by <j> n (x) = Jo(\/K,x), 
where VKi is the nth positive root (in increasing order) of the equation /q(vT) = 0. It is 
possible to show that there is an infinite sequence of such roots. 

(b) Show that if m, n = 0,1,2, , then 



I x<p m (x)<j> n (x) dx = 0, m 7^ n. 
Jo 

(c) Find a formal solution to the nonhomogeneous problem 

-(*/)' = pxy+f(x), 
y, y' bounded as x — *■ 0, y\ 1) = 0, 


where / is a given continuous function on 0 < x < 1 , and p is not an eigenvalue of the 
corresponding homogeneous problem. 

3. Consider the problem 


-(*/)' + ( k 2 /x)y = Xxy, 
y, y' bounded asx — > 0, y(l) = 0, 


where k is a positive integer. 

(a) Using the substitution t = *Jkx, show that the given differential equation reduces to 
Bessel’s equation of order k (see Problem 9 of Section 5.7). One solution is / fc (f); a second 
linearly independent solution, denoted by Yk{t), is unbounded as t -> 0. 

(b) Show formally that the eigenvalues Xi,X 2 ,. . . of the given problem are the squares of 
the positive zeros of /*(- and that the corresponding eigenfunctions are 
<p n {x) — J k (Vkn x ). It is possible to show that there is an infinite sequence of such zeros. 

(c) Show that the eigenfunctions <p n {x) satisfy the orthogonality relation 


X(f> m (x)(j> n (x) dx = 0, m 7^ n. 

(d) Determine the coefficients in the formal series expansion 



OO 


f( x ) = ^ a n4>niX). 


n = 1 

(e) Find a formal solution of the nonhomogeneous problem 

-(*/)' + (k 2 /x)y = pxy+fix), 
y, y' bounded asx — >■ 0, y(l) = 0, 


where / is a given continuous function on 0 < x < 1, and p is not an eigenvalue of the 
corresponding homogeneous problem. 


11.5 A Bessel Series Expansion 
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4. Consider Legendre’s equation (see Problems 22 through 24 of Section 5.3) 

-[(1 -x 2 )/]' = 

subject to the boundary conditions 

y( 0) = 0, y, y 1 bounded as x — > 1 . 


The eigenfunctions of this problem are the odd Legendre polynomials (j>\{x) = P\(x) = x, 
<h(x) = Pi(x) = ( 5x 3 - 3x)/2, . . . , <p n (x) — P^n- iW, ■ ■ • corresponding to the eigenvalues 
X 1 = 2, X 2 = 4 ■ 3, . . . , X n = 2n(2n - 1), . . . . 

(a) Show that 

I <p m (x)(t> n ( x) dx = 0 , m 7 ^ n. 

Jo 


(b) Find a formal solution of the nonhomogeneous problem 

-[(1 - x 2 )/]' = p y+f(x), 
y(0) = 0, y, y' bounded as x — > 1, 


where / is a given continuous function on 0 < x < 1 , and /j. is not an eigenvalue of the 
corresponding homogeneous problem. 

5. The equation 

(1 - x 2 )y" -xy’ + Xy = 0 (i) 


is Chebyshev’s equation; see Problem 10 of Section 5.3. 

(a) Show that Eq. (i) can be written in the form 

— [(1 — x 2 ) 1/2 y']' = A(1 — x 2 ) _1/2 y, — 1 < x < 1 . (ii) 


(b) Consider the boundary conditions 

y, y’ bounded asx -> — 1, y, y' bounded asx -> 1. (iii) 

Show that the boundary value problem (ii), (iii) is self-adjoint. 

(c) It can be shown that the boundary value problem (ii), (iii) has the eigenvalues 
Xo — 0, X\ = 1, Xi = 4, . . . , X n = n 2 ,. . . . The corresponding eigenfunctions are the Cheby- 
shev polynomials T„(x): T 0 (x) = 1, 7)(x) = x, T^x) = 1 — 2x 2 , .... Show that 




T rn (x)T n (x) 


/_! (1-X 2 )V2 

Note that this is a convergent improper integral. 


dx = 0 , m n. 


(iv) 


11.5 Further Remarks on the Method of Separation 
of Variables: A Bessel Series Expansion 

In this chapter we are interested in extending the method of separation of vari- 
ables developed in Chapter 10 to a larger class of problems — to problems involving 
more general differential equations, more general boundary conditions, or different 
geometrical regions. We indicated in Section 11.3 how to deal with a class of more 
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general differential equations or boundary conditions. Here we concentrate on prob- 
lems posed in various geometrical regions, with emphasis on those leading to singular 
Sturm-Liouville problems when the variables are separated. 

Because of its relative simplicity, as well as the considerable physical significance of 
many problems to which it is applicable, the method of separation of variables merits 
its important place in the theory and application of partial differential equations. 
However, this method does have certain limitations that should not be forgotten. In 
the first place, the problem must be linear so that the principle of superposition can 
be invoked to construct additional solutions by forming linear combinations of the 
fundamental solutions of an appropriate homogeneous problem. 

As a practical matter, we must also be able to solve the ordinary differential equa- 
tions, obtained after separating the variables, in a reasonably convenient manner. 
In some problems to which the method of separation of variables can be applied in 
principle, it is of very limited practical value due to a lack of information about the 
solutions of the ordinary differential equations that appear. 

Furthermore, the geometry of the region involved in the problem is subject to 
rather severe restrictions. On the one hand, a coordinate system must be employed 
in which the variables can be separated, and the partial differential equation replaced 
by a set of ordinary differential equations. For Laplace’s equation there are about 
a dozen such coordinate systems; only rectangular, circular cylindrical, and spherical 
coordinates are likely to be familiar to most readers of this book. On the other 
hand, the boundary of the region of interest must consist of coordinate curves or 
surfaces — that is, curves or surfaces on which one variable remains constant. Thus, 
at an elementary level, one is limited to regions bounded by straight lines or circular 
arcs in two dimensions, or by planes, circular cylinders, circular cones, or spheres in 
three dimensions. 

In three-dimensional problems the separation of variables in Laplace’s operator 
u xx + Uyy + u zz leads to the equation X" + XX = 0 in rectangular coordinates, to 
Bessel’s equation in cylindrical coordinates, and to Legendre’s equation in spherical 
coordinates. It is this fact that is largely responsible for the intensive study that 
has been made of these equations and the functions defined by them. It is also 
noteworthy that two of the three most important situations lead to singular, rather 
than regular, Sturm-Liouville problems. Thus singular problems are by no means 
exceptional and may be of even greater interest than regular ones. The remainder 
of this section is devoted to an example involving an expansion of a given function 
as a series of Bessel functions. 

The Vibrations of a Circular Elastic Membrane. In Section 10.7 [Eq. (7)] we noted that the 
transverse vibrations of a thin elastic membrane are governed by the two-dimensional 
wave equation 



(1) 


To study the motion of a circular membrane, it is convenient to write Eq. (1) in polar 
coordinates: 



( 2 ) 


We will assume that the membrane has unit radius, that it is fixed securely around its 
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circumference, and that initially it occupies a displaced position independent of the 
angular variable 0 , from which it is released at time t = 0. Because of the circular 
symmetry of the initial and boundary conditions, it is natural to assume also that u is 


independent of 0; that is, u is a function of r and t only. In this event the differential 
equation (2) becomes 

2 / 1 \ 
a 1 u rr H — u r J = u tt , 

0 < r < 1, t > 0. 

(3) 

The boundary condition at r = 1 is 



s 

*— t- 

II 

o 

t > 0, 

(4) 

and the initial conditions are 



u(r, 0) =/(r), 

0 < r < 1, 

(5) 

u,(r.0) = 0, 

0 < r < 1, 

(6) 


where f(r) describes the initial configuration of the membrane. For consistency we 
also require that/(l) = 0. Finally, we state explicitly the requirement that u(r, t) be 
bounded for 0 < r < 1. 

Assuming that u(r, t ) = R(r)T (t), and substituting for u(r, t) in Eq. (3), we obtain 
R" + (l/r)R' _ 1 T" _ 2 

r (7) 


We have anticipated that the separation constant must be negative by writing it as —X 2 
with X > 0. 11 Then Eq. (7) yields the following two ordinary differential equations: 

r 2 R" + rR' + X 2 r 2 R = 0, (8) 

T" + X 2 a 2 T = 0. (9) 


Thus, from Eq. (9), 


T ( t ) = k\ sin Xat + k 2 cos Xat . 


(10) 


Introducing the new independent variable f = Xr into Eq. (8), we obtain 


7 d 2 R 

dp" 



+ $ 2 R = 0 , 


which is Bessel’s equation of order zero. Thus 


R — d/otf) + C2Yq(^), 


( 11 ) 


( 12 ) 


where Jq and Yo are Bessel functions of the first and second kinds, respectively, of 
order zero (see Section 11.4). In terms of r we have 

R = ciJ 0 (Xr) + c 2 Y 0 (Xr). (13) 


n By denoting the separation constant by —A 2 , rather than simply by —A, we avoid the appearance of 
numerous radical signs in the following discussion. 
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The boundedness condition on u(r,t) requires that R remain bounded as r — »■ 0. Since 
Yo(Ar) — ► — oo as r —>■ 0, we must choose C 2 = 0. The boundary condition (4) then 
requires that 

JoW = 0. (14) 

Consequently, the allowable values of the separation constant are obtained from the 
roots of the transcendental equation (14). Recall from Section 11.4 that Jq(X) has 
an infinite set of discrete positive zeros, which we denote by Xi, X 2 , X 3 , . . . , X n , . . . , 
ordered in increasing magnitude. Further, the functions J 0 (X n r) are the eigenfunc- 
tions of a singular Sturm-Liouville problem and can be used as the basis of a series 
expansion for the given function /. The fundamental solutions of this problem, sat- 
isfying the partial differential equation (3), the boundary condition (4), and bound- 
edness condition, are 

u n {r, t) = Jo(X n r) sin X n at, n — 1,2,..., (15) 

v n (r,t) = Jo(X„r) cos X n at, n = 1,2 (16) 

Next we assume that u(r, t) can be expressed as an infinite linear combination of the 
fundamental solutions (15), (16): 

OO 

u(r,t ) = ^2 [ k nU n (r,t ) + c n v„(r , ?)] 
n = 1 


= ^2 [k„Jo(X n r) sin X n at + c n Jo(X n r) cos X n at], 

n = 1 

The initial conditions require that 

OO 

u(r. 0) = J2c„MX n r) =f{r ) 

n = 1 

and 

OO 

u t (r, 0) = Yx n akJ 0 {X n r) = 0. 

n = 1 

From Eq. (26) of Section 11.4 we obtain 


k n — 0, c n — 


f rf(r)J( } (X n r) dr 

n=y ; n — 1,2, . 


L 


r[J 0 (X n r)] 2 dr 


(17) 


(18) 


(19) 


( 20 ) 


Thus the solution of the partial differential equation (3) satisfying the boundary 
condition (4) and the initial conditions (5) and (6) is given by 


u(r,t) = ^2 c "Jo(X n r) cosX n 


at 


n = 1 


with the coefficients c„ defined by Eq. (20). 


( 21 ) 
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PROBLEMS 1. Consider Laplace’s equation u xx + u yy = 0 in the parallelogram whose vertices are (0,0), 
(2,0), (3,2), and (1,2). Suppose that on the side y = 2 the boundary condition is 
u(x, 2) = f(x) for 1 < x < 3, and that on the other three sides u = 0 (see Figure 11.5.1). 



B (2, 0) 

(a) 

FIGURE 11.5.1 The region in Problem 1. 



(a) Show that there are no nontrivial solutions of the partial differential equation of the 
form u(x,y ) = X(x)Y(y) that also satisfy the homogeneous boundary conditions. 

(b) Let § = x — ^y,ri = y. Show that the given parallelogram in the xy-plane transforms 

into the square 0 < f < 2, 0 < < 2 in the tjrj- plane. Show that the differential equation 

transforms into 

T Urfi] — 11- 

HOW are the boundary conditions transformed? 

(c) Show that in the £ ;?-plane the differential equation possesses no solution of the form 

«(?,»?) = U{£)V(tj). 

Thus in the xy-plane the shape of the boundary precludes a solution by the method of the 
separation of variables, while in the f ?j-plane the region is acceptable but the variables in 
the differential equation can no longer be separated. 

2. Find the displacement u(r,t) in a vibrating circular elastic membrane of radius 1 that 
satisfies the boundary condition 


u(l,t) = 0, t > 0, 


and the initial conditions 


u(r,0) = 0, Ut(r,0) = g(r), 0 < r < 1, 


where g(l) = 0. 

Hint: The differential equation to be satisfied is Eq. (3) of this section. 


714 


Chapter 11. Boundary Value Problems 


3. Find the displacement u(r, t) in a vibrating circular elastic membrane of radius 1 that 
satisfies the boundary condition 


u(l,t) =0, t > 0, 


and the initial conditions 

u(r,0) =/(>), u t {r , 0) = g(r), 0 < r < 1, 

where /(l) = g(l) = 0. 

4. The wave equation in polar coordinates is 

u n + (1 /r)u r + (l/r 2 )u ee = a~ 2 u tt . 

Show that if u(r,9,t ) = R(r)@(9)T(t), then R, 0, and T satisfy the ordinary differential 
equations 


r 2 R" + rR' + (X 2 r 2 - n 2 )R = 0, 
0 " + « 2 0 = 0 , 
T" + X 2 a 2 T = 0. 


5. In the circular cylindrical coordinates r, 6 , z defined by 

x=rco&6, y = rsin6>, z = z, 

Laplace’s equation is 

u rr + (1 /r)u r + (l/r 2 )uee + u zz = 0. 

(a) Show that if u(r,9,z) = R(r)&(9)Z(z), then R, 0, and Z satisfy the ordinary differen- 
tial equations 

r 2 R" + rR' + ( X 2 r 2 - n 2 )R = 0, 

0" + n 2 0 = 0, 

z" - y}z = o. 

(b) Show that if u(r, 9 , z) is independent of 9, then the first equation in part (a) becomes 

rR" + rR’ + X 2 rR = 0, 


the second is omitted altogether, and the third is unchanged. 

6. Find the steady-state temperature in a semi-infinite rod 0 < z < oo, 0 < r < 1, if the tem- 
perature is independent of 9 and approaches zero as z — >■ oo. Assume that the temperature 
u(r, z) satisfies the boundary conditions 


m(1,z) = 0, z > 0, 
u(r,0)=f(r), 0 < r < 1. 


Hint: Refer to Problem 5. 
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7. The equation 

Vxx T % T k v = 0 

is a generalization of Laplace’s equation and is sometimes called the Helmholtz 12 equation. 

(a) In polar coordinates the Helmholtz equation is 

v rr + (1 /r)v r + (l/r 2 )v e g + k 2 v = 0. 

If u(r, 9) — R(r)Q(9), show that R and © satisfy the ordinary differential equations 
r 2 R" + rR' + ( k 2 r 2 - X 2 )R = 0, ©" + X 2 & = 0. 

(b) Consider the Helmholtz equation in the disk r < c. Find the solution that remains 
bounded at all points in the disk, that is periodic in 8 with period 2n , and that satisfies the 
boundary condition v(c,8) = f(9), where / is a given function on 0 < 6 < 2n. 

Hint: The equation for R is a Bessel equation. See Problem 3 of Section 11.4. 

8. Consider the flow of heat in an infinitely long cylinder of radius 1: 0 < /- < 1, 0 < 9 < 2n, 
— oo < z < oo. Let the surface of the cylinder be held at temperature zero, and let the initial 
temperature distribution be a function of the radial variable r only. Then the temperature 
u is a function of r and t only, and satisfies the heat conduction equation 

a 2 [u rr + (l/r)w r ] = u t , 0 < r < 1, t > 0, 

and the following initial and boundary conditions: 

u(r,0)=f(r), 0 < r < 1, 

w(l,f) = 0, t > 0. 


Show that 

OO 

u(r.t) = 'Y^c n Jo{Kr)e~ a2x " t , 

n = 1 

where J 0 (K ) = 0. Find a formula for c n . 

9. In the spherical coordinates p,9,4>(p > 0,0 <9< 2n,0 < <f> < n) defined by the equations 
x = p cos 9 sin 4>, y = psin$sin/, z = pcos/ , 

Laplace’s equation is 

P~ u pp + 2.pUp + (CSC 2 (j>)Ugg + W00 + (cot <f>)u ^ = 0. 

(a) Show that if w(p,0, 4>) — P(p)@(9)^(4>), then P, ©, and 4> satisfy ordinary differential 
equations of the form 

p 2 P" + 2pP' - /x 2 P = 0, 

0" + ^ 2 © = O, 

(sin 2 /)4>" + (sin <p cos /)<!>' + (p 2 sin 2 4> — k 2 )<& = 0. 


12 Hermann von Helmholtz (1821-1894) was trained in medicine and physiology; early in his career he 
made important contributions to physiological optics and acoustics, including the invention of the oph- 
thalmoscope in 185 1 . Later his interests turned to physics, especially fluid mechanics and electrodynamics. 
During his lifetime he held chairs in physiology or physics at several German universities. 
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The first of these equations is of the Euler type, while the third is related to Legendre’s 
equation. 

(b) Show that if u(p,9.cj>) is independent of 9, then the first equation in part (a) is un- 
changed, the second is omitted, and the third becomes 

(sin 2 ij>)<- t>" + (sin(/> cos0)<t>' + (/ 1 1 sin 2 (p)<& = 0. 


(c) Show that if a new independent variable is defined by s = cos <p, then the equation for 
<t> in part (b) becomes 


, d 2 d<t> , 

a-^ 2) — + m 2 * = °, 


-1 <S<1. 


Note that this is Legendre’s equation. 

10. Find the steady-state temperature u(p,<p) in a sphere of unit radius if the temperature is 
independent of 9 and satisfies the boundary condition 


u(l,tp) =f(<p )> 0 <(p<n. 


Hint: Refer to Problem 9 and to Problems 22 through 29 of Section 5.3. Use the fact 
that the only solutions of Legendre’s equation that are finite at both ±1 are the Legendre 
polynomials. 


11.6 Series of Orthogonal Functions: Mean Convergence 


In Section 11.2 we stated that, under certain restrictions, a given function / can be 
expanded in a series of eigenfunctions of a Sturm-Liouville boundary value problem, 
the series converging to \f(x+) +f(x—)]/2 at each point in the open interval. Under 
somewhat more restrictive conditions the series converges to f{x) at each point in 
the closed interval. This type of convergence is referred to as pointwise convergence. 
In this section we describe a different kind of convergence that is especially useful 
for series of orthogonal functions, such as eigenfunctions. 

Suppose that we are given the set of functions (p\,(p 2 , ■ • • , that are continuous 
on the interval 0 < x < 1 and satisfy the orthonormality condition 


f 


r (x) </>/ (x) (f>j (x) dx = 


0, i / j, 

1, i = i, 


( 1 ) 


where r is a nonnegative weight function. Suppose also that we wish to approximate 
a given function/, defined on 0 < x < 1, by a linear combination of /i, . . . , That 
is, if 

n 

S n (x) = y^aj&jx), (2) 

i= 1 


we wish to choose the coefficients a \, . . . , a n so that the function S n will best approx- 
imate / on 0 < x < 1. The first problem that we must face in doing this is to state 
precisely what we mean by “best approximate / on 0 < x < 1.” There are several 
reasonable meanings that can be attached to this phrase. 
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1. We can choose n points xi,...,x n m the interval 0 < x < 1 and require that S„(x) have the 
same value as f(x) at each of these points. The coefficients a\, . . . ,a n are found by solving 
the set of linear algebraic equations 


'Y u a i (t> i (Xj) =f(x j ), j = l,...,n. (3) 

i = 1 

This procedure is known as the method of collocation. It has the advantage that it is very 
easy to write down Eqs. (3); one needs only to evaluate the functions involved at the points 
x\,...,x n . If these points are well chosen, and if n is fairly large, then presumably S„(x) 
will not only be equal to/(x) at the chosen points but will be reasonably close to it at other 
points as well. However, collocation has several deficiencies. One is that if one more 
base function </>„+ 1 is added, then one more point x n+ i is required, and all the coefficients 
must be recomputed. Thus it is inconvenient to improve the accuracy of a collocation 
approximation by including additional terms. Further, the coefficients a, depend on the 
location of the points xi,...,x n , and it is not obvious how best to select these points. 

2. Alternatively, we can consider the difference \f(x) — S n (x) | and try to make it as small 
as possible. The trouble here is that | f{x) — S„(x) | is a function of x as well as of the 
coefficients a±, . . . ,a n , and it is not clear how to calculate a,. The choice of a, that makes 
| f(x) — 5„(x)| small at one point may make it large at another. One way to proceed is to 
consider instead the least upper bound 13 of | f(x) — 5„(^)| for x in 0 < x < 1, and then to 
choose «i, . . . ,a n so as to make this quantity as small as possible. That is, if 

E n (a u ...,a n ) = lub \f(x) — S n (x)\, (4) 


then choose a\,...,a n so as to minimize E n . This approach is intuitively appealing and is 
often used in theoretical calculations. In practice, however, it is usually very hard, if not 
impossible, to write down an explicit formula for E n {a\, . . . ,«„). Further, this procedure 
also shares one of the disadvantages of collocation: upon adding an additional term to 
S n (x ), one must recompute all the preceding coefficients. Thus it is not often useful in 
practical problems. 

3. Another way to proceed is to consider 


In (^1 1 • • • > ) 


f 


r(x)\f(x) 


S„(x)| dx. 


(5) 


If r(x) = 1, then /„ is the area between the graphs of y =f(x) and y = S n (x) (see 
Figure 11.6.1). We can then determine the coefficients a, so as to minimize I„. To avoid 
the complications resulting from calculations with absolute values, it is more convenient 
to consider instead 


Rn(ai, • • ■ i Q-n) 


f 


r(x)[f(x) 


S n (x)] 2 dx 


( 6 ) 


as our measure of the quality of approximation of the linear combination S„(x) to f(x). 
Although R„ is clearly similar in some ways to /„. it lacks the simple geometric interpre- 
tation of the latter. Nevertheless, it is much easier mathematically to deal with R n than 
with The quantity R„ is called the mean square error of the approximation S„ to /. If 
a\,...,a n are chosen so as to minimize R n , then S n is said to approximate / in the mean 
square sense. 


13 The least upper bound (lub) is an upper bound that is smaller than any other upper bound. The lub of 
a bounded function always exists and is equal to the function's maximum if it has one. 
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FIGURE 11.6.1 Approximation of fix) by S n {x). 


To choose a\, . . . , a n so as to minimize R„, we must satisfy the necessary conditions 
dR n /daj = 0 , i=l,...,n. (7) 

Writing out Eq. (7), and noting that 9S„(x;«i, . . . ,a n )/dcii is equal to <pi(x), we obtain 

_ dRn = 2 f r ( x ^f( x) _ S n {x)]4>i(x) dx = 0. (8) 

3 at Jo 

Substituting for S„(x ) from Eq. (2) and making use of the orthogonality relation (1), 
we find that 

aj = / r(x)f(x)(l>i(x) dx, i=\,...,n. (9) 

Jo 

The coefficients defined by Eq. (9) are called the Fourier coefficients off with respect 
to the orthonormal set (pi, fc. ■ ■ ■ , <Pn and the weight function r. Since the conditions 
(7) are only necessary and not sufficient for R n to be a minimum, a separate argument 
is required to show that R„ is actually minimized if the a, are chosen by Eq. (9). This 
argument is outlined in Problem 5. 

Note that the coefficients (9) are the same as those in the eigenfunction series whose 
convergence, under certain conditions, was stated in Theorem 11.2.4. Thus S n (x) is 
the nth partial sum in this series and constitutes the best mean square approximation 
to f(x) that is possible with the functions </>!,...,</>„. We will assume hereafter that 
the coefficients a t in S n (x) are given by Eq. (9). 

Equation (9) is noteworthy in two other important respects. In the first place, it 
gives a formula for each a, separately, rather than a set of linear algebraic equations for 
a \ , . . . , a„ as in the method of collocation, for example. This is due to the orthogonality 
of the base functions </>i, ■ ■ . ,(p n - Further, the formula for «, is independent of n. the 
number of terms in S n (x). The practical significance of this is as follows. Suppose 
that, to obtain a better approximation to /, we desire to use an approximation with 
more terms — say, k terms, where k > n. It is then unnecessary to recompute the first 
n coefficients in Sk ( x ). All that is required is to compute, from Eq. (9), the coefficients 
a n+ 1 , . . . ,ak arising from the additional base functions <p n+ \, . ■ ■ ,(pk- Of course, if/, r, 
and the (p n are complicated functions, it may be necessary to evaluate the integrals 
numerically. 

Now let us suppose that there is an infinite sequence of functions 0i, , 

that are continuous and orthonormal on the interval 0 < x < 1. Suppose further that 
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as n increases without bound, the mean square error R„ approaches zero. In this 
event the infinite series 

OO 

a i<Pii x ) 

i= 1 

is said to converge in the mean square sense (or, more simply, in the mean) to f(x). 
Mean convergence is an essentially different type of convergence from the pointwise 
convergence considered up to now. A series may converge in the mean without 
converging at each point. This is plausible geometrically because the area between 
two curves, which behaves in the same way as the mean square error, may be zero 
even though the functions are not the same at every point. They may differ on 
any finite set of points, for example, without affecting the mean square error. It is 
less obvious, but also true, that even if an infinite series converges at every point, it 
may not converge in the mean. Indeed, the mean square error may even become 
unbounded. An example of this phenomenon is given in Problem 4. 

Now suppose that we wish to know what class of functions, defined on 0 < x < 1, 

can be represented as an infinite series of the orthonormal set fa, i = 1,2, The 

answer depends on what kind of convergence we require. We say that the set 
fa, . . . , fa , . . . is complete with respect to mean square convergence for a set of func- 
tions T, if for each function / in T, the series 

OO 

fix) = Y^ a ifa( x ), ( 10 ) 

i = 1 

with coefficients given by Eq. (9), converges in the mean. There is a similar definition 
for completeness with respect to pointwise convergence. 

Theorems having to do with the convergence of series such as that in Eq. (10) can 
now be restated in terms of the idea of completeness. For example, Theorem 11.2.4 
can be restated as follows: the eigenfunctions of the Sturm-Liouville problem 

-I >(*)/]' + q{x)y = A. r(x)y , 0 < x < 1, (11) 

aiy(0) + ct 2 y'(0) = 0, /iy(l) + fty'CD = 0 (12) 

are complete with respect to ordinary pointwise convergence for the set of functions 
that are continuous on 0 < x < 1 and that have piecewise continuous derivatives 
there. 

If pointwise convergence is replaced by mean convergence, Theorem 11.2.4 can be 
considerably generalized. Before we state such a companion theorem to Theorem 
11.2.4, we first define what is meant by a square integrable function. A function/ is 
said to be square integrable on the interval 0 < x < 1 if both/ and/ 2 are integrable 14 
on that interval. The following theorem is similar to Theorem 11.2.4 except that it 
involves mean convergence. 


14 For the Riemann integral used in elementary calculus the hypotheses that / and f 2 are integrable 
are independent: that is, there are functions such that / is integrable but f 2 is not, and conversely (see 
Problem 6). A generalized integral, known as the Lebesgue integral, has the property (among others) that 
if f 2 is integrable, then / is also necessarily integrable. The term square integrable came into common use 
in connection with this type of integration. 
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Theorem 11.6.1 


EXAMPLE 

1 


The eigenfunctions of the Sturm-Liouville problem (11), (12) are complete with 
respect to mean convergence for the set of functions that are square integrable on 
0 < x < 1. In other words, given any square integrable function/, the series (10), 
whose coefficients are given by Eq. (9), converges to/(x) in the mean square sense. 


It is significant that the class of functions specified in Theorem 11.6.1 is very large 
indeed. The class of square integrable functions contains some functions with many 
discontinuities, including some kinds of infinite discontinuities, as well as some func- 
tions that are not differentiable at any point. All these functions have mean con- 
vergent expansions in the eigenfunctions of the boundary value problem (11), (12). 
However, in many cases these series do not converge pointwise, at least not at every 
point. Thus mean convergence is more naturally associated with series of orthogonal 
functions, such as eigenfunctions, than ordinary pointwise convergence. 

The theory of Fourier series discussed in Chapter 10 is just a special case of the 
general theory of Sturm-Liouville problems. For instance, the functions 

(j) n (x) = \/2sinn7rx (13) 

are the normalized eigenfunctions of the Sturm-Liouville problem 

y" + Xy = 0, y(0) = 0, y(l) = 0. (14) 

Thus, if / is a given square integrable function on 0 < x < 1, then according to Theo- 
rem 11.6.1, the series 


OO OO 

f C'O = ^ ^ 0m C*0 = E b m sin mnx, 

m = 1 m = 1 


(15) 


where 


b m — / f(x)<j> m (x)dx = V 2 / f(x) smimrx dx, (16) 

Jo Jo 

converges in the mean. The series (15) is precisely the Fourier sine series discussed 
in Section 10.4. If / satisfies the further conditions stated in Theorem 11.2.4, then this 
series converges pointwise, as well as in the mean. Similarly, a Fourier cosine series 
is associated with the Sturm-Liouville problem 

f + ky = 0, y'( 0) = 0, y(l) = 0. (17) 


Let/(x) = lforO < x < 1. Expand/(x) using the eigenfunctions (13) and discuss the pointwise 
and mean square convergence of the resulting series. 

The series has the form (15) and its coefficients b m are given by Eq. (16). Thus 

r r 1 . V2 

b m = V2 / sin mnxdx = (1 — cos nut), (18) 

Jo nut 
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and the nth partial sum of the series is 


S n (x) = 2^2 

m = 1 


1 — cos mu . 

sin mnx. 

rmt 


(19) 


The mean square error is then 


R 


n 



S n {x)f dx. 


( 20 ) 


By calculating R n for several values of n and plotting the results, we obtain Figure 11.6.2. This 
figure indicates that R„ steadily decreases as n increases. Of course, Theorem 11.6.1 asserts 
that R„ — > 0 as n — >■ oo. Pointwise, we know that S n {x) — > f(x) = 1 as n — > oo for 0 < x < 1; 
further, S n (x) has the value zero forx = 0 orx = 1 for every n. Although the series converges 
pointwise for each value of x, the least upper bound of the error does not diminish as n 
increases. For each n there are points close to x = 0 and x = 1 where the error is arbitrarily 
close to 1. 



FIGURE 11.6.2 Dependence of the mean square error R n on n in Example 1. 


Theorem 11.6.1 can be extended to cover self-adjoint boundary value problems 
having periodic boundary conditions, such as the problem 

/' + Ay = 0, (21) 

y(-L) - y(L) = 0, y'(-L) - y'(L) = 0 (22) 

considered in Example 4 of Section 11.2. The eigenfunctions of the problem (21), (22) 
are <p n {x ) = cos (nnx/L) for n = 0,1,2,... and ty n (x) = sin(n7rx/L) for n = 1,2, 
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If/ is a given square integrable function on — L < x < L, then its expansion in terms 
of the eigenfunctions </>„ and is of the form 

OO 

s a 0 / nnx . nnx\ __ 

f(x) = — + 2^ (a n cos -j- + b n sin — J , (23) 

n = 1 

where 

1 fL H7I X 

a n = j- J fix) cos -j- dx, n = 0,1,2, , (24) 

b n = ^j f(x)sm^-dx, n — 1,2, ... . (25) 

This expansion is exactly the Fourier series for / discussed in Sections 10.2 and 10.3. 
According to the generalization of Theorem 11.6.1, the series (23) converges in the 
mean for any square integrable function /, even though / may not satisfy the condi- 
tions of Theorem 10.3.1, which ensure pointwise convergence. 


PROBLEMS 1. Extend the results of Example 1 by finding the smallest value of n for which R n < 0.02, 
where R n is given by Eq. (20). 

2. Let fix ) = x for 0 < x < 1 and let 4> m (x) = V2 sin mnx. 

(a) Find the coefficients b m in the expansion of/(x) in terms of 4>i(x),<p2(x), 

(b) Calculate the mean square error R„ for several values of n and plot the results. 

(c) Find the smallest value of n for which R n < 0.01. 


3. Follow the instructions for Problem 2 using /(x) = x(l — x) for 0 < x < 1. 

4. In this problem we show that pointwise convergence of a sequence S n (x) does not imply 
mean convergence, and conversely. 

(a) Let S„{x) = u- v /xe _ '“ 2 /2 , 0 < x < 1. Show that S„(x) — > 0 as n — >■ oo for each x in 
0 < x < 1. Show also that 

Rn= f [0 - Sn{x)f dx = ^(1 - e~ n ) 

Jo 

and hence R n — > oo as n — > oo. Thus pointwise convergence does not imply mean conver- 
gence. 

(b) Let S n (x) = x" for 0 < x < 1 and let/(x) = 0 for 0 < x < 1. Show that 

C l 1 

Rn = / [ fix) - S„(x )] 2 dx = — , 

Jo 2 n + 1 


and hence S n (x) converges to/(x) in the mean. Also show that S„(x) does not converge to 
fix) pointwise throughout 0 < x < 1. Thus mean convergence does not imply pointwise 
convergence. 
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5. Suppose that the functions ,■■■ ,4>n satisfy the orthonormality relation (1) and that a 
given function / is to be approximated by S„ ( x ) = ci <p\ (x) + • • • + c„ <p„ ( x ) , where the coef- 
ficients c, are not necessarily those of Eq. (9). Show that the mean square error R„ given 
by Eq. (6) may be written in the form 

.1 n n 

R„ = / r(x)f-(x) dx - V a 2 + Y' (c,- - at) 2 , 

Jo i= i ,=i 


where the a t are the Fourier coefficients given by Eq. (9). Show that R n is minimized if 
Ci = a,- for each i. 

6. In this problem we show by examples that the (Riemann) integrability of / and f 2 are 
independent. 

fx~ 1/2 , 0 < x < 1, 

0, x = 0. 


(a) Let/(x) = 


Show that J f(x) dx exists as an improper integral, but f f 2 (x) dx does not. 

(b) Let/(x) = 

| — i, x irrational. 

f i 

f(x) dx does not. 


/ ,J 


1, x rational, 
—1, x irrational. 

Show that / / 2 (x) dx exists, but 

Jo Jo 


7. Suppose that it is desired to construct a set of polynomials fo(.x),fi(x),f 2 (x), . . . ,fk(x ), . . . , 
where fk(x) is of degree k , that are orthonormal on the interval 0 < x < 1. That is, the set 
of polynomials must satisfy 


( fj,fk)= [ fj(x)f k (x) dx = S jk . 
Jo 


(a) Find/o(x) by choosing the polynomial of degree zero such that (/o,/o) = 1. 

(b) Find /i(x) by determining the polynomial of degree one such that (fo,fi) = 0 and 

</.-./i) - I- 

(c) Find/ 2 (x). 

(d) The normalization condition (f k ,fk ) = 1 is somewhat awkward to apply. Let go(x), 
gi(x), . . . ,gk(x), . . . be the sequence of polynomials that are orthogonal on 0 < x < 1 and 
that are normalized by the condition g/t(l) = 1. Find go W, gi (x), and g 2 (x) and compare 
them with/o(x),/ 1 (x), and/ 2 (x). 


8. Suppose that it is desired to construct a set of polynomials Pq(x), Pi(x), . . . ,P k {x ), . . . , 
where P k {x) is of degree k, that are orthogonal on the interval — 1 < x < 1; see Problem 
7. Suppose further that P k {x) is normalized by the condition P*(l) = 1. Find P Q (x),P\(x), 
P 2 (x), and P 3 (x). Note that these are the first four Legendre polynomials (see Problem 
24 of Section 5.3). 


9. This problem develops some further results associated with mean convergence. 
R„(a\, a n ),Sn{x), and «,■ be defined by Eqs. (6), (2), and (9), respectively. 


(a) Show that 


,i n 

R n = / r(x)/ 2 (x) dx — Y. a 2 . 

Jo ~~f 


Let 


Hint: Substitute for S n (x) in Eq. (6) and integrate, using the orthogonality relation (1). 
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n r 1 

(b) Show that < I r(x)f 2 (x) dx. This result is known as Bessel’s inequality. 

i=l Jo 

oo 

(c) Show that converges. 

i = 1 

r 1 oo 

(d) Show that lim R„ = / r(x)f 2 (x) dx — £ a ?. 

n ^°° Jo i=l 

OO 

(e) Show that cii4>i( x ) converges to f(x) in the mean if and only if 

;= l 


»1 oo 

/ r(x)f 2 (x) dx = V i 
Jo 


This result is known as Parseval’s equation. 

In Problems 10 through 12 let <f>\ , <p 2 , ■ ■ ■ , 4>n . ■ ■ ■ be the normalized eigenfunctions of the Sturm- 
Liouville problem (11), (12). 

10. Show that if a„ is the nth Fourier coefficient of a square integrable function /, then 
lim a n = 0. 

n— >oo 

Hint: Use Bessel’s inequality, Problem 9(b). 

11. Show that the series 

<Pi( x ) + ‘fcW + ■ ■ ■ + <Pn(x) + ■ ■ ■ 


cannot be the eigenfunction series for any square integrable function. 
Hint: See Problem 10. 


12. Show that the series 


4>i M + 


<t>2 (X) 

V2 


(t>n(x) 

«Jn 


+ ■■■ 


is not the eigenfunction series for any square integrable function. 

Hint: Use Bessel’s inequality, Problem 9(b). 

13. Show that Parseval's equation in Problem 9(e) is obtained formally by squaring the series 
(10) corresponding to /, multiplying by the weight function r, and integrating term by 
term. 
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CHAPTER 1 Section 1.1, page 7 


15. 


2. y diverges from 3/2 as t -*■ oo 
4. y — > — 1/2 as t — > oo 
6. y diverges from — 2 as t -> oo 
8. / = 2 - 3y 
10. y' = 3y - 1 

4 if initial value is positive; y diverges from 


1. y — > 3/2 as t — > oo 

3. y diverges from —3/2 as t oo 

5. y diverges from — 1/2 as t — > oo 

7. y' = 3-y 

9. y = y - 2 

11. y = 0 and y = 4 are equilibrium solutions; y 

0 if initial value is negative. 

12. y = 0 and y = 5 are equilibrium solutions; y diverges from 5 if initial value is greater than 

5; y — 0 if initial value is less than 5. 

13. y = 0 is equilibrium solution; y — 0 if initial value is negative; y diverges from 0 if initial 
value is positive. 

14. y = 0 and y = 2 are equilibrium solutions; y diverges from 0 if initial value is negative; 
y — »• 2 if initial value is between 0 and 2; y diverges from 2 if initial value is greater than 2. 


18. 
g, t in h 


(b) 


19. (h) 


(b) q 
(b) v - 


1250 mg 
sjmgjk 


(j) 16- (c) 17. (g) 

21. (a) dq/dt = 300(10~ 2 — qd0~ 6 7 * * * * * 13 14 ); q in 

(b) q -*■ 10 4 * g; no 

22. dV/dt = —kV 2/3 for some k > 0. 

23. du/dt = — 0.05(u — 70); u in °F, t in min 

24. (a) dq/dt = 500 — 0.4<7; q in mg, t in h 

25. (a) mv' = mg — kv 2 

(c) k- 2/49 

26. y is asymptotic to t — 3 as t — >■ oo 
oo, 0, or — oo depending on the initial value of y 
oo or — oo depending on the initial value of y 
oo or — oo or y oscillates depending on the initial value of y 
—oo or is asymptotic to *j2t — 1 depending on the initial value of y 
0 and then fails to exist after some tf > 0 
oo or — oo depending on the initial value of y 


20. (e) 


27. y — >■ 0 as t — > oo 


28. y 

29. y 

30. y 

31. y 

32. y 

33. y 
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Section 1.2, page 15 


1 . 


2 . 


3. 


4. 

5. 

6 . 

7. 

8 . 
9. 

10 . 

11 . 

12 . 

14. 

15. 

16. 

17. 

18. 


19. 


(a) y = 5 4- (yo - 5)<? ' (b) y = (5/2) + [y 0 - (5/2)]e 2t 

(c) y = 5 4- (y 0 - 5)<r 2 ' 

Equilibrium solution is y = 5in(a)and(c),y = 5/2 in (b); solution approaches equilibrium 
faster in (b) and (c) than in (a). 

(a) y = 5 4- (y 0 - 5)e‘ (b) y = (5/2) + [y 0 - (5/2)]e 2t 

(c) y = 5 + (y 0 - 5)e 2 ' 

Equilibrium solution is y = 5 in (a) and (c), y = 5/2 in (b); solution diverges from equi- 
librium faster in (b) and (c) than in (a). 

(a) y = ce~ at + (b/a) 

(c) (i) Equilibrium is lower and is approached more rapidly, (ii) Equilibrium is higher, 
(iii) Equilibrium remains the same and is approached more rapidly. 

(a) y e = b/a (b) Y' = aY 
(a) y, (t) = ce a ' 
y = ce~ at + (b/a) 

(a) T = 2 In 18 = 5.78 months 
(c) po = 900(1 - e~ 6 ) = 897.8 
(a) r = (ln2)/30 day -1 
(a) 7 = 5 In 50 = 19.56 s 

(a) dv/dt = 9.8, t>(0) = 0 

(c) v = 76.68 m/s 

(b) v = 49tanh(r/5) m/s 
(f) T = 9.48 s 
(a) r = 0.02828 day- 1 

(c) 7 = 24.5 d 
1620 ln(4/3)/ln2 = 672.4 yr 
(a) u = 7 4- (w 0 — T)e~ kt 
6.69 h 

(a) Q(t) = CV( 1 - e~‘ /RC ) 

(c) Q(t) = CV exp [— (t - h)/RC] 

(a) Q = 3(1 - 10- 4 Q), Q(0) = 0 

(b) Q(t) = 10 4 (1 - e- 3 '/ 104 ), t in h; after 1 year Q = 9211 .11 g 


(b) y = ce at + (b/a) 

(b) 7 = 21n[900/(900 — p 0 )l months 

(b) r = (In 2 )/N day^ 1 
(b) 718.34 m 

(b) 7 = V300/4.9 = 7.82 s 
(e) x = 245 In cosh(t/5) m 
(b) Q(t) = 100e-°° 2828 ' 

(b) kx = In 2 

(b) < 2(0 -► CV = Q l 


(c) Q ' = -3Q/10 4 , (2(0) = 9277.77 

(d) Q(t) = 9277.77e- 3t/1 ° 4 , t in h; after 1 year Q = 670.07 g 

(e) 7 = 2.60 yr 

(a) <?' = —<7/300, 9 (0) = 5000 g (b) <7(0 = 5000e- f/3 °° 

(c) no (d) 7 = 3001n(25/6) = 428.13 min = 7.136 h 

(e) r = 2501n(25/6) = 356.78 gal/min 


Section 1.3, page 24 

1. Second order, linear 
3. Fourth order, linear 
5. Second order, nonlinear 
15. r = -2 
17. r= 2,-3 
19. r = -1,-2 
21. Second order, linear 
23. Fourth order, linear 


2. Second order, nonlinear 
4. First order, nonlinear 
6. Third order, linear 
16. r = ± 1 
18. r = 0,1,2 
20. r = 1,4 

22. Second order, nonlinear 
24. Second order, nonlinear 
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CHAPTER 2 


Section 2.1, page 39 


1 . 

2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 
9. 

10 . 

11 . 

12 . 

13. 

15. 

17. 

19. 

21 . 

22 . 

23. 

24. 

25. 

26. 

27. 

29. 

30. 

31. 
39. 
41. 


(c) y = ce~ 3 ' + (1/3) — (1/9) + e~ 2 ‘; y is asymptotic to t/3 — 1/9 as t — > oo 
(c) y = ce 2t + t 3 e 2t / 3; y — >■ oo as t — > oo 
(c) y = ce - ' + 1 + f 2 e - '/ 2; y -> 1 as 1 — >■ oo 

(c) y = (c/2) + (3 cos 2 1) /4f + (3 sin 2r) /2; y is asymptotic to (3 sin 2t) /2 as t — > oo 

(c) y = ce 2 ' — 3e'; y — > oo or — oo as t -> oo 

(c) y = (c — t cost + s'mt)/t 2 ; y — »• 0 as t — »• oo 

(c) y = t 2 e~‘~ + ce - ' ; y -> 0 as r -> oo 

(c) y = (arctan t + c)/(l + t 2 ) 2 ; y — >■ 0 as t — ► oo 

(c) y — ce~' /2 + 32 — 6; y is asymptotic to 3t — 6 as t — > oo 

(c) y = — te~‘ + cl; y -»• oo, 0, or — oo as 1 — >■ oo 

(c) y = ce - ' + sin 2l — 2 cos 2l; y is asymptotic to sin 2l — 2 cos 2l as 1 — >■ oo 

(c) y = ce - ' /2 + 3 1 2 — 121 + 24; y is asymptotic to 3 1 2 — 121 + 24 as 1 ->• oo 

y = 3e' 4- 2(1 - l)e 2 ' 14. y = (2 2 - l)e -2 '/2 


y = (3l 4 - 4l 3 + 6 1 2 + l)/12l 2 16. y = (sinl)/! 2 

y = (1 + 2)e 2 ' 18. y = l -2 [(^r 2 /4) — 1 — 1 cos 1 + sin 1] 

y = —(1 + l)e - '/l 4 , 1 0 20. y = (1 — 1 4- 2 e - ')/l, 1^0 

(b) y = — | cos 1 4- | sin 1 + (a + |)e' /2 ; a 0 = — f 

(c) y oscillates for a = ao 

(b) y = -3e' /3 + (a + 3)e' /2 ; a 0 = -3 

(c) y — >■ —oo for a = ao 

(b) y = [2 + a( 3 tc + 4)e 2,/3 — 2e -ir,/2 ]/ (3^ + 4); ao = —2/(3 jr + 4) 

(c) y — >■ 0 for a = ag 

(b) y = le - ' + (ea — 1 )e - '/l; ao = 1/e 

(c) y — »• 0 as 1 — ► 0 for a = ao 

(b) y = —(cos 1)/1 2 + n 2 a/4t 2 ; a 0 = 4/7T 2 

(c) y — >■ | as 1 — >■ 0 for a = a 0 

(b) y = (e' — e + a sin 1)/ sinl; a 0 = (e — l)/sin 1 

(c) y -> 1 for a = a 0 

(l,y) = (1.364312,0.820082) 28. y 0 = -1.642876 

(b) y = 12 + cos2l + ^ sin2l — ™e -(/4 ; y oscillates about 12 as 1 -> oo 

(c) 1 = 10.065778 
yo = -5/2 

y 0 = —16/3; y — »• — oo as 1 — >■ oo fory 0 = —16/3 

See Problem 2. 40. See Problem 4. 

See Problem 6. 42. See Problem 12. 


Section 2.2, page 47 

1. 3y 2 - 2x 3 = c; y / 0 

3. y -1 + cosx = c if y ^ 0; also y = 0; 

4. 3y + y 2 - x 3 + x = c; y ^ -3/2 

5. 2 tan 2y — 2x — sin 2x = c if cos 2y / 0; 
where 

6. y = sinfln |x| + c] if x ^ 0 and |y| <1; 

7. y 2 — x 2 + 2(6^ — e -x ) = c; y + e^ / 0 
9. (a) y = l/( x 2 - x - 6) 

10. (a) y = — V2x — 2x 2 + 4 

11. (a) y = [2(1 - x)e* - 1] 1/2 

12. (a) r = 2/(1 — 21n0) 

13. (a) y = —[2 ln(l + x 2 ) + 4] 1/2 


2. 3y 2 — 2 In |1 + x 3 | = c; x / — l,y 7^ 0 
everywhere 

also y = ±(2rc + 1 )tt/ 4 for any integer «; every- 
also y = ±1 

8. 3y + y 3 — x 3 = c; everywhere 
(c) — 2 < x < 3 
(c) — 1 < x < 2 

(c) —1.68 < x < 0.77 approximately 
(c) 0 < 9 < 

(c) —00 < x < 00 
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14. 

15. 

16. 

17. 

18. 

19. 

20 . 

21 . 

22 . 

23. 

24. 

25. 

27. 

28. 

29. 

30. 
32. 

34. 

35. 

36. 
38. 


- 1/2 


(c) |*| < 1^5 


(c) x > \Vl5 
(c) — oo < x < oo 

(c) —1.4445 < x < 4.6297 approximately 
(c) |*| < 2.0794 approximately 
(c) |* — 7t/2| < 0.6155 

(c) — 1 < * < 1 


(a) y = |^3 - 2Vl + x 2 
(a) y = -l + lV4* 2 -15 
(a) y = -y/ (x 2 + l)/2 
(a) y = 5/2 — ^/x 3 — e x + 13/4 
(a) y = — | + i V65 - 8e* - 8e- 

(a) y = [n — arcsin (3cos 2 x)]/3 
r - 1 1/3 

(a) y = | (arcsin x) 2 + 1 

y 3 — 3y 2 — * — x 3 + 2 = 0, |x| < 1 

y 3 - 4y - x 3 = -1, |x 3 - 1| < 16/3 a/ 3 or -1.28 < x < 1.60 

y = — l/(x 2 /2 + 2x — 1); x = — 2 
y = — 3/2 4- ^/2x — e* + 13/4; x = ln2 

y = —3/2 + ysin2x + 1/4; x = jr/4 26. y = tan(x 2 + 2x); x = — 1 

(a) y — ► 4 if yo > 0; y = 0 if yo = 0; y — ► — oo if yo < 0 

(b) T = 3.29527 

(a) y — >• 4 as f — >■ oo (b) 7 = 2.84367 

(c) 3.6622 < y 0 < 4.4042 

c ad — fee , . 

x = — y H — In |ay + &| + ^; a ^ 0, ny + b 0 

a a 2 

(e) |y + 2x| 3 |y — 2x| = c 31. (b) arctan (y/x) — In |x[ = c 

(b) x 2 + y 2 — cx 3 = 0 33. (b) |y — x| = c|y + 3x| 5 ; also y = x 

(b) |y + x\ |y + 4x| 2 = c 

(b) 2x/(x + y) + In |x + y| = c; also y = — x 

(b) x/(x + y) + In |x| = c; alsoy=— x 37. (b) |x| 3 |x 2 — 5y 2 | = c 

(b) c|x| 3 = |y 2 — x 2 | 


Section 2.3, page 59 


1 . 

3. 

4. 

5. 

6 . 

7. 

8 . 
9. 

11 . 

12 . 

13. 

14. 

15. 

17. 

18. 


t — 100 In 100 min = 460.5 min 2. Q(t) = 120j/[l — exp(— 1/60)]; 120y 

Q = 50e-°- 2 (l - e-°- 2 ) lb = 7.42 lb 

Q(t) = 200 + t - [100(200) 2 /(200 + r) 2 ] lb, t < 300; c = 121/125 lb/gal; 
lim c = 1 lb/gal 

t— >oo 

(a) Q(t) = Hre“ ,/50 + 25 - =§| cost + ^ sin? 

(c) level = 25; amplitude = 25^2501/5002 = 0.24995 


(c) 130.41s 

(a) (ln2)/ryr (b)9.90yr (c)8.66% 


(a) k(e n — l)/r (b) k = $3930 
k = $3086.64/yr; $1259.92 
(a) t = 135.36 months 
(a) 0.00012097 yr- 1 
(c) 13,305 yr 

P = 201,977.31 - 1977.31e (ln2) ', 0 ; 

(a) r = 2.9632; no 
(c) r = 6.3805 

(b) y c = 0.83 

(a) u(t) = 2000/(1 + 0.048f) 1/3 

(a) u(t) = ce _fa + + kTi(k coscot 

(b) R = 9.11°F; r = 3.51 h 

(c) R = kT 1 /s/k 2 + co 2 -, t = (!/«) 


(c) 9.77% 

10. (a) $89,034.79 (b) $102,965.21 

(b) $152,698.56 

(b) go exp(-0.00012097f), f in yr 

t <tf = 6.6745 (weeks) 

(b) r = 101n2 = 6.9315 

16. t = In ^ / In if min = 6.07 min 

(c) r = 750.77 s 
+ co sin cot) / (k 2 + u> 2 ) 

arctan (co/k) 
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19. 


20 . 

22 . 

23. 

24. 


25. 

26. 

27. 

28. 

29. 

30. 


31. 


32. 


(a) c = k + ( P/r ) + [co — k — (P/r)]e rt/v ; lim c = k + ( P/r ) 

t—>oo 

(b) T = (Eln 2 )/r; T = (V\n 10 )/r 

(c) Superior, T = 431 yr; Michigan, E = 71.4 yr; Erie, E = 6.05 yr; Ontario, 7 = 17.6 yr 

(a) 50.408 m (b) 5.248 s 21. (a) 45.783 m (b) 5.129 s 

(a) 48.562 m (b) 5.194 s 

(a) 176.7 ft/s (b) 1074.5 ft (c) 15 ft/s (d) 256.6 s 
(a) dv/dx — — fiv (b) /r = (66/25) In 10 mi -1 = 6.0788 mi -1 
(c) r = 900/(11 In 10) s = 35.533 s 


, \ m2 s i 

(a) x m = - — In 


1 + 


kv o 


+ 


mv o 


m , 


i + ^° 
mg, 

(b) v = v 0 - gf; yes 


?ng ) k 

(a) u = -( mg/k ) + [ii 0 + (mg/k)\ exp (-kt/m) 

(c) i) = 0 for / > 0 

(a) v L = 2a 2 g(p - p')/9p (b) e = 4 jt a 3 g(p - p')/3E 

(a) 11.58 m /s (b) 13 .45 m (c) k > 0.2394 kg/s 

(a) v = R^2g/(R + x) (b) 50.6 h 

(b) x = ut cos A, y — — g? 2 /2 + ut sin A + h 

(d) — 16L 2 /(m 2 cos 2 A) + L tan A + 3 > PI 

(e) 0.63 rad < A < 0.96 rad (f) u = 106.89 ft/s, A = 0.7954 rad 

(a) v = (u cosA)e ~ rt , w = — g/r + (u sin/1 + g/r)e~ rt 

(b) x = ucos/t(l — e~ n )/r, y = —gt/r + (u sin/1 + g/r)(l — e~ n )/r + h 
(d) u = 145.3 ft/s, A = 0.644 rad 

(d) k = 2.193 


Section 2.4, page 75 


1. 

3. 

5. 

7. 

9. 

10 . 

12 . 

14. 

15. 


16. 

17. 

18. 
20 . 
21 . 

22 . 


26. 

28. 

30. 

31. 


0 < t < 3 

7t/ 2 < ? < 3?r/2 
—2<t<2 

2? + 5y > 0 or 2? + 5y < 0 

1 — t 2 + y 2 > 0 or 1 — t 2 + y 2 < 0, 
Everywhere 


2. 0 < t < 4 
4. —oo < t < —2 
6. 1 < t < n 
8. t 2 +y 2 < 1 
f/0, y^0 

11. y # 0, y / 3 


t ^ nn for n = 0, ±1, ±2, . . . ; y ^ — 1 13. y = i^yg — 4? 2 if yo 7^ 0; 

y = [(1/yo) - 1 2 ] _1 if yo / 0; y = 0 if y 0 = 0; 

interval is |?| < 1/^/yo if yo > 0; —00 < t < 00 if yo < 0 


\t\ < |yol/2 


y = y 0 /^2ty 2 + 1 if y 0 / 0; y = 0 if y 0 = 0; 
interval is — \/2y\ < t < 00 if y 0 ^ 0; —00 < ? < 00 if y 0 = 0 
y = i^flnfl + f 3 )+y§; -[1 - exp(-3yg/2)] 1 '' 3 <t < 0 o 

y — >■ 3 if yo > 0; y = 0 if y 0 = 0; y — ► — 00 if y 0 < 0 

y -/ —00 if yo < 0; y — / 0 if yo > 0 19. y — >■ 0 if yo < 9; y -4- 00 if yo > 9 

y -> —00 if yo < y c ^ —0.019; otherwise y is asymptotic to ^/t — 1 
(a) No (b) Yes; set t 0 = 1/2 in Eq. (19) in text, 

(c) |y| < (4/3) 3/2 = 1.5396 

(a) y^?) is a solution for t > 2; y 2 (?) is a solution for all f 

(b) f y is not continuous at (2, —1) 

1 1 

(a) yi (?) = — — ; y 2 (t) = —— / p.(s)g(s)ds 

M(0 Jf 0 

y = ±[5?/(2 + 5c? 5 )] 1/2 29. y = r/(k + cre~ n ) 

y = ±[e/(cr + cee _2f ')] 1/2 

, where /x(?) = exp(2r sin? + 277) 


y = ± //(?) 


>/w 


p(s) dy + c 
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32. y = \( 1 — e 2t ) for 0 < t < 1; y = |(e 2 — l)e 21 for f > 1 

33. y = e -2 ' for 0 < r < 1; y = e _(t+1) for t > 1 


Section 2.5, page 88 

1. y = 0 is unstable 

2. y = —a/b is asymptotically stable, y = 0 is unstable 

3. y = 1 is asymptotically stable, y — 0 and y — 2 are unstable 

4. y = 0 is unstable 5. y = 0 is asymptotically stable 

6. y = 0 is asymptotically stable 7. (c) y = [yo + (1 — yo)kt]/[l + (1 — yo)/ct] 

8. y = 1 is semistable 

9. y = — 1 is asymptotically stable, y = 0 is semistable, y = 1 is unstable 

10. y = — 1 andy = 1 are asymptotically stable, y = 0 is unstable 

11. y = 0 is asymptotically stable, y = b 2 /a 2 is unstable 

12. y = 2 is asymptotically stable, y = 0 is semistable, y = — 2 is unstable 

13. y = 0 and y = 1 are semistable 

15. (a) r = (1 /r) In 4; 55.452 yr 

(b) T = (1/r) ln[j0 (1 - a)/(l - /?)<*]; 175.78 yr 

16. (a) y = 0 is unstable, y = K is asymptotically stable 

(b) Concave up for 0 < y < K/e, concave down for K/e < y < K 

17. (a) y = K exp{[ln(y 0 /iC>]e-"} (b) y(2) = 0.7153i<: = 57.6 x 10 6 kg 

(c) r = 2.215 yr 

18. (b) (h/a)yjk/an\ yes (c) k/a < na 2 

19. (b k 2 /2g(aaj 1 

20. (c) Y = Ey 2 = KE[ 1 - (E/r)] (d) Y m = Kr/4 for E = r/2 

21. (a) y,, 2 = Tqi T ^1 - (4/z/r7C)]/2 

22. (a) y = 0 is unstable, y = 1 is asymptotically stable 
(b) y = y ally a + (1 - yo)e~ a ‘] 

23. (a )y=y 0 e~ fl ‘ (b) x = x 0 exp[-ay 0 (l - e _/i, )/(S] (c) x 0 exp(-ay 0 //S) 

24. (b) z = l/[v + (1 - uje^'l (c) 0.0927 

25. (a,b) a = 0: y = 0 is semistable. 

a > 0: y = ,/a is asymptotically stable and y = — is unstable. 

26. (a) a < 0: y = 0 is asymptotically stable. 

a > 0: y = 0 is unstable; y = J~a and y = —Ja are asymptotically stable. 

27. (a) a < 0: y = 0 is asymptotically stable and y = a is unstable. 
a = 0: y = 0 is semistable. 

a > 0: y = 0 is unstable and y = u is asymptotically stable. 

- 11 

28. (a) lim x(t) = min(p,<7); x(t) = 

V 7 f-»co ^ 7 q e a(q-p)t_p 

9 

n oit 

(b) lim x(t) = p\ x(t) = 

vat + 1 


Section 2.6, page 99 


1. x 2 + 3x + y 2 — 2y = c 
3. x 3 - x 2 y + 2x + 2y 3 + 3y = c 
5. ax 2 + 2 bxy + cy 2 = k 
7. e* siny + 2y cosx = c; alsoy = 0 
9. e xv cos 2x + x 2 — 3 y = c 
11. Not exact 

13. y = [x + V28 - 3.r 2 ]/2, |x| < ^28/3 

14. y = [x — (24x 3 + x 2 - 8x - 16) 1/2 ]/4, 


2. Not exact 
4. x 2 y 2 + 2xy = c 
6. Not exact 
8. Not exact 
10. y lnx + 3x 2 — 2y = c 
12. x 2 + y 2 = c 


x > 0.9846 
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15. 

b = : 

5; x 2 y 2 

+ 2x 3 y = c 

16. 

19. 

x 2 + 

2 In |y| - 

■ y~ 2 = c; also y = 0 

20. 

21. 

xy 2 - 

- (y 2 - 2y + 2)e y = c 

22. 

24. 


= exp J 

R(t) dt , where t = xy 

25. 

26. 

f*W 

= e~ x ; 

y = ce* + 1 + e 2 * 

27. 

28. 

n(y) 

= e 2 y /r. 

xe 2v — In |y| = c; also y 

= 0 

29. 

n(y) 

= siny; 

e x siny + y 2 = c 

30. 

31. 

fi(x,y) = xy; 

x 3 y + 3x 2 + y 3 = c 



b = 1; e 2xy + x 2 = c 
e x siny + 2y cosx = c 
x 2 e* siny = c 

/r(x) = e 3 *; (3x 2 y + y 3 )e 3x = c 

/i(y) = y; xy + y cosy — siny = c 

/x(y) = y 2 ; x 4 + 3xy + y 4 = c 


Section 2.7, page 109 


1 . 

2 . 

3. 

4. 


5. 

7. 

8 . 

9. 

11 . 


12 . 


13. 


14. 


15. 


(a) 1.2, 1.39, 1.571, 1.7439 

(c) 1.19631, 1.38335, 1.56200, 1.73308 

(a) 1.1, 1.22, 1.364, 1.5368 

(c) 1.10775, 1.23873, 1.39793, 1.59144 

(a) 1.25, 1.54, 1.878, 2.2736 

(c) 1.26551, 1.57746, 1.94586, 2.38287 

(a) 0.3, 0.538501, 0.724821, 0.866458 

(b) 0.284813, 0.513339, 0.693451, 0.831571 

(c) 0.277920, 0.501813, 0.678949, 0.815302 

(d) 0.271428, 0.490897, 0.665142, 0.799729 
Converge for y > 0; undefined for y < 0 
Converge 


(b) 1.1975, 1.38549, 1.56491, 1.73658 
(d) 1.19516, 1.38127, 1.55918, 1.72968 
(b) 1.105, 1.23205, 1.38578, 1.57179 
(d) 1.1107, 1.24591, 1.41106, 1.61277 
(b) 1.26, 1.5641, 1.92156, 2.34359 
(d) 1.2714, 1.59182, 1.97212, 2.42554 


6. Converge for y > 0; diverge for y < 0 


Converge for |y(0)| < 2.37 (approximately); diverge otherwise 


Diverge 10. Diverge 

(a) 2.30800, 2.49006, 2.60023, 2.66773, 2.70939, 2.73521 

(b) 2.30167, 2.48263, 2.59352, 2.66227, 2.70519, 2.73209 

(c) 2.29864, 2.47903, 2.59024, 2.65958, 2.70310, 2.73053 

(d) 2.29686, 2.47691, 2.58830, 2.65798, 2.70185, 2.72959 

(a) 1.70308, 3.06605, 2.44030, 1.77204, 1.37348, 1.11925 

(b) 1.79548, 3.06051, 2.43292, 1.77807, 1.37795, 1.12191 

(c) 1.84579, 3.05769, 2.42905, 1.78074, 1.38017, 1.12328 

(d) 1.87734, 3.05607, 2.42672, 1.78224, 1.38150, 1.12411 

(a) -1.48849, -0.412339, 1.04687, 1.43176, 1.54438, 1.51971 

(b) -1.46909, -0.287883, 1.05351, 1.42003, 1.53000, 1.50549 

(c) -1.45865, -0.217545, 1.05715, 1.41486, 1.52334, 1.49879 

(d) -1.45212, -0.173376, 1.05941, 1.41197, 1.51949, 1.49490 

(a) 0.950517, 0.687550, 0.369188, 0.145990, 0.0421429, 0.00872877 

(b) 0.938298, 0.672145, 0.362640, 0.147659, 0.0454100, 0.0104931 

(c) 0.932253, 0.664778, 0.359567, 0.148416, 0.0469514, 0.0113722 

(d) 0.928649, 0.660463, 0.357783, 0.148848, 0.0478492, 0.0118978 

(a) -0.166134, -0.410872, -0.804660, 4.15867 

(b) -0.174652, -0.434238, -0.889140, -3.09810 


16. A reasonable estimate fory at t = 0.8 is between 5.5 and 6. No reliable estimate is possible 
at t = 1 from the specified data. 


17. A reasonable estimate fory at t = 2.5 is between 18 and 19. No reliable estimate is possible 
at t = 3 from the specified data. 

18. (b) 2.37 < a 0 < 2.38 


19. (b) 0.67 < ot 0 < 0.68 
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Section 2.8, page 118 

1. dw/ds = (s + l) 2 + (w + 2) 2 , w(0) = 0 2. dw/ds = 1 — (w + 3) 3 , iu(0) = 0 

n ^ 2 k t k 

3. (a) M) = ^— (c) lim^oo <t> n (t) = e 2 ' — 1 


4. (a) 0„(f) = ^ 


(-l)*f* 

A-! 


(c) lim„^o o 0„(t) = e ‘ -1 


5. (a) <p n (t) = (~l) k+l t k+1 /(k + l)\2 k 1 (c) linin^oc 0„(t) — 4e (/2 + 2r - 4 


6. (a) (fin(t) = t - 


ftl+1 


(n + 1)! 


(c) limn^oo <t>n(t) = t 


n t 2k-l n f>k-l 

7 - (a) M) E 1 ■ 3 ■ 5 ■ ■ ■ (2fc — 1) 8 - (a) Mt) = - £ 2 ■ 5 ■ 8 • ■ ■ (3fc — 1) 

*=1 V y A:=l V ' 

f 3 t 3 t 1 t 3 t 1 2t n t 1S 

9. (a) 0i(r) = 0 2 (t) = - + ; 0 3 (t) = - + + + = 

3 3 7-9 3 7-9 3 • 7 • 9 ■ 11 (7 • 9) 2 • 15 

f 4 f 4 3 t 1 3 r 10 t 13 

10. (a) *(0 = *; 0 2 (r) =r- 0,(0 = r + — 

t 2 t A t 6 

11. (a) 0i (t) = t, 02 (0 = r— gj - * - 

t 2 f 3 f 4 7f 5 14 1 6 

03 ( o - r -^ + ^ + ^- 5r + ^r + °( f )’ 

t 2 t 3 It 5 31f 6 _ , 

04 (?) — t — — -|- — — -|- -\- (D(t ) 

2! 3! 5! 6! 

t 3 

12. (a) 0i(O = -t-t 2 - 

t 2 t 3 r 4 t 5 t 6 

Mt ) -- f -2 + 6 + 4 ~ 5“24 + 0(n ’ 




r 


3/ 5 4r 


0 3 (O--t-- + --^ + — + 0(/ 7 ), 


M )--t-- + --— + - + 0(P) 


Section 2.9, page 130 


1. 

2 . 

3. 

4. 

5. 

6 . 
7. 
9. 

10 . 

11 . 

12 . 


y n = (— l)"(0.9)"y 0 ; y„ -> 0 as n oo 

y« = yo/(« + 1); y« o as n oo 

y n = yoV(ra + 2)(n + l)/2; y„ — >■ oo as n — > oo 

I Vo, if n — 4k or n = 4 k — 1; 

— y 0 , if n = 4k — 2 or n = 4A' — 3; 
y K has no limit as n —>■ oo 
y„ = (0.5)" (y 0 — 12) + 12; y„ — > 12 as n — > oo 
y n = (— l)”(0.5)"(yo - 4) + 4; y„ -► 4 as n -► oo 
7.25% 8. $2283.63 

$258.14 

(a) $804.62 (b) $877.57 (c) $1028.61 

30 years: $804.62/month; $289,663.20 total 20 years: $899.73/month; 
$215,935.20 total 

$103,624.62 13. 9.73% 
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16. (b) u n —*■ — oo as ti — ^ oo 

19. (a) 4.7263 (b) 1.223% (c) 3.5643 (e) 3.5699 


Miscellaneous Problems, page 132 


2. 2 y + cosy — x — sin x — c 
4. y = -3 + ce*-* 2 
6. y = x 1 (1 -e'-*) 

8. y = (4 4- cos 2 — cosx)/x 2 
10. x + In |x| + x _1 + y — 2 In |y| = c; alsoy = 0 
12. y = ce~ x + e~ x ln(l + e*) 

14. x 2 + 2xy + 2y 2 = 34 
16. e~ x cos y + e 2y sinx = c 


18. y = e~ 2x f 
Jo 


e ° 2 ds + 3e 2x 


20. e* + e~ y = c 

22. y 3 + 3y — x 3 + 3x = 2 

24. sinysin 2 x = c 

26. e~ y/x + In |x| = c 

28. x 3 + x 2 y = c 

30. (y 2 /x 3 ) + (y/x 2 ) = c 


1. y = (c/x 2 ) + (x 3 /5) 

3. x 2 + xy - 3y - y 3 = 0 
5. x 2 y + xy 2 + x — c 
7. x 4 + x — y 2 — y 3 = c 
9. x 2 y + x + y 2 = c 
11. (x 3 /3) + xy + = c 

13. y = tan(x + x 2 + c) 

15. y = c/cosh 2 (x/2) 

17. y = ce 3x - e 2 * 

19. 2xy + xy 3 - x 3 = c 
21. 2xy 2 + 3x 2 y - 4x + y 3 = c 
e 2 ' e-‘ 

23 - y= 3i +c T 

25. (x 2 /y) + arctan(y/x) = c 
27. (x 2 + y 2 + l)e _v2 =c_ 

29. arctan(y/x) — In y/x 2 + y 2 = c 

31. x 3 y 2 + xy 3 = -4 

34. (a) y = t + (c - 1) _1 
(c) y = sin f + (c cos 1 — i sin f) -1 

35. (a) v' — \x(t) + b]v = b 

(b) v = ^b f i±(t) dt + cj//x(0, M(0 = exp[— (at 2 /!) — bt] 

36. y = cif -1 + ci + In t 37. y = ci In t + cj + f 

38. y = (1/A:) In | (A: — t)/(k + t)\ + C 2 if Ci = A 2 > 0; y = (2/A) arctan(l/A) + C 2 if 

Ci = —A 2 <0; y = — 2? -1 + c 2 if Ci = 0; also y = c 

39. y = ±|(? — 2ci)^/r + ci + c 2 ; also y = c Hint: /c( v) = c -3 is an integrating factor. 

40. y = cie _t + c 2 — fe - ' 

41. c 2 y = cit — In |1 4- cif| + c 2 if ci 0; y = ft 2 + c 2 if ci = 0; also y = c 

42. y 2 = cit + c 2 43. y = Ci sin(t + c 2 ) = Ai sin t + k 2 cos t 

44. |y 3 — 2ciy + c 2 = 2t; also y = c 45. t + c 2 = ±§(y — 2ci)(y + Ci) 1/2 

46. y In |y | — y + ciy + t = c 2 ; also y = c 47. e y = (t + c 2 ) 2 + ci 

48. y = |(f + 1) 3/2 - | 49. y = 2(1 - f)“ 2 

50. y = 3 Inf — | ln(t 2 + 1) — 5 arctant + 2 + | In 2 + 

51 ■y=i< 2 + ! 


i r x 

32. - = -x / 

y Ji 


e x 

-r-rfsl — 
s 2 2 


(b) y = r 1 + 2t(c - t 2 )- 1 


CHAPTER 3 Section 3.1, page 144 

1. y = Cie' + c 2 e -3 ' 2. y = cie - ' + c 2 e~ 2 ' 

3. y = cie ,/2 + c 2 e _,/3 4. y = cic f/2 + c 2 e' 

5. y = ci + c 2 c -5 ' 6. y = cie 3 ' /2 + c 2 e _3 ' /2 

7. y = ci exp[(9 + 3\/5)t/2] + c 2 exp[(9 — 3v / 5)r/2] 

8. y = Ci exp[(l + V3)t] + c 2 exp[(l — V3)t] 9. y = e'; y — >■ cx) as t — ► cx) 

10. y = 3 e - ' — ^e _3t ; y -»• 0 as t — >■ oo 11. y = 12e ,/3 — 8e f/2 ; y ->• — oo as t — > oo 

12. y = — 1 — e -3 '; y — > — 1 as f — > oo 

13. y = ^(13 + 5^/l3) exp[(— 5 + \/l3)f/2] + ^(13 — 5 a/ 13) exp[(— 5 — \/T3)r/2]; y->0as 
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14. y = (2/V33) exp[(— 1 + V^3)f/4] - (2/V33) exp[(-l - V33)t/4]; y — ► oo as t — > oo 

15. y = Ae~ 9( ' _1) + jge ,_1 ; y oo as f -* oo 

16. y = -le«+ 2 )/ 2 + |e _(,+2)/2 ; y ->• -oo as f -> oo 

17. y" + y' - 6y = 0 18. 2y" + 5y' + 2y = 0 

19. y = \e‘ + e~‘\ minimum isy = 1 at t — In 2 

20. y = — e‘ + 3e f/2 ; maximum is y = | at f = ln(9/4), y = 0 at f = In 9 

21 . a = -2 22. p = -1 

23. y ->• 0 for ot < 0; y becomes unbounded for a > 1 

24. y -> 0 for a < 1; there is no a for which all nonzero solutions become unbounded 

25. (a) y = 1(1 + 2P)e~ 2 ' + | (4 - 2pV 12 

(b) y = 6.71548 when t = § In 6 = 0.71670 (c) ,8 = 2 

26. (a) y = (6 + P)e~ 2 ‘ - (4 + ^e” 3 ' 

(b) t m = ln[(12 + 3/3)/ (12 + 2/1)], y m = £(6 + /D 3 /(4 + /l) 2 

(c) fi = 6(1 + V3) = 16.3923 (d) t,„ -*■ ln(3/2), y„, — > oo 

27. (a ) y = d/c (b) aY" + bY' + cY = 0 

28. (a) 6 > 0 and 0 < c < b 2 /4a (b) c < 0 (c) b < 0 and 0 < c < b 2 /ia 


Section 3.2, page 155 


1. 

3. 

5. 

7. 

9. 

11 . 

14. 

16. 

18. 

20 . 

21 . 

22 . 

23. 

24. 
26. 
28. 

29. 

31. 

34. 

36. 

40. 

42. 

43. 

44. 

45. 
48. 
51. 


_lJP 

2 

2. 

1 

e- 4 ' 

4. 

x 2 ^ 1 

-e 2 ' 

6. 

0 

0 < t < oo 

8. 

— 00 < t < 1 

0 < t < 4 

10. 

0 < f < oo 

0 < x < 3 

12. 

2 < x < 3?r/2 

The equation is nonlinear. 

15. 

The equation is nonhomogeneous. 

No 

17. 

3te 2 ' + ce 2 ' 

te' + ct 

19. 

5 W(f,g) 


— 4(t cost — sinf) 

y 3 and y 4 are a fundamental set of solutions if and only if aib 2 — a 2 £>i / 0. 

yi(t) = \e- 2 ‘ + §e‘, y 2 (t ) = -i<r 2 ' + |e' 

yi(f) = -i<? -3(f-1) + y 2 (f) = -ie _3((_1) + 

Yes 25. Yes 

Yes 27. Yes 

(b) Yes (c) [y 1 (t),y 3 (t)] and [y 1 (t),y 4 (t)] are fundamental sets of solutions; [y 2 (t),y 3 (t)] 
and [y 4 (f),y 5 (t)] are not 


ct 2 e' 30. c cost 

c/x 32. c/(l— x 2 ) 

2/25 35. 3Ve = 4.946 

p(t) = 0 for all f 

If t 0 is an inflection point, and y = 0 (t) is a solution, then from the differential equation 
p(t 0 )<p'{to ) + q(to)4>(t 0 ) = 0. 


Yes, y=c 1 e~ x2/2 

No 

Yes, y = 


f e^ 12 dt + 

Jxn 


c 2 e 




— - — l r 

MW L A 0 


m(0 


dt -|- c 2 


Pl(x) = exp — 


[~I(l 


cosx . 

H I ax 

x x 


Yes, y = ci* -1 + c 2 x 47. x 2 /x" + 3 xfi' + (1 + x 2 — v 2 )/i = 0 

(1 — x 2 )/jl" — 2 xfx! + a(a + 1 )/x = 0 49. /x" — x/x = 0 

The Legendre and Airy equations are self-adjoint. 
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Section 3.3, page 163 


1 . 

3. 

4. 

5. 

6 . 
7. 
9. 

11 . 

13. 

15. 

17. 

18. 

19 . 

20 . 
21 . 
22 . 

23. 

24. 

25. 

26. 

35. 

37. 

39. 

41. 

44. 

45. 

46. 


-1.1312 + 2.4717/ 


2. e 2 cos 3 - ie 2 sin 3 = -7.3151 - 1.0427/ 


e cos 2 + ie sin 2 = 

-1 

e 2 cos(?r/2) — ie 2 sin(jr/2) = — e 2 / = —7.3891/ 

2cos(ln2) — 2/sin(ln2) = 1.5385 — 1.2779/ 

jr -1 cos(21njr) + in -1 sin(21n7r) = —0.20957 + 0.23959/ 

y = c\e' cos t + C 2 e l sin f 8. y = C\e‘ cos ->/5 1 + c 2 e' sin s/5 1 

y = c\e 1{ + c 2 e “ 4r 10. y — C\e~ l cos t + C 2 e~' sin / 

y = cie -3 ' cos 2 1 + c 2 e -3 ' sin2f 12. y = c\ cos(3//2) + C 2 sin(3f/2) 

y = c\e~‘ cos(//2) + c 2 e~' sin(f/2) 14. y = Cje' /3 + c 2 e _4 ' /3 

y = cie _,/2 cosf + c 2 e _,/2 sinf 16. y = Cje -2 ' cos(3//2) + c 2 e -2 ' sin(3f/2) 

y=^sin2f; steady oscillation 

y = e -2 ' cos f + 2e -2 ' sin f ; decaying oscillation 

y = — e ,_jr/2 sin 2f ; growing oscillation 

y = (1 + 2\/3) cos / — (2 — V3) sin r; steady oscillation 

y = 3e _,/2 cos t + 3 e~‘ /2 sin f; decaying oscillation 

y = \/2 e ~('- ,r / 4 ) cos t + ~J2 e -('- lr /4) sin f; decaying oscillation 

(a) = 2e' /6 cos(V23//6) — (2/\/23)e t/6 sin(v / 23//6) 

(b) / = 10.7598 

(a) u = 2e~ ,/s cos(\/34//5) + (7/V34) e _r/5 sin(V34f/5) 

(b) T = 14.5115 

(a) y = 2e _ 'cos\/5f + [(a + 2)/-\/5]e _ 'sin\/5f (b) a = 1.50878 

(c) / = { 7r — arctan[2V5/(2 + a)]}/-s/5 (d);r/\/5 

(a) y = e~ at cos t + ae^‘" sin t (b) T = 1.8763 
(c) a = i, T = 7.4284; a = §, T = 4.3003; a = 2, T = 1.5116 


• 1 cos(^ Inf) + c 2 f 1 sin(^lnf) 


y = ci cos(ln t) + c 2 sin(ln f ) 
y = cif 

y = cit 2 + c 2 f 3 
y = Cif + c 2 r 3 

Yes, y = ci cosx + c 2 sinx, 

No 

Yes, y = cie _(2/4 cos(-\/3 f 2 /4) + c 2 e _,2/4 sin(-\/3 f 2 /4) 


36. y = C\t 1 + c 2 f 2 


/ 


38. y = cif 6 + c 2 f _1 
40. y = Cif cos(21nf) + c 2 f sin(21nf) 
42. y = Ci/ -3 cos(lnf) + c 2 f -3 sin(lnf) 

e“ <2/2 df 


Section 3.4, page 171 


1 . 

3. 

5. 

7. 

9. 

11 . 

12 . 

13. 

14. 

15. 


16. 

17 . 


y = cie' + c 2 fe' 
y = Cie _,/2 + c 2 e 3t / 2 
y = Cie' cos 3f + c 2 e‘ sin 3 f 
y = Cie _</4 + c 2 e~ 4 ' 
y = cie 2,/5 + c 2 fe 2,/5 

y = 2e 2,/3 — | fe 2,/ 3 , y -> — oo as t — > oo 

y = 2fe 3 ', y — > oo as t — > oo 


2. y = cie _,/3 + c 2 fe _,/3 
4. y = Cie _3,/2 + c 2 fe _3,/2 
6. y = Cie 3 ' + c 2 fe 3 ' 

8. y = Cie -3,/4 + c 2 fe _3 ' /4 
10. y = e“' /2 cos(f/2) + c 2 e“' /2 


y = — e ,/3 cos3f + |e ,/3 sin3f, y -> 0 as f — >• oo 
y = 7e _2(,+1) + 5fe _2((+1> , y -* 0 as f oo 


(a) y = e _3,/2 — \te ~ 3tl2 


(b) t = | 


(c) f 0 = 16/15, y 0 = — §e- 8/5 = -0.33649 

(d) y = e ~ 3,/2 + (b + |)fe _3</2 ; b = — \ 
y = 2e' /2 + (b - 1 )fe'/ 2 ; b = 1 


sin(f/2) 


(a) y = e ~ t/2 + ffc"'/ 2 (b) t M = |, y M = 5c” 4 / 5 = 2.24664 

(c) y = e _</2 + (b + ^)te~' /2 

(d) t M = 4b/(l + 2b) -»• 2 as b -> oo; y M = (1 + 2b) exp[— 2b/(l + 2b)] -»• oo 


as b 


oo 
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18. (a) y = ae -2 ' /3 + (§ a - l)fe -2 ' /3 

23. y 2 (t) = t 3 

25. y 2 (t ) = f -1 Inf 

27. y 2 (x) = cosx 2 

29. y 2 (x) = x 1/4 e" 2 ^ 

32. y = cxe-^' 2 f e Ss2/2 ds + c 2 e -J * 2/2 
Jo 

34. y2(0 = f -1 Inf 

36. y 2 (x) = x 

39. (b) y 0 + (a/b)y' 0 

42. y = dr 1/2 + c 2 f -1/2 Inf 

44. y = Qf -1 + C 2 f _1 Inf 

46. y = cyr 2 cos(3 In f) + C 2 f -2 sin(3 In f) 


(b) a = | 

24. y 2 (f) = r 2 
26. y 2 (f) = fe' 

28. y 2 (x) = x 
30. y 2 (x) = x _1/2 cosx 

33. y 2 (f) = yi(f) 

35. y 2 (f) = cosf 2 
37. y 2 (x) = x -1/2 cosx 
41. y = cA + c 2 f 2 Inf 
43. y = cif + C 2 f 5/2 
45. y = ci f 3/2 + C 2 f 3/2 In t 


/ y, 2 (.v)cxp I - / p(r)dr 
Jto L 7s 0 


ds 


Section 3.5, page 183 

1. y = Cie 3 ' + c 2 e~‘ — e 2t 

2. y = Cxe - ' cos 2 f + c 2 e - ' sin 2f + ^ sin 2f — ] 2 cos 2f 

3. y = Cxe 3 ' + c 2 e - ' + j|fe - ' + |f 2 e - ' 4. y = Cx 4- c 2 e -2 ' + |f — 1 sin2f — | cos2f 

5. y = cx cos 3 f + C 2 sin 3f + ^ (9f 2 — 6f + l)e 3 ' + | 

6. y = cxe - ' + C 2 fe - ' + f 2 e - ' 

7. y = cxe - ' + c 2 e~ t! 2 + t 2 — 6f + 14 — ^ sin f — 2, cos f 

8. y = Cx cos f + c 2 sin f — | f cos 2f — | sin 2 f 

9. !( = cx cos a>of + C 2 sin &>of + (&>o — w 2 ) -1 cos wf 

10. m = cx cos coot + c 2 sin coot + (1 /2a>a)t sin a>ot 

11. y = cie -f/2 cos(Vl5 f/2) + c 2 e~ ,/2 sin(VT5 f/2) + le' — le - ' 

12. y = cie - ' + C 2 e 2 ' + Ife 2 ' + \e~ 2t 13. y = e' — le -2 ' — f — 1 

14. y = jg sin2f — ^ cos2f + if 2 — 1 4- le' 15. y = 4fe' — 3e' + |f 3 e' + 4 

16. y = e 3 ' + |e - ' — |e 2 ' — fe 2 ' 17. y = 2 cos 2 f — | sin2f — If cos2f 

18. y = e - ' cos 2f + le - ' sin 2f + fe - ' sin 2f 

19. (a) Y (f) = f(Af 4 + Af 3 + Ait 1 + A* + A) + f(£ot 2 + £if + B 2 )e~ 3t 
+ D sin 3f + £ cos 3f 

(b) A = 2/15, A = -2/9, A 2 = 8/27, A = -8/27, At = 16/81, £ 0 = -1/9, 

£1 = -1/9, Bi = -2/27, 7) = -1/18, £ = -1/18 

20. (a) Y(t) = Aot + Ai + t(Bot + Bi) sint + t(Dot + Di) cost 
(b) A 0 = 1, A = 0, fi 0 = 0, £1 = 1/4, D 0 = -1/4, D, = 0 

21. (a) Y(f) = e'04 cos 2f + £ sin 2f) + (£V + 7)i)e 2 ' sin f + (£ 0 f + £i)e 2 ' cos f 

(b) A = -1/20, B = -3/20, D 0 = -3/2, D, = -5, £ 0 = 3/2, £j = 1/2 

22. (a) Y(t) = Ae~' + t(B 0 t 2 + Bj^t + B 2 )e~' cost + t(D 0 t 2 + D\t + D 2 )e~‘ sint 
(b) A = 3, £ 0 = -2/3, £x = 0, £ 2 = 1, Do = 0, D x = 1, D 2 = 1 

23. (a) Y(f) = A 0 t 2 + Af + A + t 2 (B 0 t + B\)e 2 ‘ + (D 0 f + Di) sin2f + (£ 0 f + E\) cos2f 
(b) A = 1/2, A = 1, A = 3/4, B 0 = 2/3, £1 = 0, D 0 = 0, Di = -1/16, 

E 0 = 1/8, A = 1/16 

24. (a) Y(f) = f(Af 2 + Af + A) sin2f + f(£ 0 f 2 + £if + £ 2 ) cos2f 

(b) A = 0, A = 13/16, A = 7/4, £ 0 = -1/12, fl, = 0, £ 2 = 13/32 

25. (a) Y(f) = (At 2 + Af + A)e' sin 2f + (£ot 2 + £if + £ 2 )^' cos 2f + 
e - '(D cos t + £ sin f) + Fe' 

(b) A = 1/52, A = 10/169, A = -1233/35,152, £ 0 = -5/52, £1 = 73/676, 

B 2 = -4105/35,152, £> = -3/2, £ = 3/2, £ = 2/3 

26. (a) Y (f) = f (Af + A)e - ' cos 2 f + t(B 0 t + £i)e - ' sin 2f + (D 0 f + D i)e -2 ' cos f + 

(£of + £i)e -2 ' sinf 

(b) A = 0, A = 3/16, £ 0 = 3/8, £1 = 0, D 0 = -2/5, D, = -7/25, £ 0 = 1/5, 

£1 = 1/25 
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27. (b) w = -§ + Cl e 5 ' 

N 

28. y = ci cos Xt + c 2 sin Xt + [ o m /( 7. 2 — w 2 7t 2 )] sinmjr? 

m=l 

0 < ? < jr 
t > n 

0 < ? < it /2 

— | (1 + e ir/2 )e - ' cos 2? — jL(l + e lt/2 )e~ t sin 2?, ? > tt/ 2 


29. y = 


30. y = 

31. No 


—(1 + jt/ 2) sin? — (it/ 2) cost + (jr/2)e n 
| — ^e _ 'sin2 1— |e~'cos2?, 


34. y = cie 4 ' + c 2 e ' — |e 2 ' 


Section 3.6, page 189 


2. Y(t) = —\te~‘ 


1. Y(?) = e' 

3. Y(t) = §?V' 4. Y(?) = 2?V /2 

5. y = ci cos t + c 2 sin ? — (cos ?) ln(tan ? + sec t) 

6. y = ci cos 3? + ci sin 3? + (sin 3?) ln(tan 3 1 + sec 3?) — 1 

7 . y = c\e~ 2t + c 2 te~ 2 ' — e~ 2 ‘ In t 

8. y = ci cos 2? + c 2 sin 2? + ^(sin2?) In sin 2? — |? cos 2? 

9. y = ci cos(?/2) + c 2 sin(?/2) + t sin(?/2) + 2[ln cos(?/2)] cos(?/2) 

10. y = C\e' + c 2 ?e' — ^e' ln(l + t 2 ) + te’ arctan? 

11. y.c,e» + c ,e‘ + /l'*->-e»->, g <s><ts 

12. y = ci cos 2? + c 2 sin 2? + i J [sin 2(t — J)]g(s) ds 


13. 

Y (?) = 

| + ? 2 In ? 

14. Y (t) = 

— 2? 2 



15. 

Y (?) = 

i(?-l)e 2 ' 

16. Y (t) = 

-^(2?- 

- l)e-' 


17. 

Y(x) = 

gx 2 (lnx) 3 

18. Y(x) = 

-i* 1/2 

cosx 


19. 

Y (x) = 

f xe‘ — te* 

20. Y(x) = 


? _3/2 sin(x 

- ?)g(?) dt 

23. 

(b)y = 

y 0 cost + y' 0 sint + / 

JtQ 

sin(? — s)g(s) ds 




24. 

y=(b 

- a)- 1 f [e 6( '- J) - e“ ( '- 

_I) ] g(s) ds 25. y = fi- 

i r'gw- 

~ s) sin /x(? — 

s)g(s) ds 



JtQ 


JtQ 



26. 

y= [ 

(? — i)e II(f_I) g(i) ds 

29. y = ci? 

+ c 2 ? 2 + 4? 2 In ? 


30. 

y = ci?’ 

+ 

£ 

i 

+ 

31. y — C\( 1 + t) + 

c 2 e‘ + | (? - 

- l)e 2 ' 

32 . 

y = cie 

' + c 2 ? — ^(2? — l)e ‘ 






Section 3.7, page 202 

1. u = 5 cos(2 1 — 8), 8 = arctan(4/3) = 0.9273 

2. it = 2 cos (? — 2 jt/3) 

3. u = 2\/5cos(3? — 5), 5 = — arctan(l/2) ~ —0.4636 

4. m = Vl3cos(7r? — 8), 8 = n + arctan(3/2) = 4.1244 

5. it = \ cos 8 1 ft, t in s; a> = 8 rad/s, 7 = it /A s, R = 1/4 ft 

6. u = | sin 14? cm, ? in s; ? = tt/14 s 

7. u = (1/4a/ 2) sin(8\/2?) — ^ cos(8\/2?) ft, ? in s; a> = 8-s/2 rad/s, 

T = it/ Ay/2 s, R = ^11/288 = 0.1954 ft, 8 = it — arctan(3/\/2) = 2.0113 

8. Q = 10~ 6 cos 2000? C, ? in s 

9. u = e _10 '[2cos(4V6?) + (5/V6) sin(4\/6?)] cm, ? in s; 

fi = 4V6 rad/s, T d = it/2V6s, T d /T = 7/2^6 = 1.4289, r = 0.4045 s 

10. u = (l/8V^T)e _2( sin(2-v/31 ?) ft, ? in s; ? = 7T/2V31 s , r = 1.5927 s 
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11 . 

12 . 

13. 

16. 

17. 

20 . 

23. 

25. 

26. 

27. 

28. 

29. 

30. 
32. 


u = 0.057198e-° 15 ' cos(3.87008 1 - 0.50709) m, t in s; p. = 3.87008 rad/s, 
II /coo = 3.87008/ vT5 = 0.99925 
Q = 10- 6 (2e- 500 ' - e" 1000 ') C; t ins 
y = V2079 = 1.4907 

r = \M 2 + B 2 , r cos 9 = B, r sin 6 = —A; R = r, 5 = 9 + (4m + V)n/2, 
n — 0,1,2, . . . 


y = 8 lb-s/ft 
Vo < —yuo/2m 
y = 5 lb-s/ft 

(a) z = 41.715 (d) y 0 = 1.73, 

(e) r = (2/y) ln(400/y4 - y 2 ) 


18. R = 10 3 ft 
22 . 2tt/V3T 
24. k = 6, v = ±2^5 
min r = 4.87 


(a) u(t) = e~ Yt,lm ^mo^A/mm — y 2 cos pt + (2mv 0 + yz/ 0 ) sin/xtj /jAkm — y 2 

(b) K 2 = Am(kii^ + yiiovo + mv\)/(4km — y 2 ) 

plu" + Pogu = 0, 7 = 2tc y/ pi / pog 

(a) u = \/2 sin V2 t (c) clockwise 

(a) m = (16/Vl27)e _f/S sin(Vl27t/8) (c) clockwise 

(b) u = ci cos (y/k/m t) + byjm/k sin (yjk/m t) 


(b) u = sin t, A = 1, 7 = 2?r (c) A = 0.98, T = 6.07 

(d) e = 0.2, A = 0.96, T = 5.90; e = 0.3, A = 0.94, 7 = 5.74 

(f) e = -0.1, A = 1.03, 7 = 6.55; e = -0.2, A = 1.06, 7 = 6.90; e = -0.3, 

A = 1.11, 7 = 7.41 


Section 3.8, page 215 

1. — 2sin8fsinr 2. 2sin(f/2)cos(13f/2) 

3. 2cos(3nt/2) cos(nt/2) 4. 2sin(7t/2) cos(t/2) 

5. m" + 256m = 16 cos 3f, m(0) = i, m'(0) = 0, u in ft, t in s 

6. w" + 10M' + 98u = 2sin(f/2), m(0) = 0, u'(O) = 0.03, u in m, tins 

7. (a) m = cos 16f + ^ cos 3f (c) a> = 16 rad/s 

8. (a) 11 = jjj^j-[160e^ 5 ' cos(\/73t) + 3 ^ 3 ’ 443 e~ 5( sin( V73 1) — 160cos(t/2) + 

3128 sin (7/2)] 

(b) The first two terms are the transient. (d) co = 4%/3 rad/s 

9. u = j|(cos7r — cos8r) = ^ sinfr/2) sin(15r/2) ft, t in s 

10. 11 = (cos8r + sin8f — 8fcos8f)/4 ft, t in sec; 1/8, n/8, n/A, 3n/8s 

11. (a) 4j_(30cos2r + sin2f) ft, tins (b) m = 4 slugs 

12. u = (V2/6) cos(3f — 3n/4) m, r in s 

| 7 0 (f — sint), 0 < t < n 

7o[(2jt — t) — 3 sin f], n < t <2n 

—AF 0 sin t, 2n < t < 00 

16. g(0 = 10~ 6 (e" 4000 ' _ 4 e -iooor + 3) q t in s, Q(0.001) = 1.5468 x 10“ 6 ; 

0(0.01) = 2.9998 x 10~ 6 ; Q(t ) -s- 3 x 10~ 6 as t -> 00 

17. (a) m = [32(2 — co 2 ) cos cot + 8a; sin &>f]/(64 — 63ca 2 + 16t<; 4 ) 

(b) A = 8/V64 - 63co 2 + 16 co 4 (d) co = 3VI4/8 = 1.4031, A = 64/VT27 S 5.6791 

18. (a) u = 3(cos t — cos cot) /{co 2 — 1) 

19. (a) u = [{co 2 + 2) cos f — 3 cos cot]/ (co 2 — 1) + sin t 
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CHAPTER 4 Section 4.1, page 224 

1. — oo < t < oo 2. f > 0 or r < 0 

3. t > 1, or 0 < t < 1, or t < 0 4. t > 0 

5. . . . , — 3tt/2 < x < — ?r/2, — n /2 < x < 1, 1 < x < tt/2, 7t/ 2 < x < 3 jt/2, . . . 

6. —oo < x < —2, —2 <x<2, 2 < x < oo 

7. Linearly independent 

8. Linearly dependent; f (t) + 3/2(7) — 2/3 (r) = 0 

9. Linearly dependent; 2/i(f) + 13/ 2 (0 — 3/ 3 (0 — 7/4(0 = 0 


10. 

Linearly independent 


11. 1 


12. 

1 


13. — 6e _2f 


14. 

c- 2 ' 


15. 6x 


16. 

6/x 


17. sin 2 1 = 

®(5) - \ cos 2 1 

19. 

(a) ao[n(n — \)(n — 2) • • 

• 1] + «i \n(n - 

- 1) ■ ■ ■ 2]f + ■ ■ ■ 

+ fl„f" 


(b) (a 0 r n + air n 1 4 h a n )e n 




(c) e',e ',e 2t ,e 2t ; yes, 

W (e' , e~' , e 2 ' . 

, e“ 2 ') 7^ 0, —00 

< t < OO 

21. 

W(t) = ce~ 2 ‘ 


22. W(0 = 

c 

23. 

W(t ) = c/t 2 


24. W(0 = 

c/t 

27. 

y = cie 1 + c 2 t + c 3 fe' 


28. y = C\t‘ 

' + C2^ 3 + C3(Y + 1) 


Section 4.2, page 231 


1. 

3. 

5. 

7. 

9. 

11 . 

13. 

15. 

16. 

17. 

18. 

19. 

20 . 
21 . 
22 . 

24. 

25. 

26. 

27. 

28. 
29. 
31. 
33. 

35. 

36. 

37. 

38. 

39. 


^/2 e '[W4)+2mir] 
2 e i[(lW6)+2m7r] 

1, i(— 1 + 0/3), 
1, i, —1, — 


|(— 1 — 0/3) 


2. 2e‘ [(2ir/3)+2mirl 

4 gi[(3a-/2)+2mir] 
g A /2e , [(5ir/4)+2mn-] 

8. 2 1/4 e“ ir 78 ; 2 1 / 4 e 7ir 78 

10. (V3 + /) / -s/2, - (V3 + A /V2 


y = cie' + C 2 te‘ + c 2 e~‘ 12. y = cie' + c 2 te‘ + c 3 f 2 e' 

y = cje' + c 2 e 2t + c 3 e - ' 14. y = C\ + c 2 t + c 3 e 2 ' + c 4 te 2 ' 

y = ci cos r + c 2 sin t + e'' 3 * /2 (c 3 cos 3 t + c 4 sin 3 0 + e~'^‘ tl2 (Ci cos + c 6 sin \t) 
y = cie' + c 2 e~' + c 3 e 2 ' + c 4 e -2 ' 
y = c 4 e' + c 2 te‘ + c 2 t 2 e' + c 4 e~' + c^te~‘ + C(,t 2 e~ { 
y = a + c 2 t + c 3 e' + c 4 e - ' + C5 cos r + C6 sin t 
y = ci + c 2 e' + c 3 e 2t + c 4 cos r + C5 sin t 
y = ci + c 2 e 2 ' + e~'(c 3 cos ->/3 1 + c 4 sin V3 1) 

y = e'[(ci + c 2 t) cos t + (c 3 + c 4 0 sin f] + e _t [(C5 + c 6 t) cos t + (c 7 + c 8 t) sin t ] 

y = (ci + c 2 ?) cos f + (c 3 + c 4 t) sin t 23 . y = c\e' + c 2 e (2+ ' /3) ' + c 3 e (2_v/3) ' 

y = cie - ' + c 2 e (_2+v ^ ) ' + c 3 e (_2_ '^ )( 
y = cie _,/2 + c 2 e _f/3 cos(t/-s/3) + c 3 e _,/3 sin(f/-\/3) 
y = cie 3 ' + c 2 e -2 ' + c 3 e (3+,/3) ' + c 4 e (3_ ' /3) ' 


cie _ ' /3 + c 2 e~ t,A + c 3 e _( cos 2r + c 4 e _t sin 2r 
cie - ' cost + c 2 e~' sinr + c 3 e -2 ' cos(-\/3 1) + c 4 e -2 ' sin(V3 0 
2 — 2 cost + sin t 30. v = lc _f /V2 s j n 

2r - 3 

4e 2 ' 


_ 2 e < - - e 
3 C 10 e 
1 lo^-f/3 


_ 16 .-1/2 
6 C 15 e 


30. y = \e~ tl '^ 2 sin(f/\/2) — ^e' 1 ^ 2 sin(t/V2) 

32. y = 2 cos f — sin f 

34. y = Ae - ' + f£e'/ 2 cos t + Ae' /2 sin t 


1 7 

y= jie _t cost— j|e _ 'sinf— ^e -2 ' cos(-s/3 0 4 — e _2f sinlv^O 
y = 3 (cosh t — cos 0+1 (sinh r — sin t) 

(a) W(t) = c, a constant (b) W(t) = —8 (c) W{t ) =4 

(b) mi = ci cos f + c 2 sin ? + c 3 cos \/6 1 + c 4 sin \/6 1 
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Section 4.3, page 237 

1 . y — C\e' A c 2 te‘ A cj,e~' + ^te - ' + 3 

2. y = cie' + C2e _t + C3 cost + C4 sin t — 3 1 — jtsinf 

3. y = cie - ' + C2 cos t A C3 sin t + \te~‘ + 4(t — 1) 

4. y = Ci + c 2 e' + c 3 e - ' + cos f 

5. y = ci + c 2 t A c 3 e -2 ' + c 4 e 2 ' - |e' - ^t 4 - if 2 

6. y = ci cos r A C2 sin t A C3t cos f A c 4 f sin t A 3 A i cos 2 f 

7. y = ci + c 2 t A c 3 t 2 + c 4 e _t + e t/2 [c 5 cos(\/3t/2) + C6 sin(V3f/2)] + Jjt 4 

8. y = ci + c 2 t + c 3 t 2 + c 4 e _t A ^ sin 2f A ^ cos 2 f 

9. y= ^(l-cos2f) + if 2 

10. y = (t — 4) cost — (|f + 4) sin? + 3t + 4 

11. y = 1 + i(f 2 + 3t) - te ( 

12. y= -f cost - |sinr+ ie“'+ ge'+ ^e- 3 '+ |cos2r- ^|sin2f 

13. Y(t) = tC4 0 t 3 + Ait 2 + A 2 t A ^3) + Bt 2 e‘ 

14. Y(f) = t(A 0 t A A 1 )e~ ! A B cos t A C sin f 

15. Y(t) = At 2 e' + B cost A Csint 

16. Y(t) = At 2 + ( B 0 t A Bi)e‘ A t(C cos2f + Z)sin2t) 

17. Y (t) = t(A 0 t 2 Y .Ait A A2) A (B^t A 7?i) cos f A (Cot A Ci) sm t 

18. Y(t) = Ae' A (-B 0 t A Bi)e~' A te~‘(C cost A -D sint) 

19. fco = £fo, = aoa n A aia" -1 A • • • A a n -iot A a n 


Section 4.4, page 242 


1. 

2 . 

4. 

5. 

6 . 

7. 


8 . 


9. 

10 . 

11 . 

12 . 

13. 

14. 

15. 

16. 
17. 


y = ci A C2 cos t A C3 sin t — In cos f — (sin t) ln(sec t A tan f) 

y = ci A c 2 c' A c 3 e~' - \t 2 3. y = Cie' A C2e~' A c 3 e 2 ' A ^e 4 ' 

y = ci A c 2 cos f A c 3 sin f A ln(sec t A tan t) — t cos f A (sin f) In cos t 

y = cie' A C2 cos f A C3 sin t — ^ e~‘ cos t 

y = ci cos t A C2 sin t A c 2 t cos f A c 4 f sin t — |f 2 sin f 

y = Qe' A c 2 cos f A C3 sin t — \ (cos t) In cos f A \ (sin f) In cos t — \t cos t 


— if sint A \e' 


^ e s / cos s^j ds 
y = ci A c 2 e' A c 2 e~‘ — In sin f A ln(cos t A 1) A \e' jf ^ 


e s / sins ) ds 


+ ^ 


■jC( 


e s /sins I ds 


Ci = 0, C2 = 2, C3 = 1 in answer to Problem 4 

Ci = 2, C2 = l, C3 = — c 4 = | in answer to Problem 6 


ci = f, c 2 = i. c 3 = 


- 3 , to = 0 in answer to Problem 7 

W 2 


Ci = 3, C2 = 0, C3 = — e*' , t 0 = jt/ 2 in answer to Problem 8 
Y(x) = x 4 /15 


\ / [e‘ s — sin(f — ,y) — cos(t — s)]g(s) ds 

'to 

\ / [sinh(t — 5) — sin(f — s)]g(s) ds 
Jt 0 

Y(f) = i I e (f_I) (t _ s) 2 g(s) ds; Y(t) = — fe'ln|t| 
Jtn 


Y(t) = 2 
Y(f) = 2 


Y(x) = | f [(x/t 2 ) - 2{x 2 /t 3 ) A (x 3 /t 4 )]g(t) df 
J Xl 
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CHAPTER 5 Section 5.1, page 249 


1- P = 1 
3. p = oo 
5. p = 1 
7. p = 3 

“ (-1)^+1 

9. > , p = oo 

11. 1 4- (x — 1), p = oo 

13. £>1 P = 1 


15. yV, p = 1 


2. p = 2 

4 . p = 1 

6. p = 1 

8. p = e 

oo „ 

io. y — , p = oo 

4 -^ 7 !! 

n=0 

12. 1 — 2(x + 1) + (x + l) 2 , p = oo 

OO 

i 4 . y (- d"x", p = i 

n=0 

oo 

16. y (— l)" +1 (x — 2)", p = 1 


17. / = 1 4- 2 2 x + 3 2 x 2 + 4 2 x 3 4 f (n + l) 2 x" 4 

/' = 2 2 4- 3 2 ■ 2x 4- 4 2 ■ 3x 2 4- 5 2 • 4x 3 4- • • • 4- {n 4- 2) 2 (;i 4- l)x" 4- • • 

18. y' = a\ + 2d 2 x + 3a 2 x 2 + 4ti4.x 3 + ■ ■ ■ + in 4- l)fl„ + ix" 4- • • • 

OO OO 

= y| na„x n ~ l = y^(77 4- V)ci n +\x!' 

n=l n = 0 

y" = 2a 2 4- 6a 2 x + 12t74X 2 4- 20t75.x 3 4- ■ ■ ■ 4- (w 4- 2)(n 4- l)fl„ + 2X K 4- ■ 

OO OO 

= y n(n — l)a n x n ~ 2 = y {n 4- 2){n 4- l)«„ + 2x" 


21. y { n + 2)(n+ l)a„+2X n 

n = 0 

OO 

23. n + l)a n x n 

n = o 
oo 

25. ^2 t( w + 2)(w + l)fl w +2 + 

n=0 

oo 

27. y[(77 4- l)no n+ i + d„]x " 


22. y a n _ 2 x” 

n=2 

oo 

24. y [(77 4- 2) (77 4- l)fl «+2 — 77(77 — l)a„]x" 

n=0 

00 

26. fli 4- y [(77 4- I)a «+1 4- fl„-i]x" 

n = 1 

28. d n = (—2) n do/n\, n = 1,2, ... ; ao e_2< 


Section 5.2, page 259 

1 . ( a ) 77„ + 2 = fl „/(71 4 - 2 ) (77 4 - 1 ) 

2 v 4 „6 oo 

A Jv A % A 

(b’d) HW = 1 + ^ + 77 +m+--- = I] 


n=0 

v 2n+l 


( 277 )! 


= coshx 


y 2 W-x+ - + - + - + • 


= y — = sinhx 

^( 2,7 + !)! 


2. (a) t7„ +2 = a n j (77 4- 2) 

2 4 6 x., 

(b,d) yi(x) = 14 1 1 )-■■■ = y 

v ' ’ ? ?.! ?. d. A 2"77! 


2 2-4 2-4-6 

n=0 

x 3 x 5 * x 7 ^ 2 n 77!x 2,I+1 

y 2 (A ’ = A + y + 375 + 3.5.7 + ' ■ ' = jL (277 4- 1)! 

3. (a) (77 4- 2)fl„ +2 - a„ + i — d„ = 0 

(b) yi(x) = 1 + \{x - l) 2 + i(x - l) 3 + \{x - l) 4 + • • • 
y 2 (x) = (x - 1) + Ux - l) 2 4- \{x - l) 3 4- l(x - l) 4 4 — 
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4. (a) a n+ 4 = —k 2 a„/(n + 4)(/i + 3); « 2 = 03 = 0 

l-2 v 4 1,4 r 8 t6„l 2 

/ V A /V .yv /V A 

( b’d) *<*> = 1 - ^4 + 3^7T 


3 . 4 . 7 . 8 - 11 -12 


= i+E 


(— l) m+1 (k 2 x 4 ) m+1 


3 • 4 • 7 • 8 • ■ ■ (4m + 3) (4m + 4) 


& 2 x 5 /c 4 x 9 

y2( x)=X - — + —— 


k 6 X 12 


* + E 


4 • 5 • 8 • 9 • 12 • 13 

(— 1 ) m+ 1 (k 2 x 4 ) m+1 
4 • 5 ■ 8 • 9 • ■ ■ (4m + 4) (4m + 5) 


Hint: Let n = 4m in the recurrence relation, m = 1, 2, 3, ... . 

5. (a) (n + 2 )(n + l)«„ +2 — n(n 4- l)a„ + i + a n = 0, n > 1; a 2 = — \do 


(b) yi(x) = 1 - \x 2 - gX 3 - ^x 4 + • • ■ , y 2 (x) = x - \ 


1 ^3 


2_^4 _ J_„s , . 
10 a. oa a. ~r 


6. (a) a n+ 2 = — (n 2 - 2n 4- 4)a„/[2(n + l)(n + 2)], 
(b) yi(x) = 1 - x 2 4- gx 4 - ^x 6 + • • ■ , 


n >2; a 2 — ~ a o, «3 = — j«i 


y 2 (x) = x - ix 3 + j^x 5 - JgX 7 + • 


7. (a) a„ +2 = -d„/(n + 1), « = 0, 1,2 — 


„ „ X 2 X 4 X 6 

b’d) yiw = 1 - t + rs “ r^5 


+ ' 


= 1+ E 


( — l) n x 2 ‘ 


1 • 3 ■ 5 • • • (2 n - 1) 


X 3 X 5 

y? (x) = x 1 

7 2 2-4 


2-4-6 


■ = * + £ 


«=i 
(-l)"x 2n+1 

2 • 4 • 6 • • • (2/i) 


8. (a) a„ +2 = -[(« + l) 2 fl„+i 4- 4- fl„-i]/(» + l)(n + 2), n = 1, 2, . . . 

a 2 = ~(d 0 + fli)/2 

(b) yi(x) = 1 - |(x - l) 2 + \(x - l) 3 - i(x - l) 4 4 

y 2 (x) = (x - 1) - |(x - l) 2 + |(x - l) 3 - |(x - l) 4 + • • • 

9. (a) (n + 2)(n + l)fl„ +2 + (n — 2 )(n — 3)d n = 0; n = 0,1,2, . . . 

(b) y x (x) = 1 - 3x 2 , y 2 (x) = x - x 3 /3 

10. (a) 4(n + 2)a„ + 2 — (n — 2 )a n = 0; n = 0, 1, 2, . . . 

2 3 S 7 

/, 'X X X X X 

(b,d) yi (x) = 1 , y?(x) = x • 

v ' 7 4 7 12 240 2240 


r 2«+l 


4" (2n - l)(2n + 1) 


11. (a) 3(n + 2)fl„ +2 — (n + 1 )a„ = 0; n = 0,1,2, . . . 


(b,d) yi(x) = 1 + - — t- — 4 x° 4 

t ) xu T 6 24 432 

2 , 8 , 16 7 

y 7 (x) = x 4 — x 4 x 4 x 4- • 

y ’ 9 135 945 


3 ■ 5 • • • {2n — 1) , 

- 4- -x 2,i 4- - 


3" ■ 2 • 4 • • • (2 n) 
2 • 4 • • ■ (2«) 


-x 2 " +1 4- • 


3" -3- 5--- (2/14-1) 

12. (a) (n 4- 2)(w 4- l)fl„ +2 — (« 4- 1 )/m„ + i 4 - (n — 1 )a„ = 0; n = 0, 1, 2, . . . 


2 3 4 n 

\ X X X 

(b’d) ^ l(x) = 1+ Y + : 6 + 24 + '" + ^ + ' 


y 2 (x) = x 

13. (a) 2(n + 2)(n + 1 )a n+2 + (n + 3 )a n = 0; n = 0, 1, 2, . . . 

7 


(b’d) yi^l-Jx^x 4 ^ 


, 3 -5- ■■(2/14-1) , 

x 6 4 - ... 4 - (- 1 )” -x 2 " 4- • 

2 n (2/i)! 


357 
x x x 

y 2 (x) = x 1 1- • 

7 3 20 210 


4 • 6 • • • ( 2n 4- 2) , , 

+ (-1)" x 2k+1 4- • 


2" (2 n + 1)! 

14. (a) 2(n + 2 )(n + l)d n+2 + 3(n + l)d n+ i + (n + 3 )a n = 0; n = 0, 1, 2, . . . 

(b) yi(x) = 1 - |(x - 2) 2 4 - |(x - 2) 3 4 - ^(x - 2) 4 4 

y 2 (x) = (x - 2) - |(x - 2 ) 2 4- ^(x - 2 ) 3 4- gj(x - 2 ) 4 4 
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15. 

16. 

17. 

18. 
19. 


21 . 


(a) y = 2 + x+x 2 +±x 3 + \x 4 + ■ 


(a) y = -1 4- 3x 4- x 2 - ^x 3 - |x 4 + ■ 
(a) y = 4 - x - 4x 2 + \x 3 + |x 4 + 


(c) about \x\ < 0.7 
(c) about |x| < 0.7 
(c) about \x\ < 0.5 

(a) y — — 3 + 2x — \x 2 — \x 3 — ^x 4 + • • • (c) about \x\ < 0.9 

(a) y i(x) = 1 - §(x- l) 3 - i(x- l) 4 + i(x - 1) 6 + •• • 
y 2 (x) = (x - 1) - \(x - l) 4 - ±(x - l) 5 + ^(x - l) 7 4 


(a) yi (x) = l--x 2 + 


2 , *■(* ~ 4 > 4 


4! 


X(X — 4)(A. — 8) 
6! 


x + • 


y 2 (x) = x — 


X — 2 


3! 


, (X — 2) (X — 6) , (X — 2)(X — 6)(X — 10) 7 
x 4 x x' + ■ 


5! 


7! 


(b) 1, x, 1 — 2x 2 , x — |x 3 , 1 — 4x 2 + |.x 4 , x — ^x 3 + ^x 5 

(c) 1, 2x, 4x 2 - 2, 8x 3 - 12x. 16x 4 - 48x 2 + 12, 32x 5 - 160x 3 + 120x 
22. (b) y = x — x 3 /6 + • • • 


Section 5.3, page 265 


1. 

2 . 

3. 

4. 

5. 
7. 
9. 


10 . 


11 . 

12 . 

13. 

14. 

15. 

16. 

17. 

18. 
19. 


4>"{ 0) = -l, 0"'(O) = O, <t> (4) ( 0) = 3 

<t>"(0) = 0, 0) = -2, 0 (4) (O) = 0 

<p"{ 1) = 0, </>’"( 1) = -6, 0 (4, (1) = 42 

</>"(0) = 0, 0'"(O) = -fl O , 0< 4 )(O) = -4, fl! 

p = oo, p = oo 6. p = 1, p = 3, p = 1 

p = 1, p = a/3 8. p = 1 

(a) p = oo (b) p = oo (c) p = oo (d) p = oo (e) p = 1 

(f) p = -j2 (g) p = oo (h) p = 1 (i) P = 1 (j) P = 2 

(k) p = V3 (1) p = 1 (m) p = oo (n) p = oo 

a 2 2 (2 2 — aV 2 4 (4 2 - ar)(2 2 - a 2 )a! 2 6 

(a) yi(x) = 1 - —x 2 x 4 x 6 - ■ 


4! 

[(2m - 2) 2 - a 2 ] ■ ■ • (2 2 - cr)a 2 


6! 


y 2 (x) = x + 


(2m)! 

1 — a 2 , (3 2 — a 2 )(l — a 2 ) 


[(2m - l) 2 - a 2 ] ■■■ (1 - a 2 ) 

(2m + 1)! 

(b) y\(x) or y 2 (x) terminates with x n as a = n is even or odd 

(c) n = 0, y — 1; n = 1, y = x; n = 2, y = 1 — 2x 2 ; n = 3, y = x — ^x 3 

y 2 (x) = x - ix 4 + ^x 6 + 4 jX 7 + ■ 


12 1 180 1 504 

y 2 (x) = x - ix 4 4- ix 5 - i,x 6 + ■ 
y 2 (x) = X + ix 3 + ix 5 + jijX 7 + • • 
y 2 (x) = x - lx 3 + ±x 4 + j^x 5 + ■ 


yi(x) = 1 - gx 3 + j^x 5 + jggX 6 + • 

p = OO 

yi(x) = 1 - lx 3 + ix 4 - ^x 5 + • • 
p = OO 

yi(x) = 1 + x 2 4- j^x 4 + j^x 6 4 

p = jr /2 

yiW = 1 4- ^x 3 4- ^x 4 - jkjX 6 4- • • 

p = 1 

Cannot specify arbitrary initial conditions at x = 0; hence x = 0 is a singular point. 

x 2 x n 

,= l+x + - + ... + _ + ■■■ = «- 

X 2 X 4 X 6 X 2 ” 

>' = 1+ T + 2T4 + 2.4.6 + " 
y = 1 + x 4- ^x 2 4- ^x 3 4- ■ ■ ■ 

y=l4-x4-x 2 4----4-x"4---- = 


+ 


2" • n\ 


4 - • 


1 — x 
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20 . 


21 . 


/ X 2 X n \ / X 3 X 4 

y = a 0 (l+x+ - + ■■■ + - + ■■■) +2^- + - + ■■■ + 

= ao^' 1 + 2 — 1 — x — = ce* — 2 — 2x — x 2 

/, X 2 X 4 X 6 (- 

y-fl°^- T + ^-^3y + "- + - 

/ X 2 X 3 X 4 X 5 

+ ( * + ^ ^ — 2 — 4 3 — j + • • • 



x" 

n\ 



= a 0 <? 312/2 + 



3 




23. 1, 1 - 3x 2 , 1 - 10x 2 + f x 4 ; x, x - §x 3 , x - f x 3 ± f x s 

24. (a) 1, x, (3x 2 - l)/2, (5x 3 - 3x)/2, (35x 4 - 30x 2 + 3)/8, (63x 5 - 70x 3 + 15x)/8 
(c) Pi, 0; P 2 , ±0.57735; P 3 , 0, ±0.77460; P 4 , ±0.33998, ±0.86114; 

P 5 , 0, ±0.53847, ±0.90618 


Section 5.4, page 276 

1. y = cix -1 ± c 2 x -2 2. y = C\\x ± 1| _1/2 ± c 2 |x ± 1| _3/2 

3. y = cix 2 ± c 2 x 2 In |x| 4. y = cix -1 cos(2 In |x|) ± c 2 x _1 sin(2 In |x|) 

5. y = C\x ± c 2 xln |x| 6. y = Cj(x — l) -3 ± c 2 (x — l) -4 

7. y = ci|x|<- 5 +v®)/ 2 + c 2 |x|<- 5 -^/2 

8. y = ci|x| 3/2 cos(l V31n |x|) ± c 2 |x| 3/2 sin(i\/31n |x|) 

9. y = c\x 3 ± c 2 x 3 In |x| 


10. y = ci(x — 2) 2 cos(21n |x — 2|) ± c 2 (x - 

■ 2) 2 sin(21n |x — 2|) 

11. y = c\ |x|~ 4 / 2 cos(i\/l51n |x|) ± c 2 |x|- 4 / 2 

sin( 3 -s/l51n |x|) 

12. y = c\x ± c 2 x 4 

13. y = 2x 3/2 - x- 1 

14. y = 2x~ 1/2 cos(2 lnx) — x -1/2 sin(2 lnx) 

15. y = 2x 2 — 7x 2 In |x| 

16. y = x _1 cos(21nx) 

17. x = 0, regular 

18. x = 0, regular; x = 1, irregular 

19. x = 0, irregular; x = 1, regular 

20. x = 0, irregular; x = ±1, regular 

21. x = 1, regular; x = —1, irregular 

22. x = 0, regular 

23. x = —3, regular 

24. x = 0, —1, regular; x = 1, irregular 

25. x = 1, regular; x = —2, irregular 

26. x = 0, 3, regular 

27. x = 1, —2, regular 

28. x = 0, regular 

29. x = 0, irregular 

30. x = 0, regular 

31. x = 0, regular 

32. x = 0, ±htt, regular 

33. x = 0, ±«jr, regular 

34. x = 0, irregular; x = ±hjt, regular 

35. a < 1 

36. p > 0 

37. y = 2 

38. a > 1 


39. (a) a < 1 and p > 0 


(b) a < 1 and ft > 0, or a = 1 and ,6 > 

0 

(c) <x > 1 and /? > 0 


(d) <x > 1 and p > 0, or <x = 1 and p > 

0 

(e) a = 1 and p > 0 



44. Irregular singular point 

45. Regular singular point 

46. Regular singular point 

47. Irregular singular point 

48. Irregular singular point 

49. Irregular singular point 



Answers to Problems 


747 


Section 5.5, page 282 

1. (b) r(2r — 1) = 0; a n = — 


(In— 2 


( n + r)[2(n + r) - 1] 


'1 = b r 2 = 0 


(c) yi(x) = 

+ 


1 - 


+ 


2-5 2 • 4 • 5 • 9 2 • 4 • 6 • 5 • 9 • 13 

(-l) n x 2n 


+ • 


2"n!5 • 9 • 13 • • • (4w + 1) 


(d) y 2 (x) = 1 - — + 

+ 


2-3 2- 4-3-7 2 • 4 • 6 • 3 • 7 ■ 11 

(~l) n x 2n 


2”n\3 - 7 - 11 - ■ • (4« - 1) 

2 1 r». (In— 2 


+ ■ 


2. (b) r- - 1 = 0; «„ = - 


(« + r) 2 - i ’ 


r2 = -| 


(c) yt(x) = x 1 / 3 


1 - 


rnt© 


_ 2 


1 


3 ■ -T- 

(-l) m /r\2i" 


© 4 


1-(1 +5) '2/ 2!(1 + |)(2 + i) V2 

K© 


+ • 


m\(l + i)(2 + i) ■ ■ • (m + i) 


(d) ;y 2 (x) = x 


= r-l/3 


1 - 


— (-V 
1-1) \2> 


1!(1- i) 

( _i r 


+ 


1 


2!(l- i)(2- 1) 

K© 


3 ' 
2m 


© 4 


w!(l- \)(2- i)---(w- i) ^2 


Hint: Let zz = 2»z in the recurrence relation, m = 1, 2, 3, ... . 

3. (b) r(r - 1) = 0; a n = - . — 1X ; r, = 1 , r 2 = 0 


(c) yx(x) = x 


{n + r)(n + r - 1) ’ 

x x 2 (-1)" 

1 1 1 1 

1!2! 2!3! n\(n + 1)! 


x n + - 


4. (b) r 2 = 0; a n = 


1 


/‘i = r> = 0 


(n + r) 2 ’ 

(c) yi (x) = l+^ + ^ + -.. + ^ + 
5. (b) r(3r — 1) = 0; a n = — 


(c) yi (x) = x 1 / 3 


(n + r)[3(n + r) - 1] ' 


n = 1, r 2 = 0 


(d) y 2 (x) = 1 “ ^5 (h ) + 



xr \ (-1)'" 

2 / m\5 • 11 • • ■ (6 m — 1) 


/ ? \ m 

X \ 

,2) + ' 


Hint: Let n = 2«? in the recurrence relation, m = 1, 2, 3, 

2 r\ f\_ _ tl n — \ 


6. (b) ©-2 = 0; 
(c) yi(x) = X s72 


1 - 


(/z + r) 2 — 2 ’ 
x 


ri = ©2, r 2 = —©2 

2 

+ • 


X" 


1 (1 + 2 © 2 ) 2 ! (1 + 2 © 2 ) (2 + 2 © 2 ) 

(- 1 )" 


n!(l + 2 ©2) (2 + 2 ©2) ■■■(«+ 2 ©2) 


x" + ■ 
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(d) y 2 (x) = x-^ 

+ 


1 - 


1(1 — 2 \/ 2 ) 2!(1 - 2 V 2)(2 - 2 V 2 ) 

(- 1 )" 


+ 

x" H 


n!(l - 2^2) (2 - 2V^) •••(«- 2^2) 

7. (b) r 2 = 0; (n + r)n„ = a„_i; ri = r 2 = 0 

(c) y\(x) = 1 + x + — + — -4 -4 — - + ■•• = e* 

2! 3! n\ 

8. (b) 2r 2 + r — 1 = 0; (2n + 2r — l)(n + r + 1 )a n + 2a„_ 2 = 0; 


r i = 2’ ''2 = - 1 

(c) y,(x) = x 1 / 2 (l - y + 


X 4 (-l) m x 2m 

2!7 • 11 m!7 • 11 ■ ■ ■ (4/n + 3) 


(d) y 2 (x) = x Ml-x 2 !- — + 


(~l) m x- 


mJ2m 


+ • 


2!5 ml 5 • 9 • • • (Am — 3) 

9. (b) r 2 — Ar + 3 = 0; ( n + r — 3 ){n + r — 1 )a n — (n + r — 2)a n _i = 0; = 3, r 2 = 1 


(c) yi{x) = x 3 ( 1 + -x + — A t- 


2x n 


3 4 n\(>i + 2) 

10. (b) r 2 - r + i = 0; (n + r - \) 2 a„ + a„_ 2 = 0; r! = r 2 = 1/2 


(c) y x (x) = x 1 / 2 ^1 - ^ + 
11. (a) r 2 = 0 ; r, = 0. r 2 = 0 


x 4 ' (-l) m x 2m 

2M 2 + 2 2m (m\) 2 + ' 


, a(a + 1) 

(b) yi(x) = 1 + ,, 19 (x - 1) - 


a (a + 1)[1 ■ 2 - a(a + 1)] ^ lU 

2 • l 2 v ~ (2 • 1 2 )(2 ■ 2 2 ) 


-(x-l) 2 + . 


,, 1VI+ i«(a + 1)[1 ■ 2 - a (a + 1)] ■ ■ ■ [n(n - 1) - a (a + 1)] , 1V1 

+( ’ 2"(?i\) 2 (X 

12. (a) r\ = 1, r 2 = 0 at both x = ±1 
(b) yi(x) = lx-11 1 / 2 

^ (-1)"(1 + 2a) • ■ ■ (2n - 1 + 2a)(l - 2a) • • ■ (2n - 1 - 2a) ^ 

1 , 1 (■* i' 


n=l 

y 2 W = 1 


2" (2/! + 1)! 


(— l)"a(l + a) ■ ■ ■ (n - 1 + a)(-a)(l - a) ■■•(«- 1 - a) 

+ 2^ „ f — — 77 (* “ !) 


n=l 

„2 


13. (b) r 2 = 0; ri = 0, r 2 = 0; a,, = 


n\ 1 ■ 3 • 5 • • • (2« — 1) 

(n — 1 — X)a n -\ 


(c) y 1 W = i+^x + 


(-*)U - A) 
(2!) 2 


x 2 + • • • + 


(-A.)(l - A) ■ ■ ■ (w - 1 - A.) 
(n!) 2 


For X = the coefficients of all terms past x n are zero. 

16. (e) [(n — l) 2 — 1 ]b„ = — h„_ 2 , and it is impossible to determine b 2 . 


Section 5.6, page 290 


1. (a) x = 0; 

2. (a) x = 0; 

3. (a) x = 0; 
(a) x = 1; 

4. None 


(b) r(r - 1) = 0; n = 1, r 2 = 0 
(b) r 2 - 3r + 2 = 0; n = 2, r 2 = 1 
(b) r(r - 1) = 0; r x = 1, r 2 = 0 

(b) r(r + 5) = 0; r, = 0, r 2 = -5 
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5. 

6 . 

7. 

8 . 

9. 

10 . 

11 . 

12 . 

13. 

14. 

15. 

16. 

17. 

18. 
19. 


(a) x — 0; 

(b) r 2 + 2r - 2 = 0; n = -1 + s/3 = 0.732, r 2 = -1 - V3 = -2.73 

(a)x = 0; (b) r(r — |) = 0; n = ii = 0 

(a) x = —2; (b)r(r-|)=0; ri = §, r 2 = 0 

(a) x = 0; (b) ;- 2 + 1 = 0; ri = i, r 2 = —i 

(a) x — —1; 

(b) r 2 — 7r + 3 = 0; n = (7 + vf7)/2 = 6.54, r 2 = (7 - V37)/2 = 0.459 

(a) x = 1; (b) ;- 2 + r = 0; t*i = 0, r 2 = —1 

(a) x = —2; (b) r 2 - (5/4)r = 0; n = 5/4, r 2 = 0 

(a) x = 2; (b) r 2 - 2 r = 0; n = 2, r 2 = 0 

(a) x = -2; (b) r 2 - 2 r = 0; n = 2, r 2 = 0 

(a) x = 0; (b) r 2 — (5/3 )r = 0; r\ = 5/3, r 2 = 0 

(a) x = -3; (b) r 2 - (r/3) -1 = 0; n = (1 + V37)/6 = 1.18, 

r 2 = (1 — V37)/6 = -0.847 

(b) = 0, r 2 = 0 

(c) yi(x) = 1 + x + lx 2 + ^x 3 + • • ■ 

yi(x) = yi(x) lnx - 2x - |x 2 - ^x 3 H 

(b) ri = 1, r 2 = 0 

(c) yi(x) = x - 4x 2 + yX 3 - gx 4 + • • • 

y 2 (x) = — 6vi(x) lnx + 1 - 33x 2 + ^x 3 + ■ ■ ■ 

(b) n = 1, r 2 = 0 

(c) yx(x) = x + |x 2 + ?x 3 + |x 4 + • • • 
y 2 (x) = 3yi(x) lnx + 1 - f x 2 - f x 3 + • • • 

(b) n = 1, r 2 = 0 

(c) yi(x) = x - lx 2 + ix 3 - jljx 4 + ■ ■ • 

^ 2 (x) = -yi(x) lnx + 1 - lx 2 + ^x 3 - j||gX 4 + ■ ■ ■ 

(b) n = 1, r 2 = -1 

(c) yi(x) = x - ix 3 + ^x 5 + ■ ■ ■ 

l 2 (x) = - iyi(x) lnx + x _1 - ^x 3 + • • • 

(b) n = r 2 = 0 

(c) yi (x) = (x - 1) 1/2 [1 - f (x - 1) + ||(x - l) 2 + • • ■], (d) p = 1 

(c) Hint: (, n — 1 )(n — 2) + (1 + a + /!)(« — 1) + afi = (n — 1 + a)(n — 1 + /?) 

(d) ffinf.' (n - y)(n -1-)/) + (1 + q' + >8)(/i - y) + af! = (n - )/ + a)(n - y + ft) 


Section 5.7, page 301 

(— l)”x" 


1. yi(x) = J2 


n\(n + 1)! 


l 2 (x) = — li(x) lnx + 


1 


n!(« — 1)! 


1 A (-l)"x" 2 (-1 ) n H n „ 

2 ■ = 7 E 1^2- • ^ 2(X) = * ,( * )lnjc " i E (n!) 2 * 


(«!)- 


3. yi (x) £ & y 2 (x) +,(x) lnx - 2 £ 

n = 0 ^ n=l 

4. »(,) = !£ ( ~ 1)H x" 


(—l) n 2"H n „ 
( n !) 2 ^ 


x " n\(n + 1)! 

n=u 


y 2 (x) = —y\(x) lnx + — 


i-T h " + h '-\ -itx» 

4 —' /;!(« — 1 )! 
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5. yi(x) = x 


.3/2 


i+E 


(-lr 


y 2 (x) = jc 


_ V- 3 / 2 


i + E 


^w!(l+ |)(2+|). .-(m+ §) 
(- 1 )”' 


' m!(l - f)(2 -§)•■■ (m - |) 


/x\ 2m 

V2/ 

/ v \ 2 

© 


Hint: Let n = 2/77 in the recurrence relation,/// = 1,2,3, 

For /• = — | , a\ = 0 and 03 is arbitrary. 


CHAPTER 6 Section 6.1, page 311 


1. 

3. 

5. 

6 . 
8 . 

10 . 

12 . 

14. 

16. 

18. 

20 . 

22 . 

24. 

26. 


Piecewise continuous 2. Neither 

Continuous 4. Piecewise continuous 

(a) F(s) = 1/s 2 , s > 0 (b) F{s) = 2/s 3 , s > 0 

(c) F(s) = n!/s" +1 , s > 0 

F(s) = s/(s 2 + a 2 ), s > 0 


F(s) = 


F(s) = 


F(s) = 
F(s) = 


F(s) = 


-b 2 


, s > |Zj| 


(s — a) 2 — b 2 
s 


, s — a > \b\ 


’+b 2 ' 


s > 0 


(s — a) 2 + 6 2 ’ 
2as 


s > 0 


(s 2 + a 2 ) 2 ’ 

//! 

r (s) = s > u 

(s - a)« +1 

2fl(3s 2 + a 2 ) 


F(s) = 


, s > |a| 


7. F(s) = 
9. F(s) = 


s 2 — b 2 
s — a 


(s — a) 2 — b 2 

11. F(s ) = „ ^ ^ , s > 0 


, s > |£>| 

, s — a > \b\ 


+ b 2 ’ 


13. F(s) = 


(s — a) 2 + b 2 ’ 

1 

15. F(s) = — , s > a 


17. F(s) = 


(s — a) 2 ’ 
s 2 + a 2 
(s — a) 2 (s + a) 2 


, s > |a| 


(s 2 — a 2 ) 3 
Converges 
Converges 

(d) r (3/2) = Jn/ 2; r(ll/2) = 945^/32 


2a (3s 2 — a 2 ) 

19 '^ )= lsW’ i>0 

21. Converges 
23. Diverges 


Section 6.2, page 320 


1. /(f) = | sin 2 1 
3. /(f) = |e' - f e“ 4 ' 

5. /(f) = 2e~' cos2f 
7. /(f) = 2e' cos t + 3e‘ sin t 
9. /(f) = —2e~ 2 ' cos f + 5e~ 2 ‘ sin f 
11. y= i(e 3 ' + 4e- 2 ') 

13. y = e‘ sinf 

15. y = 2e'cos V3f — (2/v / 3)e'sinv / 3f 
17. y = te‘ — t 2 e‘ + |f 3 e' 

19. y = cos\/2f 

21. y = l(cosf — 2 sinf + 4c' cosf — 2e‘ sinf) 

23. y = 2e~' + fe”' + 2f 2 e“' 

1 e _I (s + 1) 

25 Y(s ) = v ’ 

s 2 (s 2 + l) s 2 (s 2 + l) 

29. F(s) = l/(s — a) 2 


2 . /(f) = 2fV 
4. /(f) = fe 3 ' + fe" 2 ' 

6. /(f) = 2cosh2f — | sinh2f 
8. /(f) = 3 — 2 sin 2f + 5 cos 2 f 
10. /(f) = 2e~' cos 3f — | e - ' sin 3f 
12. y = 2e"' - e~ 2t 
14. y = e 2t — te 2 ‘ 

16. y = 2e~‘ cos 2 f + sin 2f 
18. y = coshf 

20. y = (a/ 2 — 4) [ (co 2 — 5) cos a/f + cos 2f ] 
22. y = i (e _( — e' cos f + 7e' sin f) 


s 1 - e -* s 

14. Y(s) = + — 

s 2 + 4 s(s 2 + 4) 

!6. Y(s) = (1 — e _s )/s 2 (s 2 + 4) 

SO. F(s) = 26 (3s 2 - b 2 )/(s 2 + 6 2 ) 3 
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31. F(s) = n\/s n+1 32. F(s) = n\/(s — a) n+1 

33. F(s) = 2b (s - a) /[is - a) 2 + b 2 ] 2 34. F(s) = [(,y - a) 2 - /r]/[(s - a) 2 + ft 2 ] 2 

36. (a) Y' + s 2 Y = s (b) s 2 Y" + 2s Y' - [s 2 + a (a + 1)] Y = -1 


Section 6.3, page 328 


7. 

8 . 

9. 

11 . 

12 . 

13. 

15. 

17. 

19. 

21 . 

23. 

26. 

28. 

30. 

32. 

33. 

35. 

37. 

39. 


40. 


(b) /(f) = — 2 l( 3 it) + 4 us(t) - Uy (t) 

(b) f(t) = 1- 2u\it) + 2m 2 (0 - 2m 3 (0 + w 4 (f) 

(b) /(f) = 1 + z< 2 (f)[e _(,_2) - 1] 10. (b) fit ) = f 2 + w 2 (0(l - t 2 ) 


(b) /(f) = f - Ml(f) - w 2 (f) - « 3 (f)(f - 2) 

(b) fit) = f + m 2 (f) (2 - f) + ms ( 0(5 - 0 - 

F(s) = 2 e~ s /s 2 

g—KS g—'llts 

F(s) = —2 2 — (1 + ?rs) 

F(s) = s _2 [(l — s)e _2s — (1 + s)e _3s ] 
fit ) = f 3 e 2 ' 

fit ) = 2 u 2 it)e‘~ 2 cos(f — 2) 

/(f) = f/i(f)e 2(t_1) cosh(f — 1) 

/(f) = 2(2f)" 
fit) = ie t/3 (e 2(/3 - 1) 

F(s) = s -1 (l — e _I ), s > 0 

F(s) = 1 [1 - e~ s + • • • + e~ 2ns - e - (2 " +1)s ] 


m 7 (0(7 - f) 

14. F(s) = e _J (s 2 + 2)/s 3 

16. F(s) = -(e _s + 2e~ 3s — 6e" 4s ) 
s 

18. F(s) = (1 - e~ s )/s 2 

20. fit) = iw 2 (f)[e'- 2 - e- 2( '- 2) ] 

22. fit) = « 2 (f) sinh 2(f — 2) 

24. fit ) = Mi(f) + m 2 (0 — 1(3(1) — « 4 (0 
27. fit) = 3e _(/2 cos f 
29. fit) = |e' /2 M 2 (f/ 2) 

31. F(s) = s _1 (l — e -s + e _2s — e _3s ), s > 0 

1 _ e — (2n+2)s 

= , s > 0 

s(l + e~ s ) 


1 00 

no = - TV-l)"^ = 


1A 

1 + e- 


s > 0 


C{fit)} = 

£{/( 0 ) = 


l/s 

, s > 0 

1 + e- 1 

1 — (1 + s)e -s 
s 2 (l - e~0 


s > 0 


(a) C[fit)} =1 *(l-e ’), s > 0 

(b) C{git)} = s" 2 (l - e~ s ), s > 0 

(c) £(/z(f)} = s“ 2 (l — e~ s ) 2 , s > 0 

(b) C[pit)} = * f , s > 0 

s 2 (l + e~ s ) 


36. C{fit)} = 
38. C{fit)} = 


1 - e~ s 

— ; , S > 0 

s(l + e~ s ) 

1 + e~ ns 

(1 +s 2 )(l - e~ ns ) ’ 


s > 0 


Section 6.4, page 336 

1. (a) y = 1 — cos f + sinf — M 37r (f)O + cosf) 

2. (a) _y = e - ' sinf + lu x if)[l + e - *' -30 cost + sinf] 

- |m 2*(0[1 - e" (, “ 2Ir, cosf - e - (f - 2l) sinf] 

3. (a) y = '[1 — m 2 „ (f)](2 sin f — sin 2f) 

4. (a) y = i (2 sin f — sin 2 f) — g ((^ (f) (2 sin f + sin 2 f) 

5. (a) y = | + ie“ 2t - e“' - wio(f)[| + i e - 2(, - 10) - e- (, - 10) ] 

6. (a) y = e~* -V 2t + w 2 (f)[| - e- (( - 2) + \e~ 2(, - 2) ] 

7. (a) y = cosf + w 3 jr (f)[l — cos(f — 3^)] 

8. (a) y = hit) — i(^/ 2 (f)/((f — tr/2), /z(f) = ^ (— 4 + 5f + 4e“' /2 cos f — 3e _(/2 sin f) 

9. (a) y = | sinf + \t — ^u 6 (t)[t — 6 — sin(f — 6)] 

10. (a) y = hit) + Uxit)hit — tc), /((f) = -^[— 4 cos f + sin f + 4e _,/ 2 cos r _|_ e -(/2 s j n 

11. (a) y = m„( 0[J - \ cos(2f - 2^)] - (/3 tt ( f)f 3 - \ cos(2f - 6?r)] 

12. (a) _y = u\it)hit — 1) — z< 2 (f)/((f — 2), /((f) = — 1 + (cosf + coshf)/2 

13. (a) y = /f(f) — u„it)hit — n), /z(f) = (3 — 4cosr + cos2f)/12 

14. /(f) = [w to (0(f - fo) - ut 0 +k(t)(t -t 0 - k)]ih/k) 
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15. g(t) = [u, 0 {t){t - t 0 ) - 2 u, 0+k (t)(t -t 0 -k) + u, 0+2 ic(t)(t - t 0 - 2k)](h/k) 

16. (b) u(t) = 4kui/ 2 (t)h(t — |) — 4ku5/ 2 (t)h(t — |), 

h(t) = \ - (v'7/84)e-'/ 8 sin(3v'7f/8) - \e~ 118 cos(3V7t/8) 

(d) k = 2.51 (e) r = 25.6773 

17. (a) k = 5 

(b) y = [u s (t)h(t — 5) — u 5+k (t)h(t — 5 — /r)]/Ar, /z (?) = If — ^ sin 2? 

18. (b) /*(0 = [w 4 -/t(f) - M4+fc(0]/2*; 

y = [u 4 _ k (t)h(t — 4 + /c) — u 4+k (t)h(t — 4 — /r)]/2/c, 

h(t) = i - ie~ f/6 cos(Vl43 f/6) - (VT43/572)e-'/ 6 sin(Vl43t/6) 

19. (b) y = 1 — cost + 2 E( — l) A: M^(f)[l — cos(f — fcrr)] 

jt=i 

21. (b) y = 1 — cost + E ( — l)*Mfer(0[l — cos(t — Attr)] 

*=i 

23. (a) y = 1 — cos t + 2 E (— l)*tziut/ 4 (i)[l — cos(t — ll/c/4)] 

/:=! 


Section 6.5, page 343 


1. 

2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 
9. 

10 . 

11 . 

12 . 

13. 

14. 


15. 

16. 

17. 

19. 

21 . 

23. 

24. 


(a) y = e 'cos t + e 'sin t + u n (t)e ir) sin(t — n) 

(a) y = \u n (t) sin2(f — n) — \u 2lt {t) sin2(t — 2 j r) 

(a) y = - \e~ 2 ‘ + + w 5 (0[— e _2<?_5) + e~ ( '“ 5) ] + u w {t)[\ + l e - 2( '- 10) - <r ( '- 10) ] 

(a) y = cosh (7) — 20 tt 3 (t) sinh(t — 3) 

(a) y = | sint — | cost + cos V2 1 + (l/\/2) u ilt (t)e~ < -‘~ 37r) sin ~Jl(t — 3n) 

(a) y = | cos2f + \u 4jI (t) sin2(f — An) 

(a) y = sin t + U 2 „ (t) sin(f — 2 n) 

(a) y = u„ /4 (t) sin2(f - tc/A) 

(a) y = u„ /2 {t)[l - cos (t - n/ 2)] + 3 u 2 „, 2 (t) sin(t - 37r/2) - M 2jr (t)[l - cos(f - 2 jt)] 
(a) y = (1/V31) m„/ 6(0 exp[— i(r - 7r/6)] sin(V3l/4)(t - n/6) 

(a) y = | cost + | sin t — ^ e~ ' cos t — |e - ' sint + / 2 (t)e~ (t ~ n /2) sin(f — n/2) 

(a) y = wi(f)[sinh(t — 1) — sin(t — l)]/2 
(a) -e~ T/i S(t — 5 — T), T = 8jr/yi5 

(a) y = (4/V15) u 1 (t)e-«- 1 ' ,/ * sin(Vl5/4)(t - 1) 

(b) t, = 2.3613, yi = 0.71153 

(c) y = (8V7/21) Ul (t)e- ( ‘- 1)/8 sin(3V7/8) (t - 1); h = 2.4569, yi = 0.83351 

(d) t x = 1 4- n/2 = 2.5708, y x = 1 

(a) ki = 2.8108 (b) h = 2.3995 (c) k, = 2 

(a) cj>(t,k) = [«4_ Jt (t)/t(t — 4 + A:) — u 4+k (t)h(t — A — k)]/2k, h{t) = 1 — cost 

(b) 0o (t) = M 4 (f) sin(f — 4) (c) Yes 

20 20 

( b ) y = E Mfer(0 sin(f - /ctt) 18. (b) y = EC-I^+Vk-W sin(t - kn) 

k=l k = 1 


20 

(b) y = E Mfer^W sin(f - kn/ 2) 


k=l 


20 

20. (b) y = X;(-l) fc+1 H/W 2 (t)sin(t-/c7r/2) 


*=i 


15 


(b) y = E M(2jt-i>(0sin[f 

k=l 


(2k — l)7r] 


40 

22. (b) y = E(-l) t+lw m/4(f)sin(t- llAr/4) 


&=1 


20 

(b) y = E (— l)* +1 t/ J b r (f)e -(t-for)/2 ° sin[-s/399(t - At7r)/20] 

A:=l 


^ _ J§9 


E M(2*-i)^(0e- [ '- (2t - 1)ir]/20 sin{ N /399[t - (2/c - l)tr]/20| 
k=l 
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Section 6.6, page 350 


3. 

4. 
6 . 

8 . 

10 . 

12 . 

13. 

15. 

16. 

17. 

18. 

19. 

20 . 

21 . 

22 . 

23. 

24. 

25. 

26. 

27. 

28. 


sin f * sin f = \ (sin t — t cos f ) is negative when f = 2n, for example. 


2 ' 

F(s) = 2 /s 2 (s 2 + 4) 

F(s ) = l/s 2 (s — 1) 

(t — t) 3 sinr dr 


m - 1 

Jo 


5. F(s) = l/(j + l)(s 2 + 1) 

7. F(s) = s/(s 2 + l) 2 

'= f e~ (, ~ T> cos2r dr 


9. /(f) = f , 
Jo 


fit) = i f (t — x)e <( r) sin2rrfr 11. f(t)= f sin(f — r)g(r) dr 

" Jo Jo 

(c) fu" 

Jo 


(1 — u) n du = 


T(m + 1) T(n + 1) 
T (m + n + 2) 


Ilf' 

y — — sin cot H / sin«;(f — r)g(r) dr 14. y = / sin(f — r) sinar dr 

a> a> J o Jo 

y=| f e” ( ' _T)/2 sin2(t — T)g(r) rfr 

Jo 

y = e _t/ 2 cosf _ i e -(/2 s j n f _|_ j e _((_T) / 2 sin(f — t)[ 1 — z/^(r)] rfr 

Jo 

y = 2e~ 2 ' + fe -2 ' + [ {t - r)e _2(,_r) g(r) rfr 

Jo 


/'t 


y = 2c ' - e 21 + | [e r) — e 2(t r) ] cosczr c/r 
1 [‘ 

y=^ j [sinh(f — t) — sin(f — r)]g(r) dr 


y = | cos t — \ cos 2 f + 


4>(s) = 


■FW 

1 + K(s) 


J" 


[2sin(f — r) — sin2(f — r)]g(r)dr 


(a) 0(f) = | (4 sin 2f — 2 sin f) 

(a) 0(f) = cosf 

(b) 0"(f) + 0(f) = 0, 0(0) = 1, 0'(O) = O 

(a) 0(f) = cosh(f) 

(b) 0"(f) - 0(f) = 0, 0(0) = 1, 0'(O) = 0 

(a) 0(f) = (1 - 2f + t 2 )e~‘ 

(b) 0"(f) + 20'(f) + 0(f) = 2e _t , 0(0) = 1, 0'(O) = -3 

(a) 0(f) = ig - ' — ^e ,/2 cos(V3f/2) + -Fe ,/2 sin(V3f/2) 

(b) 0"'(f) + 0(f) = 0, 0(0) = 0, 0'(O) = 0, 0"( 0) = 1 

(a) 0(f) = cosf 

(b) 0 (4) (O — 0(f) = 0, 0(0) = 1, 0'(O) = O, 0"(O) = -1, 0"'(O) = 0 

(a) 0(f) = 1 — -Fe _f/2 sin(->/3f/2) 

(b) 0"'(f) + 0"(f) + 0'(f) = 0, 0(0) = 1, 0'(O) = -1, 0"(O) = 1 


CHAPTER 7 Section 7.1, page 359 

1. x'j = X 2 , x' 2 = — 2xi — 0 . 5 x 2 2. x'j = X 2 , x' 2 = — 2xi — 0 . 5 x 2 + 3 sinf 

3. x'j = X 2 , x 2 = —(1 — 0.25f _2 )xi — f _ 1 x 2 4. x'j = x 2 , x! 2 — x 3 , x' 3 = X 4 , X 4 = xj 

5. x'j = x 2 , x 2 = — 4xj — 0.25 x 2 + 2 cos 3 f, xx(0) = 1, x 2 (0) = —2 

6 . x'j=X 2 , x 2 = —q(t)x\ - p(t)x 2 + g(f); xi( 0 ) = M 0 , x 2 ( 0 ) = Mq 
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7. (a) x\ = c\e~‘ + c 2 e~ 2t , X 2 = c\e~' — c 2 e~ 3 ’ 

(b) ci = 5/2, c 2 = —1/2 in solution in (a) 

(c) Graph approaches origin in the first quadrant tangent to the line X\ = x 2 . 

8. (a) x'{ — x'i — 2x\ = 0 

(b) xi = ye 2 ' - |e-', x 2 = £e 2 ' - \e~* 

(c) Graph is asymptotic to the line x 1 = 2x 2 in the first quadrant. 

9. (a) 2x" — 5-v'j + 2xi = 0 

(b) xi = — |e ,/2 — 1<? 2 ', x 2 = |e' /2 — le 2 ' 

(c) Graph is asymptotic to the line x 1 = x 2 in the third quadrant. 

10. (a) x![ + 3x[ + 2xi — 0 

(b) x\ = —le~‘ + 6e -2 ', x 2 = — 7e - ' + 9e~ 2 ' 

(c) Graph approaches the origin in the third quadrant tangent to the line x\ — x 2 . 

11. (a) x" + 4xi = 0 

(b) xi = 3 cos 27 + 4 sin 2 1, x 2 — —3 sin 2r + 4 cos 2 7 

(c) Graph is a circle, center at origin, radius 5, traversed clockwise. 

12. (a) x" + x[ + 4.25 Xl = 0 

(b) x\ = — 2e _,/2 cos 27 + 2e~ t/2 sin 2 7, x 2 = 2e _,/2 cos 2t + 2e _,/ 2 sin 27 

(c) Graph is a clockwise spiral, approaching the origin. 

13. LRCI" + LI' + RI = 0 

18- y'l = y3, y 2 = y4, rmy' 3 = -{hi + k 2 )yi + k 2 y 2 + fi(7), 
m 2 y\ = k 2 yi - (k 2 + k 3 )y 2 + F 2 (t) 

22. (a) Qi = | - ^Gi + |jQ 2 , 0i(O) = 25 

02 = 3+^01- i0 2 , 0 2 (O) = 15 

(b) 0f = 42, 0f = 36 

(c) xj = -j^Xi + ^x 2 , *i(0) = -17 

x 2 = ra x i “ | x 2, *2(0) = -21 

23. (a) Q[ =3 qi-F Ql + ^02, 0,(0) = 0? 

02 — 92 + 5)01 — 02 ’ 02 ( 0 ) = 0 2 

(b) 0f = 6(9gi + <? 2 ), 0f = 20(3(7 i + 2(72) 

(c) No 

(d) f < 0f/0f < f 


Section 7.2, page 371 



(c) 


/— 3 + 5; 
\ 2 + i 


7 + 5 A 
7 + 2 i) 


3. 



(c),(d) 



4 

-1 

-4 


(d) 

(b) 

0 
0 
1 


(8 + 7 i 
^6 - 4; 

1 3 

2 -1 

3 -1 
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4. (a) 


5. 


3-2 i 
1 + 2 


2 -;\ 
-2 + 3 i) 


(b) 


(3 + 2 i 
\2 + i 


1 — i 
-2 - 3 i 


/ \ /3 + 2i 

(c > (i-i 


2 + / 
-2 - 3i 


( 1 

-11 

-3\ 

( 5 

0 

-A 

6. (a) 11 

20 

17 

(b) 2 

7 

4 

\— 4 

3 

-12/ 

\-l 

1 

4 / 


(c) 


-8 

15 

-1 


8. (a) 4 i (b) 12-8/ (c) 2 + 2; (d) 16 


10 . 


11 . 


16. 


18. 


( 1 


-3 

2\ 


f l 

1 

3 


°\ 

f — 3 


3 

-A 

13. 

1 

3 

1 

3 


5 

V 2 


-1 

0/ 


l-l 

0 


V 






(\ 

1 

4 



Singular 




15. 

0 

1 

2 

4 








0 

V 


( ^ 


3 

1 







10 

10 






2 

10 


4 

10 

+ 

17. Singular 




\-5B 


1 

3 







10 

10/ 






/ 1 

1 

0 

1 \ 


/ 6 

13 

5 

8 

5 

1 

0 

1 

1 

19. 

5 

11 

5 

6 

5 

1 

1 

1 

1 

0 

1 

5 

1 

5 

V 0 

1 

0 

1 / 


V -2 

4 

5 

4 

5 


21. (a) 


(b) 


( c ) 


' 7e r 

5e~‘ 

I0e 2, \ 

—e‘ 

Te~‘ 

2e 2 ' 

8e‘ 

0 

—e 2 ‘J 

' 2e 2 ' 

- 2 + 3e 3 ' 

1 + 4e _2t — e' 

4e 2f 

- 1 - 3e 3 ' 

2 + 2e~ 2t + e‘ 

— 2e 2 ' 

- 3 + 6e 3 ' 

-1 + 6e- 2 ' - 2e' 

e l 

—2e~‘ 

2e 2, \ 

2e* 

—e~ l 

— 2e 2 ' 1 (d) (e 

—e l 

—3e~ f 

4e 2 7 


3e 3 ' + 2e' — e 4, \ 
6e 3 ' + e‘ + e 4 ' 
— 3e 3 ' + 3e' - 2e 4 ', 


/ 1 

2 

-1 


2e^ x 

3e~ l 


I \ 

4 

5 
1 
5 

4 / 



Section 7.3, page 383 

1. X! = — 1 x 2 = 1, X3 = — | 2. No solution 

3. x 1 = — c, x 2 = c + 1, X3 = c, where c is arbitrary 

4. X! = c, x 2 = — c, x 3 = — c, where c is arbitrary 

5. xi =0, x 2 = 0, X3 = 0 6. Xi = Ci, x 2 = c 2 , X3 = ci + 2 c 2 + 2 

7. Linearly independent 8. x (1) — 5x (2) + 2x (3) = 0 
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9. 2x (1) - 3 x (2) + 4x ( 3) - x (4) = 0 
li. x® + x® - x< 4 > = o 


14. Linearly independent 


10. Linearly independent 

13. 3x (1, (t) — 6x (2) (0 + x (3) (r) = 0 

16. M = 2, x (1) = Q) J ^2 = 4, x <2) = 


1 


17. Xi = 1 + 2 i, x (1) = ( _ ) ; k 2 = 1 - 2i, x (2) = 


1 

1 + / 


18. A, = -3, x (1 > = 


; * 2 = -l, X< 2 > = 


19. 7, = 0, x (1> = . ; A 2 = 2. 


r (2) 


20. A-i = 2, x (1) = 


21. A.i = -1/2, x 4 = 


; * 2 = —2, x (2) = 


-V3 


; * 2 = -3/2, x® = 


22. A] = 1, x (1) = I — 3 1 ; * 2 = 1 + 21, x (2) = I 1 I ; 2 l 3 = 1 — 2r, x (3) = f 1 


23. Xi = 1, x (1) = I 0 


( — 2\ 


0\ 


X 2 = 2, x (2) = 1 ; * 3 = 3, x (3) = 1 


0J 


i-ll 


24. Xi = 1, x (1) = I — 2j ; * 2 = 2, x< 2 > = I lj; * 3 = -1, x< 3 > = 


k— 2 J 


25. A.r = -1, x (1 > = I -4 I ; * 2 = -1, x (2) = I 0 


*3 = 8, x< 3 '= ll 


Section 7.4, page 389 

2. (c) W(t) = cexp / [pn(t) +p 22 (t)]df 


/■ 


(d) x' = 


6. (a) W(f) = f 2 

(b) x (1) and x (2) are linearly independent at each point except t = 0; they are linearly 
independent on every interval. 

(c) At least one coefficient must be discontinuous at t = 0. 

0 1 

v — 2r -2 2r _1 

7. (a) W(t) = t(t — 2)e‘ 

(b) x (1) and x (2) are linearly independent at each point except t = 0 and t = 2; they are 
linearly independent on every interval. 

(c) There must be at least one discontinuous coefficient at t = 0 and t = 2. 

( 0 1 \ 

t 2 - 2 x 


(d) x' = I 2 — 2 1 
\t 2 — 2t 


t 2 - 2r, 
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Section 7.5, page 398 


1. (a) x = cj ( 2 ) e ‘ + c 2 


2. (a) x = ci ( 1 ) e ' + c 2 


3. (a) x = ci L ) e' + c 2 I „ I e 


5. (a) x = ci ( * ) e 31 + c 2 


4. (a) x = ci ( _ 4 ) e 3 ' + c 2 

6. (a) x = ci f e ,/2 + c 2 


7. (a) x = ci (T) + c 2 e 2t 


9 x =d ! +4_!V 2 ' 


/_ 2\ /— 3\ 

8. (a) x = ci I I + c 2 I 1 e' 

10. x = ci e ‘ + c 2 (_ j') e ~“ 


11. x = ci ^lj e 4 ' + c 2 ^ — 2J e' + C3 ^ 0J e ' 

12. x = ci ^—4^ e ~‘ + d ^ 0^ e~‘ + C3 ^1^ e 8 ' 

/ 4 \ ( 3 \ ( °\ 

13 . x = ci — 5 I e 2 ' + c 2 — 4 e ' + C3 I lie 2 ' 

1-7/ 1-2/ V — 1/ 


A / A /A 

14. x = ci I — 4 e' + c 2 — 1 e 2 ' + C3 2 e 3 ' 


1 — 1 ; 


1 — 1 ; 


15. x = — 


2 \3 

0\ 


e 2 ' + 


2 V 1 

/I 


17. x = | -2 I e' + 2 I 1 I e 2 ' 


a; 


16. x = 


1 A 


2 V 1 


1 A 


2 V 5 


18. x = 6 ( 2 j e‘ + 3 f-2 j e~‘ - 


-8J 


20. x = Cl (A t + c 2 (A t 1 


21. x = ci A ) t 2 + C 2 


22. x = Cl [ 4 )+C 2 ^ 2 j r 2 

29. (a) x[ = x 2 , x 2 = — (c/a)x\ — (b/a)x 2 

30. (a) x=-f (-2) ^ 

(c) r = 74.39 

31. (a) x = Cl e<-2+V2 W 2 + c 2 e c- 2 -V2),/ 2; 


23. x = ci Q t 1 +c 2 (A t 2 


ci j2 = (—2 ± \/2)/2; node 
4 -l 

,V2; c " r ‘' 2 VV2 

(c) r ii2 = -1 ± ^/a; a = 1 


(b) x = Ci ( ^ ) e<- 1 +' /2 ) t + c 2 ( A e (-1 -v5)«; rj2 — _i -j- ,/2; saddle point 
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= Cl 


32. (a) 

Section 7.6, page 409 

1. (a) x = cie' 

2. (a) x = cie - ' 

3. (a) x = d L 

4. (a) x = c\e' 12 

5. (a) x = cie - ' 

6. (a) x = ci 


e 21 + c 2 


cos 2 1 
cos 2 1 + sin 2 1 


+ c 2 e‘ 


33. (a) 


/ 1 RiV 4 


\CR 2 L 


CL 


> 0 


sin2f 
— cos 2 1 + sin2f 


2 cos 2 1 
sin2f 


+ C2e 


— 2sin2f 
cos 2f 


5 cos t 
2 cos f + sin 


\ / 5 sin t 

in t) Cl cos t + 2 sin t 


5 cos 

3(cos + sin 1 1) 


+ c 2 e' 


•</2 


5 sin 

3(— cos + sin 


cos t 
2 cos t + sin t 


+ c 2 e 


sinf 
— cost + 2 sinf 


—2 cos 3 1 
cos 3t + 3 sin 3f 


) + C! (si 


—2 sin 3? 
sin 3t — 3 cos 3t 


/ 2 \ f 0 ) 

7. x = ci — 3 I e' + c 2 e cos 2 1 + c 2 e‘ 

\ 2/ \ sin 2t I 



3. x = Ci I — 2 e 2( 4- c 2 e 


9. x = e 


^cos t — 3 sin t 
i cos t — sin t 


( — \/2sin\/2r \ 

cos -s/2 t 

V— cos -s/2 1 — \/2 sin V2 1) 
10. x = e~ 


+ c 3 e 


•s/2 cos -s/2 1 \ 

sin \[2 1 

\\/2 cos \/2 1 — sin ~J2 1) 
cos t — 5 sin t 
—2 cos t — 3 sin t ) 


11. (a) r = -\±i 


12. (a) r = | ± i 


13. (a) r = a ± i (b) a = 0 

14. (a) r=(fx± 'Jot 1 — 20)/2 (b) a = -s/20, 0, -s/20 

15. (a) r = ±V4 - 5a (b) a = 4/5 16. (a) r = f ± |V3a (b) a = 0, 25/12 

17. (a) r = -1 ± (b) or = -1, 0 

18. (a) r = — i ± |s/49 — 24a (b) a = 2, 49/24 

19. (a) r = - 2 ± V« 2 + 8a - 24 (b) a = -4 - 2</l0, -4 + 2-s/IO, 5/2 

20. (a) r = -1 ± V25 + 8ff (b) a = -25/8, -3 

sin(-s/21n t) 


21. x = c\t 1 

22. x = ci 


cos (-s/2 In f) , J 

\/2sin(A/21nf) C2 


5 cos(ln t) 

2 cos(ln t ) + sin(ln t) 
23. (a) r = -\±i, 


+ c 2 


— V2 cos(\/2 In t) y 
5 sin (In f) 

— cos(ln t ) + 2 sin(ln t ) j 

24. (a) r=-i±i, i 


? c ; h \ f A _ _ „-t /2 ( cos(f/2) \ t/2 / sin(f/2) \ 

25. (b) ^j-cie ^ 4sin(f/2) J+^ ( v -4cos(//2)J 

(c) Use Ci = 2, c 2 = — | in answer to part (b). 

(d) lim I (t) = lim U(f) = 0; no 
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26. (b) 


= c\e 


cos f 


— cos t — sin f 


+ c 2 e ' 


sinf \ 
— sin f + cos t J 


(c) Use Ci = 2 and c 2 = 3 in answer to part (b). 

(d) lim 7(f) = lim V (t) = 0; no 

t— >oo t->oo 

28. (b) r = ±i^/k/m (d) |r| is the natural frequency. 


29. (c) r\ = -1, $ (1) = 


= -4, § = 


3 

-4 


(d) jci = 3ci cos f + 3c 2 sin f + 3c 3 cos 2 1 + 3c 4 sin 2f, 
x 2 — 2ci cos f + 2c 2 sin f — 4c 3 cos 2 1 — 4 c 4 sin 2f 

(e) Xj = — 3ci sin f + 3c 2 cos f — 6c 3 sin 2f + 6c 4 cos 2 1 , 
x ' 2 = — 2ci sin f + 2 c 2 cos t + 8c 3 sin 2 1 — 8c 4 cos 2 f 


30. (a) A = 


/ 0 

0 

-4 
V 9/4 


(b) n = i, S (1) = 


0 

0 

3 

-13/4 

/ 1 \ 

1 
; 

\ 1 


0 \ 
1 
0 
0 ) 


r 2 = -i, = 


l 1 \ 
1 
—i 

V ~i ) 



/ 

4 ^ 


/ 

4 \ 

r 3 = 5 2 i, = 


-3 

10; 

; r 4 = — « < 4 > = 


-3 

-10; 


V 

-T' J 


V 

f' / 


(c) y = ci 


/ cos t \ 


/ sin f \ 


cos f 


sinf 



+ C2 


+ C3 

— sinf 


cos f 


^ — sin t J 

^ COS f ) 



V 


4 cos 1 1 \ 
—3 cos 1 1 
—10 sin 1 1 
T sin \ l / 


+ C4 


(e) ci = y , c 2 = 0, c 3 = i c 4 = 0. period = 4n. 


4 sin |f \ 
—3 sin 1 1 
10 cos 1 1 
\ -y cos ft / 


/ 


31. (a) A = 


0 

0 

-2 

1 


(b) n = i, § (1 > = 


r 3 


= V3t, £ (3) = 


0 
0 
1 

-2 

/ 1 \ 

1 

; 

\ 1 

1 \ 
-1 
V3i 

V -V3 ; ) 


0 \ 
1 
0 
0 / 


r 2 = -i, ? (2) = 


/ 


/ 1 \ 
1 
—i 

V -i / 


/ 


r 4 


= -V3i, § (4) = 


1 \ 


-1 

-V3i 

V V3i ) 



/ cos f \ 


/ sin f \ 


/ cos s/3 f \ 


< sin s/3 t \ 


cos f 


sinf 


— cos s/3 f 


— sin s/3 f 

y = u 

— sinf 

+ c 2 

cos f 

+ c 3 

— /3sin s/3f 

+ C 4 

s/3 cos -s/3 f 


v — sin f / 

v cos f / 

V s/3 sin /3f / 

V — \/3cos-/3f / 

Cl = 1 , 

c 2 = 0, c 3 = —2, c 4 = 0. 
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Section 7.7, page 420 

l 


1. (b) 0(0 = 

2. (b) 0(0 = 

3. (b) 0(?) = 

4. (b) 0(0 = 

5. (b) 0(0 = 


“3 e 

-l e -‘ 

3 e 


+ fc 2< 

+ |e 2 ' 


k~ ,/2 + ±e“' 

ig-,/2 _ l g -< 

3 
2 

3 „, _ 3 — / 

2 e 2 e 


k" 3 ' + k 2 ' 


5 

A„~ 3' 


k-' /2 +k“' 
-ie'+k"' 
-k'+l e “' 


-k“ 3 '+ le 2t 


+ k 2 ' 


5 

4^-3/ 


+ k 2 ' 


/ cos t + 2 sin f 
\ sinf 


— 5sint \ 
cos t — 2 sin t) 


6. (b) 


—2e 'sin 2 A 
e~‘ cos 2? / 


7. (b) 0(0 = 

8. (b) 0(0 = 


_1»2 1 i 3 4, 
2 C ^ 2 C 

— |e 2 ' + |e 4 ' 


k 2 ' ■ 

3 


i A' 


' cos f + 2e ' sin t 
5e~‘ sin t 


e ' sin t 
' cos t — 2e~‘ sin t 



/ 

-2e~ 2 

+ 3e 



-e- 2 ' 

+ e~ 



-e~ 2 ‘ 

+ e~’ 

9. (b) 0(0 = 


Ip-tt 

2 e 

- 4e~ 

+ 

3^2r 

2 e 

ip- 1 -, 

4 e 

- 5 e 

-' + 

13 2, 
12 C 

5 -2, 

- k“ 

3 e 



Zp-2( 

2 e 

-2e~ 

- 

V' 

2 e 

7-2, 

4 e 

- I e_ 

-r _ 

13 2, 
12 e 

1_ -It 
4 e 

- k- 

3 e 


/ 

! e ' + 

1-2, 

3 e 

+ 

2 C 

-k' 

3 e 

+ l e " 

-2t 


k ! - e” 2 ' 

+ k 3 ' 

10. (b) 0(0 = 


"k- 

- k" 

3 e 

1, + g 3. 

k' 

3 e 

- |c" 

-2 1 


— 2e' + e~ 

"' + e 3 , 



_lg' _ 
6 e 

1-2, 

3 e 

+ 

1 3( 

2 e 

k' 

3 e 

- k~ 

-2 1 

- 

\e' + e -2 ' 

+ k 3 ' 


+ J-e 2 ' 
^ 12 e 

_ 2_ p 2f 
12 e 


11. X = 


17. (c) x = 


2 


M 0 

HO 


cos cot + 


Vo 

— 0) 2 Uo 


12. x = 
sincof 


e ' cos 2t + 


I e 'sin 2t 


Section 7.8, page 428 


1. (c) X = Cl 

2. (c) x = ci 

3. (c) x = ci 

4. (c) x = Ci 
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5. x = Ci I 4 e ' + c 2 1 e 2 ' + c 3 


1 te"' + 0 e 2 ‘ 


6. x = ci 1 e 2 ' + c 2 I 0 e ' + c 3 I lie' 


7. (a) x = 


V 1 / 

( 3 + 4t )e-i 

1,2 + 4 1) 



8. (a) x = 
10. (a) x = 2 


3 ' 
-lj 

:a 


13. x = ci ( 1 1 1 + c 2 


t\nt + I 1 ) t 



14. x = 
16. (b) 


C 1 I i I t 3 + c 2 


• t 3 lnr — (i) ? 3 

1 ' \4 / 


y~2 ) 
o\ 


P -t/2 


+ 


1\ t/2 /2^ 

1 - 2 / 


-r/2 


17. (b) x (1> 0) = lj e 2 ' 

(c) x (2, (f) = ^ 1^ re 2 ' + flj e 2 ' 

(d) x (3, (0 = | 1 j (t 2 / 2)e 2 ‘ + |l j te 2 ‘ + |o 




/ o 

1 

t + 2 \ 



(e) +(0 

= e 2 ' 

1 

t + 1 

(l 2 /2) + f 





l -1 

-t 

~(t 2 / 2) + 3 / 




/ 0 

1 

2 \ 

/- 3 

3 

2 \ 

(f) T = 

1 

1 

0 . 

T- 1 = 3 

-2 

- 2 


1-1 

0 

3/ 

V-i 

1 

1/ 


J = 


(2 

0 

\0 


18. (a) x (1) (0 = e‘, 



x (2) (t ) 2 j e' 
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(d) x (3) (0=1 4 I te‘ + I Ole' 


\—2l 


,-l ) 


1 

0 

2t 

\ / 1 

2 

2r \ 

0 

2 

4 1 

o 

>— < 

o 

4 

4r 

2 

-3 

—2t — 1 

/ V 2 

-2 

— 2f - 1 / 



/I 

2 

°\ 


/ 1 

-1/2 


(f) T = 

0 

4 

0 • 

T _1 = 

0 

1/4 

0 



-2 

-1/ 



-3/2 

-1/ 


J = 


/I 

0 

\o 


19. (a) J 2 = 


(c) exp(J0 = e 

(d) x = exp(J0x' 



3X" 


4A : 



/I 

0 



/ l r 

f/2 \ 

20. (c) exp(J0 = e At 

0 

1 

' 

21. (c) exp(Jr) = e A ' 

0 

1 

' 


V) 

0 

1/ 


l o 

0 

i / 


Section 7.9, page 439 


1. < -n(|W+c ([)<■-■'+ '(I )«•■ 


iG) -©'-(? 


i 




2 . x - C1 [ Y) e“ + c 2 e-' - yYjji) * + [ 2/ ^) 

„ ( 5 cos t \ ( 5 sin t \ (2\ / 1\ 

3 ' x = C1 (,2 cos f + sin r J + C2 cos t + 2 sin r J + (,1 J r cos f - (oj ? Sln ' 

— cost 

4. x = Cl (_}) e~ 3t + e 2 (J) e 2 ' - (J) e~ 2 ‘ + \ Q e' 


5. x = ci U | +c 2 




-2 


Inf + 


2 t - 2 


6. x = ci 

7. x = ci 

8. x = ci 

9. x = Ci 
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CHAPTER 


10 - X = C1 \V2) e ~ 


+ c 2 


~V2) 
1 


_ 4 , 1 (V2-1 

e — - 


3 V 2 - 


te 




5 cos / 


5 sin/ 


0 


11. x = ci I . . ) + c 2 1 ' 1 . ) + I,” ) rcost — tsint 

\2 cos/ + sin// \ — cos/ + 2sm// \1 /2 / \ 1 


5/2 


cos / 


12. x = [1 ln(sin/) — ln(— cost) — it + c{\ ( . ^cosf \ 
Ls ' ’ 5 1J \2 cos t + sm t) 


+ [= ln(sinf) - U + c 2 ] 


5 cos t 

2 cos t + sin t ) 

5 sin/ 

— cos t + 2 sin / J 


13. (a) *(») = 


e 1/2 cos 1/ 
4e _,/2 sin \t 


e~ t/2 sin |/ 

_4 e -*/ 2 cos If 


14. x = ci 

15. x = ci 


l) r + C2 (3 


r 1 - 


>\(\ 


t — 


(b) x = e~‘l 2 

, 1 

rln/ — 


4 — 4 cos 1/ 


3 \3 


r + c 2 


+ l 2i' + n(1 


'*-5(3 


Section 8.1, page 451 


1. (a) 

(b) 

(c) 

(d) 

2. (a) 

(b) 

(c) 

(d) 

3. (a) 

(b) 

(c) 

(d) 

4. (a) 

(b) 

(c) 

(d) 

5. (a) 

(b) 

(c) 

(d) 

6. (a) 

(b) 

(c) 

(d) 

7. (a) 

(b) 

(c) 

(d) 

8. (a) 

(b) 


1.1975, 1.38549, 1.56491, 1.73658 


1.19631, 1.38335, 

1.19297, 1.37730, 

1.19405, 1.37925, 

1.59980, 1.29288, 

1.61124, 1.31361, 

1.64337, 1.37164, 

1.63301, 1.35295, 1.15267, 

1.2025, 1.41603, 1.64289, 

1.20388, 1.41936, 1.64896, 

1.20864, 1.43104, 

1.20693, 1.42683, 

1.10244, 1.21426, 

1.10365, 1.21656, 

1.10720, 1.22333, 

1.10603, 1.22110, 

0.509239, 0.522187, 

0.509701, 0.523155, 

0.511127, 0.526155, 

0.510645, 0.525138, 

-0.920498, -0.857538, -0.808030, 
-0.922575, -0.860923, -0.812300, 
-0.928059, -0.870054, -0.824021, 
-0.926341, -0.867163, -0.820279, 
2.90330, 7.53999, 19.4292, 50.5614 
2.93506, 7.70957, 20.1081, 52.9779 
3.03951, 8.28137, 22.4562, 61.5496 
3.00306, 8.07933, 21.6163, 58.4462 
0.891830, 1.25225, 2.37818, 4.07257 
0.908902, 1.26872, 2.39336, 4.08799 


1.56200, 1.73308 
1.55378, 1.72316 
1.55644, 1.72638 
1.07242, 0.930175 
1.10012, 0.962552 
1.17763, 1.05334 
1.02407 
1.88590 
1.89572 
1.67042, 1.93076 
1.66265, 1.91802 
1.33484, 1.46399 
1.33817, 1.46832 
1.34797, 1.48110 
1.34473, 1.47688 
0.539023, 0.559936 
0.540550, 0.562089 
0.545306, 0.568822 
0.543690, 0.566529 


-0.770038 

-0.774965 

-0.788686 

-0.784275 
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9. 


10 . 


11 . 


12 . 


15. 

16. 

17. 

18. 
20 . 
21 . 
22 . 

24. 

25. 


26. 


l.i 


(c) 0. 

(d) 0. 

(a) 

(b) 

(c) 

(d) 

(a) 1 

(b) 1 

(c) 

(d) 1. 

(a) - 

(b) - 

(c) - 

(d) - 

(a) 0. 

(b) 0. 

(c) 0. 

(d) 0. 

1.595, 

&n + 1 = 
£n + 1 = 


958565, 1.31786, 2.43924, 4.13474 
942261, 1.30153, 2.42389, 4.11908 
95713, 5.09853, 6.41548, 7.90174 
95965, 5.10371, 6.42343, 7.91255 
96727, 5.11932, 6.44737, 7.94512 
96473, 5.11411, 6.43937, 7.93424 
60729, 2.46830, 3.72167, 5.45963 
60996, 2.47460, 3.73356, 5.47774 
61792, 2.49356, 3.76940, 5.53223 
61528, 2.48723, 3.75742, 5.51404 
1.45865, -0.217545, 1.05715, 1.41487 
1.45322, -0.180813, 1.05903, 1.41244 
1.43600, -0.0681657, 1.06489, 1.40575 
1.44190, -0.105737, 1.06290, 1.40789 
587987, 0.791589, 1.14743, 1.70973 
589440, 0.795758, 1.15693, 1.72955 
593901, 0.808716, 1.18687, 1.79291 
592396, 0.804319, 1.17664, 1.77111 
2.4636 

= [20(>„) - 1 ]h 2 , \e n+1 \ < [1 + 2maxo< ( <i \<p(t)\]h 2 , 

-- e 21n h 2 . Id | < 0.012, |e 4 | < 0.022 


e„+i = [20(60 - tn\h 2 , |e„+il < [1 + 2 max 0 <,<i |0(f)l] h 2 , 
e„+i = 2e 2,n h 2 , |dl < 0.024, |e 4 | < 0.045 

e„+i = [?„ + (>(60 + <p 3 (t n )]h 2 19. e n+1 = [19 - 15f„0' 1/2 (f„)]/; 2 /4 

e n+1 = {l + [f„ +0(6O]- 1/2 }/! 2 /4 

e„+i = (2 - [ <p{t „ ) + 2 1 \] exp[— ?„0(6O] - t n exp[-26,0(r„)])/i 2 /2 
(a) 0(f) = 1 + (1 /57r ) sin5^r? (b) 1.2, 1.0, 1.2 

(c) 1.1, 1.1, 1.0, 1.0 (d) h < l/^/50n : = 0.08 

e„+i = - j0"((„)/! 2 

(a) 1.55, 2.34, 3.46, 5.07 

(b) 1.20, 1.39, 1.57, 1.74 

(c) 1.20, 1.42, 1.65, 1.90 

(a) 0 (b) 60 (c) -92.16 27. 0.224 / 0.225 


Section 8.2, page 458 


1. 


2 . 


3. 


4. 


5. 


6 . 


(a) 

(b) 

(c) 

(a) 

(b) 

(c) 

(a) 

(b) 

(c) 

(a) 

(b) 

(c) 

(a) 

(b) 

(c) 

(a) 

(b) 

(c) 


1.19512, 1.38120, 1.55909, 1.72956 

1.19515, 1.38125, 1.55916, 1.72965 

1.19516, 1.38126, 1.55918, 1.72967 
1.62283, 1.33460, 1.12820, 0.995445 
1.62243, 1.33386, 1.12718, 0.994215 
1.62234, 1.33368, 1.12693, 0.993921 
1.20526, 1.42273, 1.65511, 1.90570 

1.20533, 1.42290, 1.65542, 1.90621 

1.20534, 1.42294, 1.65550, 1.90634 

1.10483, 1.21882, 1.34146, 1.47263 

1.10484, 1.21884, 1.34147, 1.47262 
1.10484, 1.21884, 1.34147, 1.47262 
0.510164, 0.524126, 0.542083, 0.564251 
0.510168, 0.524135, 0.542100, 0.564277 
0.510169, 0.524137, 0.542104, 0.564284 
-0.924650, -0.864338, -0.816642, -0.780008 
-0.924550, -0.864177, -0.816442, -0.779781 
-0.924525, -0.864138, -0.816393, -0.779725 
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7. 


9. 

10 . 

11 . 

12 . 

15. 

16. 

17. 

18. 
20 . 
23. 
25. 


(a) 2.96719, 7.88313, 20.8114, 

(b) 2.96800, 7.88755, 20.8294, 

(a) 0.926139, 1.28558, 2.40898. 

(b) 0.925815, 1.28525, 2.40869, 

(a) 3.96217, 5.10887, 6.43134, 

(b) 3.96218, 5.10889, 6.43138, 
1.61263, 2.48097, 3.74556, 
1.61263, 2.48092, 3.74550, 
-1.44768, -0.144478. 
-1.44765, -0.143690, 


(a) 

(b) 

(a) 

(b) 

(a) 0.590897, 0.799950, 

(b) 0.590906, 0.799988, 
e„ +1 = (38/z 3 /3) exp(4f„), 
e n +i = (2/z 3 /3) exp(2r„), 
e«+i = (4/z 3 /3) exp(27„), 
h = 0.071 

h = 0.081 

1.19512, 1.38120, 1.55909, 
1.20526, 1.42273, 1.65511, 


55.5106 
55.5758 
4.10386 
4.10359 
7.92332 
7.92337 
5.49595 
5.49589 
1.06004, 1.40960 
1.06072, 1.40999 


1.16653, 1.74969 
1.16663, 1.74992 

|e n+ i| < 37,758.8/z 3 on 0 < t < 2, \e t \ < 0.00193389 
\e„+i\ < 4.92604/; 3 on 0 < t < 1, |ei| < 0.000814269 
\e„ + 1 | < 9.85207/; 3 on 0 < t < 1, \e\\ < 0.00162854 

19. h = 0.023 
21. h = 0.117 

1.72956 24. 1.62268, 1.33435, 1.12789, 0.995130 

1.90570 26. 1.10485, 1.21886, 1.34149, 1.47264 


Section 8.3, page 

463 



1 . 

(a) 

1.19516, 

1.38127, 

1.55918, 

1.72968 


(b) 

1.19516, 

1.38127, 

1.55918, 

1.72968 

2. 

(a) 

1.62231, 

1.33362, 

1.12686, 

0.993839 


(b) 

1.62230, 

1.33362, 

1.12685, 

0.993826 

3. 

(a) 

1.20535, 

1.42295, 

1.65553, 

1.90638 


(b) 

1.20535, 

1.42296, 

1.65553, 

1.90638 

4. 

(a) 

1.10484, 

1.21884, 

1.34147, 

1.47262 


(b) 

1.10484, 

1.21884, 

1.34147, 

1.47262 


5. (a) 0.510170, 0.524138, 0.542105, 0.564286 
(b) 0.520169, 0.524138, 0.542105, 0.564286 

6. (a) -0.924517, -0.864125, -0.816377, -0.779706 
(b) -0.924517, -0.864125, -0.816377, -0.779706 

7. (a) 2.96825, 7.88889, 20.8349, 55.5957 
(b) 2.96828, 7.88904, 20.8355, 55.5980 

8. (a) 0.925725, 1.28516, 2.40860, 4.10350 
(b) 0.925711, 1.28515, 2.40860, 4.10350 

9. (a) 3.96219, 5.10890, 6.43139, 7.92338 
(b) 3.96219, 5.10890, 6.43139, 7.92338 

10. (a) 1.61262, 2.48091, 3.74548, 5.49587 
(b) 1.61262, 2.48091, 3.74548, 5.49587 

11. (a) -1.44764, -0.143543, 1.06089, 1.41008 
(b) -1.44764, -0.143427, 1.06095, 1.41011 

12. (a) 0.590909, 0.800000, 1.166667, 1.75000 
(b) 0.590909, 0.800000, 1.166667, 1.75000 


Section 8.4, page 469 

1. (a) 1.7296801, 1.8934697 

(b) 1.7296802, 1.8934698 

(c) 1.7296805, 1.8934711 


766 


Answers to Problems 


2. (a) 

(b) 

(c) 

3. (a) 

(b) 

(c) 

4. (a) 

(b) 

(c) 

5. (a) 

(b) 

(c) 

6. (a) 

(b) 

(c) 

7. (a) 

(b) 

(c) 

8. (a) 

(b) 

(c) 

9. (a) 

(b) 

(c) 

10. (a) 

(b) 

(c) 

11. (a) 

(b) 

(c) 

12. (a) 

(b) 

(c) 


0.993852, 0.925764 
0.993846, 0.925746 
0.993869, 0.925837 
1.906382, 2.179567 
1.906391, 2.179582 
1.906395, 2.179611 
1.4726173, 1.6126215 
1.4726189, 1.6126231 
1.4726199, 1.6126256 
0.56428577, 0.59090918 


0.56428581, 

0.56428588, 

-0.779693, 

-0.779692, 

-0.779680, 

2.96828. 

2.96829. 


0.59090923 
0.59090952 
-0.753135 
-0.753137 
-0.753089 
7.88907, 20.8356, 
7.88909, 20.8357, 


2.96831, 7.88926, 20.8364, 
0.9257133, 1.285148, 2.408595. 
0.9257124, 1.285148, 2.408595. 
0.9257248, 1.285158, 2.408594. 
3.962186, 5.108903, 6.431390, 
5.108903. 

5.108903. 

2.480909. 

2.480909. 


3.962186, 

3.962186, 

1.612622, 

1.612622, 


55.5984 

55.5986 

55.6015 

4.103495 
4.103495 
4.103493 
7.923385 
7.923385 
7.923385 

5.495872 

5.495873 


6.431390, 

6.431390, 

3.745479, 

3.745479, 

1.612623, 2.480905, 3.745473, 5.495869 
-1.447639, -0.1436281, 1.060946, 1.410122 
-1.447638, -0.1436762, 1.060913, 1.410103 
-1.447621, -0.1447219, 1.060717, 1.410027 
0.5909091, 0.8000000, 1.166667, 1.750000 
0.5909091, 0.8000000, 1.166667, 1.750000 
0.5909092, 0.8000002, 1.166667, 1.750001 


Section 8.5, page 479 

1 . (b) 0 2 (O — 0i (0 = 0.0010 — > cxi as t — ^ oo 

2. (b) 0i(O = ln[e'/(2 - 0)]; 0,(0 = ln[l/(l - 0] 

3. (a,b) h = 0.00025 is sufficient, (c) h — 0.005 is sufficient. 

4. (a) y = 4<?^ 10 ' + (f 2 /4) (c) Runge-Kutta is stable for h = 0.25 but unstable for h — 0.3. 
(d) h = 5/13 = 0.384615 is small enough. 

5. (a) y = t 6. (a) y = t 2 


Section 8.6, page 483 

1. (a) 1.26, 0.76; 1.7714, 1.4824; 2.58991, 2.3703; 3.82374, 3.60413; 

5.64246, 5.38885 

(b) 1.32493, 0.758933; 1.93679, 1.57919; 2.93414, 2.66099; 4.48318, 4.22639; 

6.84236, 6.56452 

(c) 1.32489, 0.759516; 1.9369, 1.57999; 2.93459, 2.66201; 4.48422, 4.22784; 

6.8444, 6.56684 

2. (a) 1.451, 1.232; 2.16133, 1.65988; 3.29292, 2.55559; 5.16361, 4.7916; 

8.54951, 12.0464 
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CHAPTER 9 


3. 


4. 


5. 


6 . 


(b) 1.51844, 1.28089; 2.37684, 1.87711; 3.85039, 3.44859; 6.6956, 9.50309; 

15.0987, 64.074 

(c) 1.51855, 1.2809; 2.3773, 1.87729; 3.85247, 3.45126; 6.71282, 9.56846; 

15.6384, 70.3792 


-0.32208, 1.20347; 


-0.322081, 1.20347; 


(a) 0.582, 1.18; 0.117969, 1.27344; -0.336912, 1.27382; -0.730007, 1.18572; 

-1.02134, 1.02371 

(b) 0.568451, 1.15775; 0.109776, 1.22556; 

-0.681296, 1.10162; -0.937852, 0.937852 

(c) 0.56845, 1.15775; 0.109773, 1.22557; 

-0.681291, 1.10161; -0.937841, 0.93784 

(a) -0.198, 0.618; -0.378796, 0.28329; -0.51932, -0.0321025; 

-0.594324, -0.326801; -0.588278, -0.57545 

(b) -0.196904, 0.630936; -0.372643, 0.298888; 

-0.561270, -0.288943; -0.547053, -0.508303 

(c) -0.196935, 0.630939; -0.372687, 0.298866; 

-0.561292, -0.28907; -0.547031, -0.508427 

2.34119, 1.67121; 1.90236, 1.97158; 


-0.501302, -0.0111429; 


-0.501345, -0.0112184; 


(a) 2.96225, 1.34538; 
1.29768, 2.46732 

(b) 3.06339, 1.34858; 
1.3555, 2.5175 

(c) 3.06314, 1.34899; 
1.35514, 2.51827 

(a) 1.42386, 2.18957; 
2.9955, 2.83354 

(b) 1.41513, 2.18699; 
2.97806, 2.82487 

(c) 1.41513, 2.18699; 
2.97806, 2.82488 


1.56602, 2.23895; 


2.44497, 1.68638; 1.9911, 2.00036; 1.63818, 2.27981; 


2.44465, 1.68699; 1.99075, 2.00107; 1.63781, 2.28057; 


1.82234, 2.36791; 
1.81208, 2.36233; 


2.21728, 2.53329; 
2.20635, 2.5258; 


2.61118, 2.68763; 
2.59826, 2.6794; 


1.81209, 2.36233; 2.20635, 2.52581; 2.59826, 2.67941; 


7. For h = 0.05 and 0.025: x = 10.227, y = —4.9294; these results agree with the exact solu- 
tion to five digits. 

8. 1.543, 0.0707503; 1.14743, -1.3885 9. 1.99521, -0.662442 


Section 9.1, page 494 

1. (a) r\ = —1, § (1) = (l,2) r ; r 2 = 2, £ (2 ' = (2, l) r (b) saddle point, unstable 

2. (a) r\ = 2, £ (1) = (l,3) r ; r 2 = 4,£ {2) = (l,l) r (b) node, unstable 

3. (a) /-] = —1, £ (1) = (l,3) r ; r 2 = 1,§ <2) = (1, l) r (b) saddle point, unstable 

4. (a) n = r 2 = —3, £ (l1 = (l,l) r (b) improper node, asymptotically stable 

5. (a) r 2 ,r 2 = — 1 ± i; £ (1 \£ <2) = (2 ± i, l) r (b) spiral point, asymptotically stable 

6. (a) r 2 ,r 2 = ±i\ £ {1) , £ (2) = (2 ± i, 1) T (b) center, stable 

7. (a )ri,r 2 = l±2i; £ (1) ,| <2) = (l.lqp i) r (b) spiral point, unstable 

8. (a) ri = -1, $ (1) = (l,0) r ; r 2 = -1/4,| <2) = (4,-3) r 
(b) node, asymptotically stable 

9. (a) r\ = r 2 = 1, £ a) = (2, l) r (b) improper node, unstable 

10. (a) ri,r 2 = ±3i; td), t(2) _ n — 1 ± 3i) T (b) center, stable 

11. (a) n = r 2 = -1; | (1) = (l,0) r , ^ (2) = (0,l) r 
(b) proper node, asymptotically stable 

12. (a) r\,r 2 = (1 ± 3t)/2; ^ ll) ,| (2) = (5,3 3 i) T (b) spiral point, unstable 

13. x 0 = 1, y 0 = 1; r 2 — \fl, r 2 = — V2; saddle point, unstable 

14. xq = —1, yo = 0; r\ = — 1 ,r 2 = —3; node, asymptotically stable 

15. jco = —2, = 1; r 2 ,r 2 = — 1 ± a/2 i; spiral point, asymptotically stable 

16. xg — y/S, yg = a//?; r\,r 2 = ±/85i; center, stable 
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17. c 2 > Akin, node, asymptotically stable; c 2 = 4 km, improper node, asymptotically stable; 
c 2 < Akin, spiral point, asymptotically stable 


Section 9.2, page 506 


1 . 

2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 
9. 

10 . 

11 . 

12 . 

13. 

14. 

15. 

16. 

17. 

19. 

21 . 

23. 


x — 4<?~ f , y = 2e~ 2 ', y = x 2 /8 
x = 4e~', y = 2e 2 ', y = 32x -2 ; x = 4e~ f , y = 0 

x = 4cosf, y = 4sinf, x 2 +y 2 = 16; x = — 4sinf, y = 4cost, x 2 +y 2 = 16 

x = ^flcos \fab t, y = —Vb sin Vabt; (x 2 /a) + (y 2 /b) = 1 

(a, c) (— i,l), saddle point, unstable; (0,0), (proper) node, unstable 

(a, c) (— V3/3, — j), saddle point, unstable; (V3/3, — i), center, stable 

(a, c) (0,0), node, unstable; (2,0), node, asymptotically stable; 

(0, |), saddle point, unstable; (—1,3), node, asymptotically stable 
(a, c) (0,0), node, asymptotically stable; (1, —1), saddle point, unstable; 

(1, —2), spiral point, asymptotically stable 
(a, c) (0,0), spiral point, asymptotically stable; 

(1 — V 2, 1 + \/2), saddle point, unstable; (1 + \f2, 1 — \fl), saddle point, unstable 
(a, c) (0,0), saddle point, unstable; (2,2), spiral point, asymptotically stable; 
(-1,-1), spiral point, asymptotically stable; (—2,0), saddle point, unstable 
(a, c) (0,0), saddle point, unstable; (0,1), saddle point, unstable; 

(i,j), center, stable; (— |,^), center, stable 

(a, c) (0,0), saddle point, unstable; (V6, 0), spiral point, asymptotically stable; 

( — V^6, 0), spiral point, asymptotically stable 

(a, c) (0,0), saddle point, unstable; (—2,2), node, unstable; 

(4,4), spiral point, asymptotically stable 

(a, c) (0,0), saddle point, unstable; (2,0), saddle point, unstable; 

(1,1), spiral point, asymptotically stable; (—2,-2), spiral point, asymptotically stable 
(a,c) (0,0), node, unstable; (1,1), saddle point, unstable; 

(3,-1), spiral point, asymptotically stable 

(a,c) (0,1), saddle point, unstable; (1,1), node, asymptotically stable; 

(—2,4), spiral point, unstable 


(a) 4x 2 — y 2 = c 
(a) (y-2x) 2 (x + y) = c 
(a) 2x 2 y -2 xy + y 2 = c 
(a) (y 2 / 2) — cos* = c 


18. (a) 4x 2 + y 2 = c 

20. (a) arctan(y/x) — In ^Jx 2 + y 2 = c 

22. (a) x 2 y 2 - 3x 2 y -2 y 2 = c 

24. (a) x 2 +y 2 — (x 4 /12) = c 


Section 9.3, page 516 

1. linear and nonlinear: saddle point, unstable 

2. linear and nonlinear: spiral point, asymptotically stable 

3. linear: center, stable; nonlinear: spiral point or center, indeterminate 

4. linear: improper node, unstable; nonlinear: node or spiral point, unstable 

5. (a, b, c) (0,0); a! = — 2u + 2v, v' = Au + Av; r = 1 ± VT7; 
saddle point, unstable 

(—2,2); v! = An, v' = 6u + 6u; r = 4,6; node, unstable 

(4,4); u' = —6u + 6v, v' = —8 u: r = — 3±-i/39i; spiral point, asymptotically 

stable 

6. (a, b, c) (0,0); u' = u, v' = 3v; r= 1,3; node, unstable 
(1,0); u’ = —u — v, v' = 2v; r=— 1,2; saddle point, unstable 

(0, | ); w' = — \u, v' = — | u — 3v; r = —^,—3; node, asymptotically stable 
(—1,2); u' = u + v, v' = —2u — Av\ r = (— 3 ± vT7)/2; saddle point, unstable 
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7. (a, b, c) (1, 1); u' — —v, v' = 2 u — 2d; r = — 1 ± i; spiral point, asymptotically 
stable 

(—1, 1); u' = — d, d' = —2m — 2d; r = — 1 ± V3; saddle point, unstable 

8. (a,b,c) (0,0); u' = u, v' = ^v; r = 1, |; node, unstable 

(0,2); m' = — u, v ' = — 2 u—\v, r = — 1,— 1; node, asymptotically stable 

(1.0) ; m' = — m — d, t/ = — jt>; r = — 1, — J; node, asymptotically stable 

(|, |); «' = -^it-§u, u' = — |m — | w; r = (—5 ± \/57)/16; 

saddle point, unstable 

9. (a, b, c) (0,0); u' = — u + 2v, d' = m + 2d; r = ( l±\/T7/2; saddle point, unstable 

(2.1) ; u! = — §m + 3d, v' = —2u; r = (— 3 ± V87D/4; spiral point, asymptotically 

stable 

(2,-2); u' = — 3d, v' = u; r = ±V3i; center or spiral point, indeterminate 

(4,-2); !(' = — 4d, v’ = —u — 2v\ r = — 1 ± V5; saddle point, unstable 

10. (a, b, c) (0,0); u' = u, v' = v\ r = l,l; node or spiral point, unstable 

(—1,0); m' = — m, v' = 2v\ r = — 1,2; saddle point, unstable 

11. (a, b, c) (0,0); u' = 2u+v, v' = u — 2v\ r = ±V 5; saddle point, unstable 

(-1.1935, -1.4797); m' = -1.2399m - 6.8393u, u' = 2.4797m - 0.80655u; 

r = —1.0232 ± 4.1125/; spiral point, asymptotically stable 

12. (a, b, c) (0, ±2nn), n = 0,1,2,...; u' = v, v' = — m; 
r = ±i; center or spiral point, indeterminate 

[2, ±(2/i — l)7r], ii = 1,2,3,...; u! = — 3v, v' = — m; r = ±->/3; 
saddle point, unstable 

13. (a, b, c) (0,0); u' = u, v' = v; r = 1, 1; node or spiral point, unstable 

(1. 1) ; u' = u — 2u, v' — —2m + d; r = 3, —1; saddle point, unstable 

14. (a, b, c) (1,1); u'=—u — v, v' = u — 3v, r=— 2,-2; 

node or spiral point, asymptotically stable 

(-1,-1); u' = u + v, v' = u — 3d; r = — 1 ± V5; saddle point, unstable 

15. (a, b, c) (0,0); u' = —2u—v, v' = u — v\ r = (— 3 ± V3i)/2; 
spiral point, asymptotically stable 

(-0.33076, 1.0924) and (0.33076, -1.0924); u' = -3.5216 u - 0.27735d, 
d' = 0.27735m + 2. 6895d; r= —3.5092, 2.6771; saddle point, unstable 

16. (a, b, c) (0,0); u' = u+v, v' = — u + v; r = 1 ± ;; spiral point, unstable 

17. (a,b,c) (2,2); u' = — 4d, d' = — |m+|d; r = (7 ± \/273)/4; saddlepoint, unstable 

(—2,-2); m' = 4d, v' = jii — |d; r = (— 1 ± V33)/4; saddlepoint, unstable 

(— 1,2); u’ = — 4d, d ' =\u\ r = ±vT4 i; center or spiral point, indeterminate 
( — | , — 2); m' = 4d, d' = — ^m; r = ±\/2i; center or spiral point, indeterminate 

18. (a, b, c) (0,0); u' = 2u — v, v' = 2u — 4d; r=— 1±V7; saddlepoint, unstable 

(2.1) ; u! = — 3d, v' = 4m — 8d; r=— 2,-6; node, asymptotically stable 

(—2,1); m' = 5d, d' = — 4m; /• = ±2v / 5/; center or spiral point, indeterminate 

(-2,-4); m' = 10m — 5d, v' = 6 u; r = 5±*/5i; spiral point, unstable 

21. (b, c) Refer to Table 9.3.1. 

23. (a) R = A, T = 3.17 (b) R = A, T = 3.20, 3.35, 3.63, 4.17 

(c) T -> 7r as A — >■ 0 (d) A = 7r 

24. (b) v c = 4.00 

25. (b) d c = 4.51 

30. (a) dx/dt = y , dy/dt = — g(x) — c(x)y 

(b) The linear system is dx/dt = y, dy/dt = —g’( 0)x — c(0)y. 

(c) The eigenvalues satisfy r 2 + c(0)r + g'(0) = 0. 
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Section 9.4, page 530 

1. (b,c) (0,0); a' = |«, v' = 2u; r= |,2; node, unstable 

(0,2); u' = \u, v'=—\u — 2v; r=|,— 2; saddle point, unstable 

(|,0); u' = —^u—\v, v' = lv; r=— f, ?; saddle point, unstable 

(j,j); u! = — \u— |u, v / = — — I w ; r = ( — 22 ± \/204)/20; 

node, asymptotically stable 

2. (b,c) (0,0); «' = f w, v' = 2v; r= |,2; node, unstable 

(0,4); tt' = — |m, v' = —6u — 2v; r = —\,— 2; node, asymptotically stable 
(|,0); u' = — \u— ^v, v' = — jd; r = — |; node, asymptotically stable 

(1.1) ; u' = —u—\v, v' = —\u—\v, r = (— 3 ± Vl3)/4; 
saddle point, unstable 

3. (b,c) (0,0); u' — |w, v' = 2v\ r = |,2; node, unstable 

(0,2); u! = —\u, v' = —\u — 2v\ r = —\,—2 ; node, asymptotically stable 

(3,0); u' = — \u — 3v, = r=— — y; node, asymptotically stable 

(|, |i); m' = -\u- i/ = — |§m — gt>; r = -1.80475, 0.30475; 
saddle point, unstable 

4. (b,c) (0,0); m' = |m, t/=^u; r=|,|; node, unstable 

(0, |); w' = jm, d' = — Id; r = ±|; saddle point, unstable 

(3.0) ; «' = — — 3i>, v' = |n; r=— 1,|; saddle point, unstable 

(2, j); u' = —u — 2v, v' — — r = —1.18301, —0.31699; 

node, asymptotically stable 

5. (b,c) (0,0); u' = m, i/ = |u; r = 1, |; node, unstable 

(0, |); u! = — \u, v' = — \u— fu; r=— 1, — |; node, asymptotically stable 

(1.0) ; «' = — u — v, v' = jv; r = —1, i; saddle point, unstable 

6. (b,c) (0,0); u' = u, v' = |v; r = 1, |; node, unstable 

(0, |); u' = -^m, v'—fjU—^v; r=^, — |; saddle point, unstable 

(1,0); u! = — «+|a, a' = r=— 1, saddle point, unstable 

(2.2) ; m' = — 2u + a, v’ = |w — 3a; a = (—5 ± >/3)/2; 

node, asymptotically stable 

8. (a) Critical points are x = 0,y = 0; x = e\/ai,y = 0; x = 0,y = €2/02- 
x — >• 0, y — >■ 62/^2 as t — >■ 00; the redear survive. 

(b) Same as part (a) except x — ► ei/cri, y — >• 0 as f — >■ 00; the bluegill survive. 

9. (a) X = (B - yiR)/{l - 5/1/2), Y = {R - y 2 B)/{l - yiy 2 ) 

(b) X is reduced, Y is increased; yes, if B becomes less than yii?, then x — >■ 0 and y —>■ R 
as f — 00. 

10. (a) o-i e 2 - a 2 ei / 0: (0,0), (0, e 2 /er 2 ), foM,0) 

CT162 — <*2^1 = 0: (0,0), and all points on the line crix + aiy = ei 

(b) cti 62 — ff2«i > 0: (0,0) is unstable node; (ei/eri,0) is saddle point; 

(0, 62/^2) is asymptotically stable node. 

0162 — ar 2 ei < 0: (0,0) is unstable node; (0, € 2 / 02 ) is saddle point; 

(ei/oi,0) is asymptotically stable node. 

(c) (0, 0) is unstable node; points on the line o\x + a 2 y = are stable, nonisolated critical 
points. 

12. (a) (0,0), saddle point; (0. 15, 0), spiral point if y 2 < 1.11, nodeif/ 2 > 1.11; (2,0), saddle 
point 

(c) / = 1.20 
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13. (b) (2 — ^4 - fa, fa), (2 + ^4 - fa, fa) 

(c) (1,3) is an asymptotically stable node; (3,3) is a saddle point 

(d) ao = 8/3; critical point is (2, 4); X = 0, — 1 

14. (b) (2 — ^4 - fa, fa), (2 + ^4- fa, fa) 

(c) 1,3) is a saddle point; (3, 3) is an unstable spiral point 

(d) a 0 = 8/3; critical point is (2,4); X = 0,1 

15. (b) ([3 - V9 - 4a]/2, [3 + 2a - s/9 - 4a]/2), 

([3 + v / 9^4^]/2, [3 + 2a + 79^4^1/2) 

(c) (1,3) is a saddle point; (2, 4) is an unstable spiral point 

(d) a 0 = 9/4; critical point is (3/2, 15/4); X = 0,0 

16. (b) ([3 - s/9 - 4a] /2, [3 + 2a - 79^4a]/2), 

([3 + s/9^a\/2, [3 + 2a + 79 - 4a] /2) 

(c) (1,3) is a center of the linear approximation and also of the nonlinear system; (2, 4) is 
a saddle point 

(d) a 0 = 9/4; critical point is (3/2, 15/4); X = 0, 0 

17. (b) Pi(0,0), P 2 (1, 0), P 3 (0,a), P 4 (2 — 2a, — 1 + 2a). P 4 is in the first quadrant for 

0.5 < a < 1. 

(c) a = 0; P 3 coincides with P \ . a = 0.5; P 4 coincides with P 2 . a = 1; P 4 coincides 
with P 3 . 

( 1 — 2 x — y —x 

— 0.5y a - 2y - 0.5x 

(e) Pi is an unstable node for a > 0. P 2 is an asymptotically stable node for 0 < a < 0.5 
and a saddle point for a > 0.5. P 3 is a saddle point for 0 < a < 1 and an asymptotically 
stable node for a > 1. P 4 is an asymptotically stable node for 0.5 < a < 1. 

18. (b) Pi (0,0), P 2 (1,0), P 3 (0,0.75/a), P 4 [(4a - 3)/(4a - 2),l/(4a - 2)]. P 4 is in the 

first quadrant for a > 0.75. 

(c) a = 0.75 ;P 3 coincides with P 4 . 

1 — 2 x — y —x 

— 0.5y 0.75 — 2 ay — 0.5x 

(e) Pi is an unstable node. P 2 is a saddle point. P 3 is an asymptotically stable node for 
0 < a < 0.75 and a saddle point for a > 0.75. P 4 is an asymptotically stable node for 
a > 0.75. 

19. (b) Pi(0,0), P 2 (1,0), P 3 (0,a), P 4 (0.5,0. 5). In addition, fora = 1 every point on the 

line x + y = 1 is a critical point. 

(c) a = 0; P 3 coincides with Pi. Also a = 1. 

( 1 — 2x — y —x 

-(2a - 1 )y a - 2 y- (2a - l)x 

(e) Pi is an unstable node for a > 0. P 2 and P 3 are saddle points for 0 < a < 1 and 
asymptotically stable nodes for a >1. P 4 is an asymptotically stable spiral point for 
0 < a < 0.5, an asymptotically stable node for 0.5 < a < 1, and a saddle point for a > 1. 

Section 9.5, page 540 

1. (b,c) (0,0); u' = f u, n' = — fu; /- = f, — f; saddle point, unstable 

(|, 3); u' = — \v, v' = 3m; r = ±s/3i/2; center or spiral point, indeterminate 

2. (b,c) (0,0); u! = u, v' = — ^v; r = 1, — 2. ; saddle point, unstable 

(f,2); u' = —\v, v' = w; r=±fi; center or spiral point, indeterminate 
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3. 


4. 


5. 


6 . 


7. 

8 . 

9. 

11 . 


12 . 


13. 

14. 

15. 

16. 


(b,c) (0,0); u' = u, v' = — jv; 

(2,0); u' = — u — v, v' = |u; r = — 1 


m' = - iu. 


(i 5). 

\ 2 > 2’’ 

asymptotically stable 
(b, c) (0,0); u' = |m, t/ = — u; 

(!.°); 

( 1 , 1 ); 


r = 1,-1; saddle point, unstable 
saddle point, unstable 
■ = (—1 ± -v/TT 0/8; spiral point, 


v' = ,u; 


, 9 9 

11 =-H u -T 6 v ’ 


u = — u — li>, n = 


r = — 1; saddle point, unstable 

r = — | , 1 ; saddle point, unstable 
r = (—1 ± \/CL5)/2; node, asymptotically stable 


(b,c) (0,0); u! = —u, v' = — 
(1,0); m' = |m- |jt), t/ = —v; 

v' = ii>; 


( 2 , 0 ); 

(1 I)' 
"2 /» 


u' = — 3u — 




/ 3 9 

U = - %V, 


2 ’ 32 

asymptotically stable 
(b,c) t = 0,T,2T,...: 
t = r/4,5T/4,... : 

r = 772,3772,... : 
f = 3774,7774,... : 


= I 


r = — 1, — 1; node, asymptotically stable 
r = — 1, 1; saddle point, unstable 
r = — 3, saddle point, unstable 

r = (—3 ± V390/8; spiral point, 


77 is a max., 
dH /dt is a min., 
77 is a min., 
dH /dt is a max.. 


dP/dt is a max. 
P is a max. 
dP/dt is a min. 
P is a min. 


(a) yfcot/yfay (b) V3 

(d) The ratio of prey amplitude to predator amplitude increases very slowly as the initial 
point moves away from the equilibrium point. 

(a) 4 jt/V3 = 7.2552 


(c) The period increases slowly as the initial point moves away from the equilibrium point, 

(a) T = 6.5 (b) T = 3.7, T = 11.5 (c) T = 3.8, T = 11.1 

(a) Pi (0,0), P 2 (l/cr,0), P 3 ( 3,2 — 6cr); P 2 moves to the left and P 3 moves down; they 
coincide at (3, 0) when a = 1/3. 

(b) Pi is a saddle point. P 2 is a saddle point for a < 1/3 and an asymptotically stable 
node for cr > 1/3. P 3 is an asymptotically stable spiral point for a < ay = (s/7]3 — l)/2 = 
0.2638, an asymptotically stable node for ai < cr < 1/3, and a saddle point for a > 1/3. 

(a) Pi (0,0), P 2 (a/cr, 0), P 3 [c/y, (a/ar) — (ccr/aj/)]; P 2 moves to the left and P 3 moves 
down; they coincide at (c/y , 0) when cr = ay /c. 

(b) Pi is a saddle point. P 2 is a saddle point for a <ay/c and an asymptotically stable 
node for a > ay /c. P 3 is an asymptotically stable spiral point for sufficiently small values 
of a and becomes an asymptotically stable node at a certain value cri < ay /c. P 3 is a 
saddle point for a > ay /c. 

(a,b) Pi (0,0) is a saddle point; P 2 (5,0) is a saddle point; P 3 (2,2.4) is an asymptotically 
stable spiral point. 

(b) same prey, fewer predators (c) more prey, same predators 

(d) more prey, fewer predators 

(b) same prey, fewer predators (c) more prey, fewer predators 

(d) more prey, even fewer predators 

(b) same prey, fewer predators (c) more prey, same predators 

(d) more prey, fewer predators 


Section 9.7, page 563 

1. r = 1, 0 = t + t 0 , stable limit cycle 2. r = 1, 9 = — t + t 0 , semistable limit cycle 

3. r = 1, 9 = t + to, stable limit cycle; r = 3, 9 = t + t 0 , unstable periodic solution 

4. r = 1, 0 = — f + to, unstable periodic solution; r = 2, 0 = — f + to, stable limit cycle 

5. r = 2n — 1, 0 = t + to, n = 1,2,3, ... , stable limit cycle; 

r = 2«, 0 = f + t 0 , n = 1, 2, 3, . . . , unstable periodic solution 

6. r = 2, 9 = —t + t 0 , semistable limit cycle; 

r = 3, 9 = — t + to, unstable periodic solution 
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8. (a) Counterclockwise 

(b) r = 1, 9 = t + to, stable limit cycle; r — 2, 9 = t + to, semistable limit cycle; 
r = 3, 9 = t + to, unstable periodic solution 

9. r = \/2, 0 = — / + to, unstable periodic solution 

14. (a) /x = 0.2, 7 = 6.29; /x = 1, 7 = 6.66; /x = 5, 7=11.60 

15. (a) x' = y, y' = — x + juy — /xy 3 /3 

(b) 0 < /x < 2, unstable spiral point; /x > 2, unstable node 

(c) A = 2.16, 7 = 6.65 

(d) /x = 0.2, A = 1.99, 7 = 6.31; /x = 0.5, A = 2.03, 7 = 6.39; 

= 2, A = 2.60, 7 = 7.65; /x = 5 4.36, 7 = 11.60 

16. (b) x' = /xx 4- .y, / = — x 4- /xy; X = /x ± /; the origin is an asymptotically stable 

spiral point for /x < 0 and an unstable spiral point for /x > 0. 

(c) r' = r(/x — r 2 ), 0' = —1 

17. (a) The origin is an asymptotically stable node for /x < —2, an asymptotically stable spiral 
point for —2 < /x < 0, an unstable spiral point for 0 < /x < 2, and an unstable node for 
/x > 2. 

18. (a, b) (0, 0) is a saddle point; (12, 0) is a saddle point; (2, 8) is an unstable spiral point. 

19. (a) (0,0), (5a, 0), (2, 4« — 1.6) 

(b) r = -0.25 + 0.125a ± 0.25V220 - 400a + 25a 2 ; a 0 = 2 

20. (b) X = [-(5/4 -b)± y/ (5/4 - b) 2 - l] /2 

(c) 0 < fe < 1/4: asymptotically stable node; 1/4 < h < 5/4: asymptotically stable spi- 
ral point; 5/4 < b < 9/4: unstable spiral point; 9/4 < b\ unstable node 

(d) b 0 = 5/4 

21. (b) k = 0, (1.1994, -0.62426); k = 0.5, (0.80485, -0.13106) 

(c) X 0 = 0.3465, (0.95450, -0.31813) 

(d) k = 0.4, 7 = 11.23; X = 0.5, 7=10.37; X = 0.6, 7 = 9.93 

(e) Xi = 1.4035 

Section 9.8, page 573 

1. (b) X=Xi, § (1) = (0,0, l) 7 ’; X = X 2 , $ (2) = (20,9- V81 + 40r,0) r ; 

X = X 3 , ? (3) = (20,9 + V81 + 40r,0) r 

(c) X! = -2.6667, | (1) = (0,0, l) r ; X 2 = -22.8277, | (2) = (20, -25.6554, 0) r ; 

X 3 = 11.8277, ? (3) = (20, 43.6554, 0) r 

2. (c) Xi = -13.8546; X 2 ,X 3 = 0.0939556 ± 10.1945/ 

5. (a) dV/dt = —2 a[rx 2 + y 2 + b(z - r) 2 - br 2 ] 

11. (b)c = V05: 7i(V2/4,-V2,V2); X = 0, -0.05178 ± 1.5242/ 

c = 1 : 7, = (0.8536, -3.4142, 3.4142); X = 0.1612, -0.02882 ± 2.0943/ 

7 2 (0.1464, -0.5858, 0.5858); X = -0.5303,-0.03665 ± 1.1542/ 

12. (a) Pi (1.1954, -4.7817, 4.7817); X = 0.1893, -0.02191 ± 2.4007/ 

7 2 (0.1046, -0.4183, 0.4183); X = -0.9614,0.007964+ 1.0652/ 

(d) 7 : = 5.9 

13. (a,b,c) ci = 1.243 

14. (a) 7!(2.9577, -11.8310, 11.8310); X = 0.2273, -0.009796 ± 3.5812/ 

7 2 (0.04226, -0.1690,0.1690); X = -2.9053,0.09877 ± 0.9969/ 

(c) T 2 = 11.8 

15. (a) 7i(3.7668, -15.0673, 15.0673); X = 0.2324,-0.007814±4.0078Z 
P 2 (0.03318, -0.1327,0.1327); X = -3.7335,0.1083 + 0.9941/ 

(b) 7t = 23.6 
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CHAPTER 10 Section 10.1, page 583 


1 . y = — sinx 

3. y = 0 for all L;y = C 2 sinx if sinL = 0 

4. y = — tan L cos x + sin x if cos L^0;n 

5. No solution 
7. No solution 

9. y = ci cos2x + | cosx 
11. y = - fx + \x 2 


2. y = (cot V2n cos -J2x + sin ~/2x ) /->/2 


y = — tan L cos x + sin x if cos L/0;no solution if cos L = 0 


6. y = (—n sin V2x + x sin s/2n)/2 sin ~j2n 
8. y = Ci sin 2x + 1 sinx 
10. y = 1 cosx 

12. y=-lx- 1 + i(l-e 3 )x- 1 lnx+lx 2 


13. No solution 

14. A„ = [(2 n — l)/2] 2 , y„(x) = sin[(2n — l)x/2]; n — 1,2,3, . . . 

15. a„ = [(2n — l)/2] 2 , y„(x) = cos[(2n — l)x/2] ; n = 1,2,3,. . . 

16. Ao = 0, yo(x) = 1; A„ = n 2 , y„(x) = cos nx; /j = 1,2,3, . . . 

17. k„ = [(2n — l)7r/2L] 2 , y„(x) = cos[(2« — l)?rx/2L]; n = l,2,3, ... 

18. Ao = 0, yo(x) = 1; A n = (nn/L) 2 , y„(x) = cos(imx/L); n = 1,2,3,... 

19. A„ = — [(2/z — l)?r/2L] 2 , y„(x) = sin[(2n — l)^x/2L]; n = 1,2,3,... 

20. A„ = 1 + (nrr/lnL) 2 , y„(x) = xsin(H7r lnx/lnL); n = l,2,3, ... 

21. (a) u>(r) = G(R 2 — r 2 )/4/x (c) Q is reduced to 0.3164 of its original value. 

22. (a) y = k(x 4 - 2 Lx 3 + L 3 x)/24 

(b) y = k{x A - 2 Lx 3 + L 2 x 2 )/ 24 

(c) y = k(x 4 - 4 Lx 3 + 6L 2 x 2 )/24 

Section 10.2, page 592 

1. T = 2n/5 2. T = 1 

3. Not periodic 4. T = 2L 

5. T = 1 6. Not periodic 

1. T = 2 8. 7 = 4 

9. /(x) = 2L — x in L < x < 2L; /(x) = — 2L — x in —3 L < x < — 2L 

10. f(x) = x — linl<x<2; /(x) = x — 8in8<x<9 

11. /(x) = — L — x in —L < x < 0 



14. (b) /(x) = 
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11 


23. (b) f (x) = + J2 


(— 1)" — 5 


12 

9 


+ E 


24. (b) f(x) = - + J2 

n = 1 

25. (b) m = 81 

I 


n = 1 n=l 

162[(— 1)" — 1] 27(— 1)" 


4[1 - (-1)"] _ (-1)" ' 

nn 




n 3 7t 
mzx 


28. 


/7 4 7T 4 77 2 7T 2 

26. (b) m = 27 
/(f) dt may not be periodic; for example, let /(f) = 1 + cos f . 


nuJi r-Z 108( — 1)” + 54 

-r~ - y sm 

-5 -4 / n J TT- 3 


Section 10.3, page 600 

, / \ r / •. 4 sin(2 n - l)nx 

1. (a) fix) = - > 

n=l 


2. (a) /(x) = ^ - E 


2m — 1 
2 


/ 4/ „ 

3. (a) fix) = j + ffi E 

n=l 


(277 — 1) 2 7T 

cos[(2n — l)7rx/L] 


(- 1 )" 

cos(2n — l)x 4 smnx 


(2m - l) 2 


2 4 (_i)«+i 

4. (a) fix) = - + E 2 — cosn nx 

' n=l 

1 2 M (— l)” -1 

5. (a) fix) = - 4 — V — — cos(2m — l)x 

/ 7 r 1 


3 

1 

2 

«o 

T 


7T 2 — ' 2m — 1 

n = 1 


6. (a) fix) = — 4- (a„ cosnnx 4- sin77jrx); 


fl 0 


■ 

2(-l) n 

n 2 jr 2 

oo r 


b„ = 


—1/nn, 

1/nn — 4/n 3 n 3 , 


n even 
n odd 


7. (a) fix) = - j + E 


1 — cosmjt (—1)" . 

cos nx sm nx 


(b) n = 10; max|e| = 1.6025 atx = ±n 

n = 20; max|e| = 1.5867 at x = 4=jr 

n = 40; max|e| = 1.5788 at x = 4=jr 

(c) Not possible 

1 2 y 1 — cos nn 

8. (a) fix) = - 4 ^ > = cosnnx 

2 7T- 2 — ' m 2 

n=l 

(b) n = 10; max|e| = 0.02020 at x = 0, 4=1 

n = 20; max|e| = 0.01012 at x = 0, 4=1 

n — 40; max|e| = 0.005065 at x = 0, 4=1 

(c) n — 21 

2 .A (-1/+ 1 

n 


9. (a) fix) = - y ■ 

IT * 


(b) m = 10,20,40; max|e| = 1 atx 4= 1 

(c) Not possible 


1 


10. (a) fix) = - 4- J2 


n = 1 L 

(b) m = 10; lub|e| = 1.0606 asx — > 2 

n = 20; lub|e| = 1.0304 as x — »■ 2 

n = 40; lub|e| = 1.0152 as x -»• 2 

(c) Not possible 


6(1 — cosmjt) nnx 2 cosnn . nnx 

— cos 1 sm 

m 2 jt 2 2 nn 2 


nnx 

~T~ 
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11. (a) fix) = - + J2 


2 cos nn 

9 9 

n A 7r z 


2 — 2cosnn + n 2 n 2 cosnn . 


(b) n = 10; lub|e| = 0.5193 asx — > 1 

n = 20; lub|e| = 0.5099 as x — > 1 

n = 40; lub|e| = 0.5050 as x — > 1 

(c) Not possible 

n ft , 12 A (-1)” . 

12. (a) /(x) = — > — - — sinmrx 

n 3 L — ' n 3 

n = 1 

(b) n = 10; max|e| = 0.001345 atx = ±0.9735 
n = 20; max|e| = 0.0003534 atx = ±0.9864 

n = 40; max|e| = 0.00009058 at x = ±0.9931 

(c) n = 4 

13. y = (to sin /if — n sin <ot)/co {a> 2 — n 2 ), to 2 n 2 
y = (sin nt — nt cos nt)/2n 2 , of = n 2 

OO 

14. y = ^ b n (a)S\nnt — n sma)t)/a){co 2 — n 2 ), co ^ 1,2,3,... 

n = 1 

OO 

y = ^2 b n (m sin nt — n sin mt)/m{m 2 — n 2 ) ± b m (sinmt — mt cos mt) /2m 2 , a> - 


n = 1 
rtjtm 


1S -^E 


7r a) 2 — (2n — l) 2 


— - — sin(2n — 1)/ — — sinwf 
2n — 1 u> 


^14 cos(2n — l)nt — cos cut 

16. y — cos cot 4 (1 — cos cot) H > 

2a; 2 tt 2 L — ' 

Section 10.4, page 608 


jt 2 (2n — l) 2 [a> 2 — (2n — l) 2 Jr 2 ] 


1. Odd 2. Neither 

3. Odd 4. Even 

5. Even 6. Neither 

1 4 ^ 1 — cos(/ur/2) /ittx 

14./W = S + pE — 


4 {nn / 2) — sin(nn/2) nnx 

fix) = — y, 2 Sln “3“ 

n l n z 2 

n=l 

1C , w , , 1,2 ^(-l)"- 1 (2n-l)nx 

15. (a) fix) = - ± - 2^ cos ■ 


2 n 2 —‘ 2n—\ 

n = 1 


n=l 

17. (a) f{x) = 1 


r-, z, / 2 /nr\ . /X7rx 

16. (a) f(x) = > — — cos nn 4 sin — sin — — 

• J n it V hit 9 / 9 


i B , , , 4 ^ sin(2n - l)x 

18. (a) /(x) = -}_ 


n ‘ — ' 2n — 1 

n=l 


s \ r Z / 7777 Z/77r 

19. (a) f(x) = > — cos — + cos — 2cosmr | sin 

' 17 TT \ 9 9 


. ft* 

y 


N „ 1 1 sin2mrx 

20. a /(*)=--- V 

/ TT -4 ' 


2 7r ' n 

n = 1 


01 , w . , L 4L A cos[(2n 

21. (a) /(x) = - + — 2^ 


(2n - 


, w , , 2L ^ sminnxj L) 
22. (a) /(x) = — > 

TT ^ ' 17 


l)rrx/L] 
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i OO 

23. (a) fix) = j 


00 <—\y 

24. (a) f(x ) = 2 V sin nx 


~2n 

. nn 

4 

/ J27T 


nx 

n 

sin — 
2 

+ 

r 

COS — 

V 2 

- 1 )] 

cos — 
2 


25. (a) f(x) = y 


4« 2 7r 2 (l + cosnn) + 16(1 — cos nn) 


. nnx 

sin 

2 


, , „ 4 16 1 + 3 cosnn nnx 

26. (a f(x) = - + — J2 j cos — 

3 n z n z 4 

n = 1 

. 3 6 1 — cos nn nnx 

27. (b) gW = - + -2>J 2 cos— - 

2 n z n z 3 

n=l 

6 1 . rarx 

h)x) = - > - sin — - 
n z — ' 3 

n=l 

1 4cos(mr/2) + Inn sin(/wr / 2) — 4 nnx 

28. (b) g(x) = - + > 2-j cos — 

4 ■“ n z n z 2 

n=l 

, 4sin(nn/2) — 2nn cos(nn/2) . mrx 

^ = E ^ sm t- 


, 5 12 cosnn + 4 nnx 

29. (b) gix) = - — + — cos — 


h ix) = --J2 


12 z 

n=l 

1 ^ n 2 n 2 (3 + 5 cosnn) + 32(1 — cosnn) . ii.tx 


30. (b) g(x) = - + 


1 y 6n 2 n 2 i2cosnn — 5) + 324(1 — cosnn) nnx 


hix) = y 


4cosnjr + 2 + 144 cos mr + 180 


. iittx 

sm 

3 


40. (a) Extend /(x) antisymmetrically into (L,2L]; that is, so that/(2L — x) ■ 
0 < x < L. Then extend this function as an even function into (—2 L, 0). 


Section 10.5, page 618 

1 . xX" - XX = 0, T' + XT = 0 2. X" - XxX = 0, T' + XtT 

3. X" - XiX' + X) = 0, V + XT = 0 4. [p(x)X']' + Ai-(x)X = 0, 1 

5. Not separable 6. X" + (x + a)X = 0, Y" - 

7. w(x, 0 = e - 40te2 ' sin 2;rx - e” 2500 * 2 ' sin 5nx 

8. nix, t) = 2e~’ r2 ' /16 sin(jrx/2) — e _ir2</4 sin jrx + 4e _,r2 ' sin 27rx 

100 1 — cos nn „2_2,„ <n n . nnx 

9. w(x, f) = V e ' sin ■ 

n “ n 

n = 1 


160 sin(wr/2) „2 w 2 #/ iAnn nnx 

10. u{x,t) = y — , ; 2 e * /160 ° sin 

7r 2 Z — ' n z 

n = 1 


40 

TCX 

40~ 


11 ;((v t) = 122 V COS(n7T/4) ~ COS ^ nJI / 4) c -M/1600 sin n7lX 
y ’ n u 40 

12. M(x, 0 = - y ^V" 2 * 2 " 1600 sin nnx 

^ ~ l n 40 

13. t = 5, n = 16; t = 20, n = 8; t = 80, n = 4 

14. (d) t = 673.35 15. (d) t = 451.60 


—fix) for 


= 0 

r " + XT = 0 
XY = 0 
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16. (d) t = 617.17 

17. (b) t = 5, x = 33.20; t = 10, x = 31.13; t = 20, x = 28.62; t = 40, x = 25.73; 
( = 100, x = 21.95; t = 200, x = 20.31 

(e) t = 524.81 



1 — cos rut 



e -n 2 A 2 t/400 j. 


n 


(a) 35.91°C (b) 67.23°C (c) 99.96°C 

19. (a) 76.73 s (b) 152.56 s (c) 1093.36 s 

21. (a) aw xx — but, + (c — bS)w = 0 (b) S — c/b if b ^ 0 

22. X" + /x 2 X = 0, Y" + (X 2 — fi 2 )Y = 0, T' + a 2 X 2 T = 0 

23. r 2 R" + rR' + (X 2 r 2 - /j, 2 )R = 0, 0" + /x 2 0 = 0, r + a 2 ^ 2 r = 0 

Section 10.6, page 627 

1. u = 10 + |x 2. it = 30 — |x 

3. u = 0 4. u = T 

5. u = 0 6 . it = T 

7. it = 7(1 + x)/(l + L) 8. !< = r(l + L — x)/(l + L) 



* — ' »7T 

n=l 


(d) 160.29 s 

10. (a) f{x) = 2x, 0 < x < 50; /(x) = 200 — 2x, 50 < x 


< 100 


oo 



800 . tut 40 



(d) z/(50, t) — >• 10 as t — > oo; 3754 s 




c„ — [2(1 — cosrar) — « 2 7r 2 (l + cos«7r)] 

yiJrrJ 


mtx 
cos — — , 


I 0, 

l —4/ {n 2 - 1 )n. 


n odd; 
n even 


(b) linif-xx, n(x,t) = 2/^r 



(c) 200/9 


(d) 1543 s 

dC °° 

14. (a) u(x, t) = — + yc/-" V,/M 

6 ^ 
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15. 

16. 

17. 

19. 

20 . 
21 . 


2 L 


\ ' — (2/j— l) 2 jr 2 a 2 //4L 2 (2/1 1 )/TX 

(b) z/(x,f ) = > c„e ' sin 

C - = z[ m - 2 L 

oo 

(a) u{x,t) = c n 


. (2/z — l)jrx 

) sin ax 


e (2«-l) 2 ir 2 //3600 s j n (2n l)/rx 


60 


120 


r [2cos/zjr + (2« — l)7r] 


" (2« - l) 2 jr 2 1 

(c) x m increases from x = 0 and reaches x = 30 when ? = 104.4. 


(a) u(x,t) = 40 + c n e 


— (2/i— 1) 2 jt 2 7/3600 • ^/7 V)7IX 


60 


40 


" (2/7 - 1) 2 7T 2 

u(x) = 


[6 COS 777T — (2/7 — l)jr] 


m X 

T - 
a 


1 - 


§ + (L/a) - 1 
L — x 


1 


77 -|- (L / 7/) — 1 

(e) u n {x, t) — sin /x„x 

o’ 2 !/" + s(x) = 0; u(0) = 7\, v(L) = T 2 


0 < x < a, 


a < x < L, 


where f = K 2 A 2 /K 1 A 1 


w, = a 2 w xx ; w(0,t) = 0, w(L,t) = 0, u/(x,0) =/(x) — t/(x) 

22. (a) u(x) = Ti + (72 — 7Y)(x/L) + kLx/2 — kx 2 /2 

OO 

” 2 ' 2 ' sin 

20 


(b> >_£ sl „“, o,. 1<5 ° (c °;"-i) 


23. (a) d(x) = T\ + (T 2 — T\)x/L + kLx/6 — kx 3 /6L 

OO 

(b) u>(x,r) = ^c„e“" 2ll2 ' /400 sin^, 


20 


H=l 

h 3 w 3 


3mrn 3 (3 cos t/jjt — 1) + 60 cos m/r 


Section 10.7, page 640 


1. (a) 7<(x, r) 

2. (a) 7<(x, f) 

3. (a) 77 (x, /) 

4. (a) 7/(x, r) 

5. (a) u(x,t) 

6. (a) u(x,t) 

7. (a) u(x,t) 


8 1 /77T /7JTX 

— r > — sin — sin — — ( 

•n-2 /—i n 2 ? 7 


nnat 


8 1 / . 1171 . 3nrr \ . mtx nnat 

— > — sin — + sin — — sin — — cos — — 

y,i \ 4 4 L L 


32 2 + cos nn . /izrx /i/raf 

— - > sin cos 

n 3 ^ n 3 L L 

n = 1 

4 ^ sin(/7^'/2) sin(/77r/L) . nnx nnat 
— > sin cos 

71 1 

n=l 

8L ^ 1 


V 


nn 

~2 


nnx 

~L 


L 

nnat 


L 


L 


8 L 




sin(/zjr/4) + sin(3/77r/4) . nnx . nnat 
— sin — — sm ■ 


L 


L 


32 L 

mr 4 


OO 


E 

n = 1 


cos nn -\- 2 


n 


4 


. nnx . 

sin sm 

L 


nnat 

L 
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L 


L 


4 L sin(«7r/2) sm(nn/L) . imx . mrat 

8. (a) u(x,t ) = — - > — sin sin 

an 2 1 — ' n- 

n = 1 

_ . (2n — 1)7 xx (2 n — 1)7 rat 

9. u(x, t) = > c n sin — cos - 


c „ = £ J fix) sin 


n = 1 
L 


2 L 

. (2 n — l)7rx 


2L 


2L 


dx 


8 ^ 1 . (2 n — l) 7 r . (2n — l)n . (2 n — l)nx (2n — l)nat 

10. (a) u(x, t) = — > r sin ; sm 77 sin 77 cos ■ 

7 T 


7r 2n — 1 

n=l 


2L 


2L 


2L 


512 ^7 (2n — \)n + 3 coshjt (2n — V)nx (2n — l)nat 

11. (a) u(x,t) = — r > : sin cos 

w 7T 4 ^ (2u — l) 4 2L 2 L 

n = 1 

14. (b) </>(x + at) represents a wave moving in the negative x direction with speed a > 0. 

15. (a) 248 ft/s (b) 49.6 jt« rad/s (c) Frequencies increase; modes are unchanged. 

21. r 2 R" + rR' + (f 2 r 2 - ix 2 )R = 0, 0" + /x 2 0 = 0, T" + k 2 a 2 T = 0 

23. (b) a n = u^/l + (y 2 L 2 /n 2 n 2 ) (c) y = 0 

20 


24. (a) c„ = 


. «7r . 2n7r . 3rar \ 

2 sm — sm — sm — — I 


Section 10.8, page 653 

. nnx . nny 2 la 

1. (a) w(x, y) = } c n sin sinh , c„ = — 

' n n qi 


sinh(u7rh/a) 


/' 


. ft7TX 

g(x) sin dx 

a 


„ . 4a 1 sin(wr/2) . /27rj\: nny 

(b) u(x,y) = -y T -y . . ' - ^ sin sinh ’ 

n ~l n smh(nnb/a) a a 


^ . nnx . nnfb-y) 

2. u(x,y) = } c n sin sinh , c n — 

n n 


2/a 


smh.(nnb/a) 


f 


. nnx 

h(x) sin dx 

a 


3. (a) u(x,y) = sinh 


m . rarx . nny 
cmh sin 


E °° n s . nnx . , nn(b — y) 
c„ sin sinh — — , 

n n n 


2/b 


n sinh(nna/b) J 0 
(b) c( 1) = 2 


r b . mry ,,, 2 la f 

c<> - sinh(ra J 


r (2) _ _ _ 


«jr sinh(/i7ra/h) 


sinh(«7rfo/a) J 0 
2 (n 2 n 2 — 2) cos nir + 2 

« 3 jr 3 sinh(u7rfe/fl) 


. «?rx 

/z(x) sm dx 

a 


5. u(r,9) = y + ^ r "(c« cosn6 + k„ sinnd)', 


a" r 2 

Cn = 

tr Jo 


f(9)cosn9d9, k n = 


a" r 

n / 

7o 


/(0) sinnt/ 


00 2 Z* 77 

6. (a) u(r,9) = c„r"sin«0, c„ = / /((?) sin«0 rM 

f-f Jra” J 0 


(b) c„ = 


«=1 

4 1 — COS J27T 


7Tfl" 


7. w(r, 0) = c n j 


,nn/a 


. njrfl 


c„ = (2/a') 


a -nn/c f 

Jo 


. nnO 

sin 

a 


OO /> /7 

8. (a) u(x,y) =} c„e y, ‘ sm , c„ = - / /(x) sm rfx 

, a a Jo a 

n=L 

4 a 2 

(b) c n = -r-r(l - cos nn ) (c) y 0 = 6.6315 
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2/nn 


> fLJLA, ruiy 

10. (b) u(x,y) = Co + > c n cosh cos , c n = 

' b b sinh(ft7T 

n=l 
oo 

11. u{r,9) = cq + ^2 r n {c n cos n8 + k n sin/70), 


— I 

a/b) Jo 


r n7T y i 

f O') cos — - dy 
b 


Cn — 


— [ 

ra" _1 Jo 


n = 1 
2tt 


nna 

necessary condition is 

Jo 


i r 1 2 

g(9) cos n9d0, k n = — f / 

njra" -1 

2n 


g(6) sin ft# dO; 


g(0)d0 = 0. 


> . nicx mzy 

12. (a) u(x,y) = y c n sin cosh , 

« ft 

n=l 

4flsin(n7r/2) 
n n 2 n 2 cosh(nnbfa) 

n , \ , \ v' • , (2n - l)irx . (2/; - l)/ry 

13. (a) u(x,y) = > c„ sinh — sin 


2/ a 


cosh(nnb/a) 


f 


. rmx 

g(x) sin dx 

ft 


n = 1 

2/ft 


2ft 


2ft 


/ 


sinh[(2ft — l)7Tft/2fr] Jo 
32ft 2 

(b) c n = 


’ (2/z - l)ny 

f( y ) sm — rfy 


(2n — 1)% 3 sinh[(2/i — l)7r«/2ft] 

OO 

coy n7VX • i n7t y 

14. (a) u(x,y) = \~ / c n cos sinh 

2 z — ' a a 

n = 1 

2/ a 


= ~f 

ab Jo 


Co = —r \ g(x) dx , c n = - 


f 


(b) CO = - 1 + — 


30 

1 

2 


sinh (nn ft /a) 

24fl 4 * * * * 9 10 * 12 (l + cos nn) 
/7 4 7t 4 sinh(/ijrft/a) 


nnx 

g(x) cos dx 

a 


aa 4aa cos[(2/i — V)nx/a\ cosh[(2/i — l)7rz/a] 

16. (a) u(x , z) = ft 4 — > 

A n 2 " (2/i — l) 2 cosh[(2/i — l)7rft/a] 


CHAPTER 11 Section 11.1, page 670 

1. Homogeneous 2. Nonhomogeneous 

3. Nonhomogeneous 4. Homogeneous 

5. Nonhomogeneous 6. Homogeneous 

7. (a) 0 „(jc) = sin s[X n x, where -Jx n satifies = — tan \[Xn\ (ft) No 

(c) X x = 0.6204, A 2 = 2.7943 

(d) X n = (2/1 — l) 2 /4 for large zi 

8. (a) </>„(x) = cos *jxj t x, where ~/Xl satisfies Va = cot -v/A; (ft) No 

(c) X\ = 0.7402, A 2 = 11.7349 

(d) X n = (zi — l) 2 7r 2 for large /i 

9. (a) 4> n (x) = sin ,/A^x + cos x, where JXn satisfies 
(A — 1) sin y/x — 2VXcos VX = 0; (ft) No 

(c) Xi = 1.7071, A 2 = 13.4924 

(d) X n = (zi — 1) 2 7 t 2 for large « 

10. (a) For n = 1,2, 3, . . . ,<f> n (x) = sin/z„x — /x„ cos/r„x and A„ = — /z 2 , where /z„ satisfies 
/r = tan /r. 

(b) Yes; Ao = 0,</>o(x) = 1 — x 

(c) A, = -20.1907, A 2 = -59.6795 

(d) X n = —(2/i + l) 2 7r 2 /4 for large n 

12. fi(x) — e~* 2 13. /r(x) = 1/x 

14. ti(x) = e~ x 15. fi(x) = (1 - x 2 )" 1/2 
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16. X" + XX = 0, T" + cT' + (k+ Xa 2 ) T = 0 

17. (a) s(x) = e 1 (b) X n = n 2 n r 2 , </>„(x) = sinrarx; n = 1,2,3,... 

18. Positive eigenvalues X — X , ,, where */X„ satisfies *jx = = tan 3 a/XL; corresponding eigen- 
functions are <p n (x) = e -2 * sin 3^/X/x. If L = |,A. 0 = 0 is eigenvalue, 0oW — xe _2x is eigen- 
function; if L 7^ ^,7 = 0 is not eigenvalue. If L , there are no negative eigenvalues; 
if L , there is one negative eigenvalue X = — /x 2 , where /x is a root of p = = tanh3/xL; 
corresponding eigenfunction is 4>-i(x) = e -2 * sinh 3/xx. 

19. No real eigenvalues. 

20. Only eigenvalue is X = 0; eigenfunction is <p(x) = x — 1. 

21. (a) 2 sin a/X — \/X cos VX = 0 

(c) A.! = 18.2738, A 2 = 57.7075 

(d) 2 sinh ^//I — cosh JJL =0, /x = — A. 

(e) = -3.6673 

24. (a) A.,, = /x 4 , where /x„ is a root of sin /xL sinh /xL = 0, hence A.,, = ( nn/L ) 4 ; 

A.i = 97.409/L 4 , X 2 = 1558. 5/L 4 , </>„(x) = sin(nnx/L) 

(b) X n = /x 4 , where /x„ is a root of sin /xL cosh /xL — cos /xL sinh /xL = 0; 

4 . ^_. n , ,-.,-4 , sin /x„x sinh /x„L - sin /x„L sinh /x„x 

Xi = 237 .72, L , A 2 = 2496. 5/L , <p n = ; 

sinh/x„L 

(c) A„ = /x 4 , where /x„ is a root of 1 + cosh \±L cos /xL = 0; Aq = 12.362/L 4 , 

A 2 = 485.52 /L 4 

[(sin/x„x — sinh /x„x) (cos /x„L + cosh/x„L) + (sin /x„L + sinh /x„L) (cosh /x„x — cos/x„x)] 

<M*) = T7 — ~ u 

cos /x„L + cosh /x„L 

25. (c) cf> n (x) = sin a/X/x, where X n satisfies cos «JX~ n L — y^/X^L sin ~JX~ n L = 0 

(d) = 1.1597/L 2 , A 2 = 13.276/L 2 


Section 11.2, page 683 

1. <p„(x) = \/2sin(n — ^ )nx\ n = 1,2, . . . 2. </>„(x) = a/2cos(« — i)?rx; n = 1,2, .. . 

3. 0o(x) = 1, </>„(x) = V2cosnrcx ; n = 1,2,... 

VXcos-y/X/x . „ , — , — . . — 

4. c/) n (x) = = , where A„ satisfies coSa/A„ — v^« sin vX„ = 0 

(1 + sin 2 V>^) 1/2 


5. </>„ (x) = -v/2 e* sin nnx; n = l,2, ... 6. a n = 

„ 4V2(-1)"- 1 

= (2n — l) 2 7r 2 ’ M = 

9 /? 

{1 — cos[(2n — l)jr/4]j; n = l,2, ... 


2V2 


(2n — l)7r 


; n = 1,2, . . . 


9- (In — 


(2n — 1)tt 
2 a /2 sin (« — |)(;r/2) 

(« - ^) 2 ^ 2 


; « = 1 , 2 , . . . 


In Problems 10 through 13, = (1 4- sin 2 a/X/) 1/2 and cos «JX n — -Jx^ sin VX/ = 0. 

a/ 2 sin yX/ a/ 2(2 cos VX/ - 1) . 

10. a n = — ; n = 1,2, . . . 11. = ; /t = 1,2, . . . 

■\fX n 0ln h n CXn 

V2(l — cos s/X^) A/2sin(VX//2) 

12. a n = ; n = 1,2, ... 13. a n = = ; n = 1,2, . . . 


14. Not self-adjoint 
16. Not self-adjoint 
18. Self-adjoint 

21. (a) If a 2 = 0 or b 2 = 0, then the corresponding boundary term is missing. 


*JX n ot n 
15. Self-adjoint 
17. Self-adjoint 
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25. (a) Ai = jt 2 * /L 2 ; (j> i(x) = sm(nx/L) 

(b) Ai = (4.4934) 2 /L 2 ; cj>\ (x) = sin ./An x — -s/Xi x cos -/Ai L 

(c) X\ = (2ti) 2 /L 2 \ <j>i(x) = 1 — cos(2 ttx/L) 

26. Xi = ti 2 /4L 2 ', <pi(x) = 1 — cos(jtx/2L) 

27. (a) X" - (v/D)X' + AX = 0, X(0) = 0, X'(L) =0; T’ + ADF = 0 


(e) c(x, f) = E a « e ; " D, e M / 2D sin /x„x, where A„ = /x 2 + ( v 2 /4D 2 ); 

H = 1 

47) /x 2 f e~ vx/2D f(x) sin fi n xdx 

a n = 72 

(2L7)/x 2 + ii sin 2 fi„L) 

28. (a) u t + vu x =Du xx , u((), t) = 0, u x (L,t)=0, u(x, 0) = — Co 

OO 

(b) z/(x, f) = ^ b n e~ XnDt e vxl2D sin /x„x, where A„ = /x 2 + (i> 2 /47) 2 ); 


bn — 


8co7) 2 /x 2 (2D j [i„e vL/2D cos /x n L + ve vL / 2D sin /x„L — 27) /x„) 


(v 2 + 47) 2 /x 2 )(2L7)/x 2 + v sin 2 /x„L) 

Section 11.3, page 696 

(— 1 )"+! sin mtx 


i- y = 2E- 


(n 2 jr 2 — 2)nxr 


2. y = 2^ 


(— 1)” +1 sin(iz — \)nx 


[(« - i) 2 3r 2 - 2 ] (n - \) 2 n 2 


1 


3 -y = -7- 4 E 


cos(2« — l)jrx 


n=l 


[(2iz — 1) 2 tt 2 — 2](2n — l) 2 jr 2 


„ „ ^ (2 cos,/)/- l)cos,/A/x 

4. y = 2> ^ — — - 

“ A„(A„ - 2)(1 + sin- v%) 

6-9. For each problem the solution is 


5-y = §E 


sin(mr/2) sin mtx 
(n 2 n 2 — 2 )n 2 iz 2 


00 c n r 1 

y = — - — <Pn(x), c n = f(x)ct> n (x)d. x, /x / 

“ K - M Jo 

where 0„(x) is given in Problems 1-4, respectively, in Section 11.2, and A„ is the corresponding 
eigenvalue. In Problem 8 summation starts at n = 0. 


1 1 1 / 1\ 

10. a = — , y = — — cos nx 4 I x I + c sin nx 

2 2n 2 n 2 \ 2/ 

11. No solution 12. a is arbitrary, y = ccosnx + a/n 2 

13. a = 0, y = c sin7rx — (x/2n) sin nx 17. u(x) = a + (b — a)x 

18. v(x) = 1 — |x 

at T 4ci /4ci 1 \ 2 ,, /l . 7TX 

19. u(x,t) = V2 -- + (-+ )e~ sm^ 


C/7 


-v^E 

n=2 

4v / 2t-l)" +1 


4c„ 


(2n — l) 2 7r 2 


(2n — l) 2 7r 2 

, n = l,2,... 


[1 - e -<"-i/2> 2 * 2 f] si n (n - 1 ) jix , 
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20. u(x , t ) = \fl y 


x„-l 


(e“' - e _A ”0 + a„e 


i 


C/7 


n=l 

\/2sin v^h 


cos 


(1 + sin 2 ■JXn) 1 ' 2 ’ 

V2(l — COS 


V%"(1 + sin 2 v^) 1/2 ’ " X„(l + sin 2 V^) 1/2 ’ 

and X n satisfies cos -,/X^ — a/X^ sin «JXn = 0. 

21. w(x,f) = 8 y sm <» 7r / 2) (1 _ g-^^sinn^ 


c„(e ' — e 


') sin(« — i)7rx 


(n — \) 2 n 2 — 1 


22 . id*, t) = V 2 y 

«=i v “ — 2 

_ 2V2(2n - l)n + 4\/2(— 1)" 

(2« — 1) 2 tt 2 

23. (a) r(x)w, = \p(x)u)x\x — q(x)w, io(0,f) = 0, u>(l,t) = 0, u>(x, 0) = /(x) — u(x) 

, 4 e -(2K-i) 2 3r 2 t s j n (2n — 1 )ttx 

24. w(x, f) = x 2 — 2x + 1 4 — > 

7T ^ ' 


2n - 1 


25. u(x, t) = — cosjrx + e te2(/4 cos(3?rx/2) 


l 


31-34. In all cases solution is y = / G(x,s)f(s) ds , where G(x,s ) is given below. 


31. G(x,s) = 

32. G(x,s) = 

33. G(x,s) = 

34. G(x,s) = 


1— x, 0 < s < x 

1 — s, x < s < 1 

s(2 — x)/2, 0 < s < x 

x(2 — s)/2, x < j < 1 

cosssin(l — x)/cosl, 0 < j < x 

sin(l — s) cosx/cosl, x < s < 1 

s, 0 < s < x 

X, X < S < 1 


Section 11.4, page 707 

00 Cn pi / pi 

1. y = y — / 0 (a/X^x), c„= f{x)J Q (. s /X n x)dx / / x/q(v^x) dx, 

~l X„ ~ M do / do 

a/X/ satisfies J 0 (^/X) — 0. 

OO 

2. (c) y = + V -^-J 0 (VX/x); 

M X„ - n 

Co = 2 I f(x)dx; c„ = / f(x)Jo(^/X^x)dx / / xj\{\/X n x') dx, n = 1,2,...; 
Jo Jo / Jo 

•v/X^ satisfies J' a G/X) = 0. 

3. (d) a n = 1 xJi c (- s /X n x)f(x) dx j J xJ^ c (^/X n x)dx 

°° C n f 1 / /•! 

(e) y = y — JaJa/X^x), c n = f(x)J k ( k /X n x)dx / / xJ\{J\ n x)dx 

~l x n — M Jo / Jo 

4. (b) 1 = V - — — p 2 »-i(x), c„= / f(x)P ln -\{x)dx / / p\ n _ x (x)dx 

n _ i M i/0 / JQ 
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Section 11.5, page 713 

1. (b) «($,2)=/($ + l), k($, 0) = 0, 0 < £ < 2 

«( 0 , rf) = u(2, rj) = 0 , 0 < ij < 2 


1 /*! / /*! 

2. u(r,t) = "S^ k n J 0 (X n r) sin X n at, k n =- — / rJ 0 {X n r)g{r) dr / / rJ^(X n r)dr 

“ / Jo 

3. Superpose the solution of Problem 2 and the solution [Eq. (21)] of the exam 
text. 

6. M(r,z) = ^c„e _A "V 0 (l.„r), c„ = J rJ 0 (X„r)f(r) dr j J rJ^(X n r)dr, 
and X n satisfies /of/) = 0. 

OO 

7. (b) r(r,6>) = ±c 0 /o(£r) + E'- ( kr)(b m sin md + c„, cosmd), 

m = 1 

1 


— 


nj m ( kc ) 


1 


3. = f 

Jo 


7tJ m (kc) 


f 

r 


f(9)sinmddd\ m= 1,2,... 
/((?) cosmd dd\ m = 0,1,2 ,... 


8 . c„ = / rf(r)J 0 (X n r)dr 


rJl(X n r) dr 


10. u(p,s) = y, c n p n P n {s), where c„ = J f(arccos s)P n (s) ds //: P 2 n(S)ds\ 


P n is the wth Legendre polynomial and s = cos (p. 


Section 11.6, page 722 

1. n = 21 2. (a) f>„, = (— l)'" +1 \/2/rw7r 

3. (a) = 2\/2(l — cosmn)/m 3 n 3 (c) n = 1 

7. (a) /o(x) = 1 (b) /i(x) = \/3(l - 2 jc) (c) f 2 (x) = \/5(-l + 6.r - 6.r 2 ) 

(d) go(jc) = 1, gi(.r) = 2x - 1, g 2 (x) = 6x 2 - 6x + 1 

8. P 0 (x) = 1, Pi(x) = x , P 2 (x) = (3x 2 — l)/2, P3 (.r) = (5.r 3 — 3x)/2 


(c) n = 20 


INDEX 


A 

Abel, Niels Henrik, 153, 228 
Abel’s formula, 153, 172, 174. 
225, 240 

for systems of equations, 388 
Acceleration of convergence, 
602 

Adams, John Couch, 464 
Adams-Bashforth formula 
fourth order, 465 
second order, 465 
Adams-Moulton formula 
fourth order, 466 
second order, 466 
Adaptive numerical method, 
451,457,462 
Adjoint 

differential equation, 156 
matrix, 364 
Advection, 686 
Airy, George Biddell, 255 
Airy equation, 157, 255-259, 
264,278,303,322 
Amplitude modulation, 213 
Amplitude, 

of simple harmonic motion, 
196 

Analytic function, 246, 262 
Angular momentum 
principle of, 26, 500 
Annihilators, 

method of, 237-238 


Aquifer 

flow in, 656, 686 
Asymptotic stability, see 
Stability 

Augmented matrix, 369, 374 
Autonomous 
equation, 78 
system, 498 

B 

Backward differentiation 
formulas, 468-469 
Backward Euler formula, 
446-448 

Basin of attraction, 503, 515, 
527,549-551 
Beat, 213, 337-339 
Bendixson, Ivar Otto, 558 
Bernoulli, Daniel, 27, 91, 92, 611, 
631 

Bernoulli, Jakob, 27, 67, 77, 352 
Bernoulli, Johann, 27, 67, 268 
Bernoulli equation, 77 
Bessel, Friedrich Wilhelm, 292 
Bessel equation of order: 
k, 708 

nu, 156, 157, 250, 276, 278, 292, 
302, 671, 703 
one, 283, 298-301, 303 
one-half, 296-298, 302 
zero, 283, 292-296, 302, 322, 
656,703,711 


Bessel functions, 27 
/ 0 W, 283, 293, 296, 302, 321, 
322,704,707,708,711, 

715 

asymptotic approximation 
to, 296 

Laplace transform of, 321 
zeros of, 303, 704, 712 
AG), 284, 299,302 
298 

J\/2(x), 298 

YoG), 295, 296, 704, 711 
asymptotic approximation 
to, 296 
Yi G), 301 

orthogonality of, 304, 707, 708 
Bessel inequality, 724 
Bessel series expansion, 707, 712 
Bifurcation diagram, 93 
Bifurcation point, 92-94, 127, 
508, 532-533, 564-566, 568 
Hopf, 564 
pitchfork, 93 
saddle-node, 93 
transcritical, 94 
Boltzmann, Ludwig, 62 
Boundary conditions, 578 
for elastic string, 632, 641 
for heat conduction equation, 
613, 621, 624, 659 
for Laplace’s equation, 647 
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nonhomogeneous, 621-623, 
695 

periodic, 683, 721 
separated, 674 
Boundary layer, 475 
Boundary value problems: 
heat conduction equation, 
611-631,657-660, 
692-695 

homogeneous, 578, 673-687 
Laplace’s equation, 646-656 
nonhomogeneous, 578, 
687-702 

self-adjoint, 681-682, 706 
singular, 702-709 
Sturm-Liouville, 673-681 
two-point, 577-584, 667 
wave equation, 631-646, 
661-663,698,710-712 
see also Homogeneous 
boundary value 
problems, 
Nonhomogeneous 
boundary value problems 
Brachistochrone, 27, 67 
Brusselator, 565 
Buckling of elastic column, 685 

C 

Capacitance, 201 
Cardano, Girolamo, 228 
Cauchy-Euler equation, 268 
Cayley, Arthur, 364 
Center, 403, 493, 508, 513, 519 
Change of independent 
variable, 164-166, 303 
for Euler equation, 165, 271 
Chaotic solution 
of logistic difference 
equation, 128, 132 
of Lorenz equations, 571 
Characteristic equation, 140, 
226, 316 

complex roots, 157, 228 
repeated, 230 

real and equal roots, 166, 230 
real and unequal roots, 140, 
227 


Characteristic polynomial, 226, 
316 

Chebyshev, Pafnuty L., 265, 544 
Chebyshev equation, 265, 283, 
671,709 

Chebyshev polynomials, 265, 

709 

Chemical reactions, 94 
Collocation 
method of, 717 
Competing species, 520-533 
Complementary solution, 176 
Complete set of functions, 719 
Complex exponential function, 
158-159 

Compound interest, 54-56 
Computer use in differential 
equations, 23 
Conjugate matrix, 364 
Continuous spectrum, 706 
Convergence 

of an improper integral, 305 
of a numerical 

approximation, 448 
of a power series, 244 
Convergence in mean, 719 
Converging solutions, 4, 13, 108 
Convolution integral, 191, 
345-353 

Laplace transform of, 345-348 
Cosine series, 604, 680 
Cramer, Gabriel, 240 
Cramer’s rule, 240 
Critical amplitude, 86 
Critical damping, 199 
Critical point: 

approached by trajectory, 493 
center of linear system, 403, 
493,508,519 
definition of, 486, 498 
for first order equation, 80 
improper node of linear 
system, 425, 490 
isolated, 509 

node of linear system, 395, 
404,487 

nonisolated, 496 


proper node of linear system, 
488 

saddle point of linear system, 
393,404,488 

spiral point of linear system, 
403,404,491 

stability of, see Stability of 
critical point 
Cycloid, 68, 353 

D 

D'Alembert, Jean, 166, 611, 631, 
642 

Dal Ferro, Scipione, 228 
Damping force, 193, 500 
Decay, radioactive, 17 
Degenerate node, see Improper 
node 

Diagonalization 

of homogeneous systems, 
419-420 

of matrices, 417-419 
of nonhomogeneous systems, 
432-434 

Difference equation, 121-132 
equilibrium solution of, 122 
first order, 121-132 
initial condition for, 121 
iteration of, 122 
linear, 121-124 
logistic, 124-132 
chaotic solutions of, 128, 

132 

nonlinear, 121, 124-132 
solution of, 121 

Differential operator, 145, 674 
Diffusion equation 

see also Heat conduction 
equation 

Dimensionless variables, 619, 
641 

Dirac, Paul A. M., 340 
Dirac delta function, 340, 687, 
702 

Laplace transform of, 340-341 
Direction field, 

for first order equations, 3, 5 
for systems, 390 
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Index 


Dirichlet, Peter Gustav Lejeune, 
647, 678 

Dirichlet problem 
for circle, 651-654 
for rectangle, 648-650, 
653-654 
for sector, 654 
for semi-infinite strip, 654 
for semicircle, 654 
Discontinuous coefficients, 77 
Discontinuous forcing function, 
77,331-339 
Dispersion 

mechanical, 686 
Dispersive waves, 645 
Divergence: 

of an improper integral, 305 
of a power series, 244 
Diverging solutions, 6, 11, 109 
Drag force, 2, 9 
Duffing equation, 507 
Duffing, Georg, 507 

E 

Eigenfunctions, 581, 667 
normalized, 678 
orthogonality of, 677, 706 
series of, 680-681, 688, 
706-707, 719 
Eigenvalues 

of linear homogeneous 
boundary value 
problems, 581, 667 
of Sturm-Liouville problem 
existence, 676 
real, 582, 584, 676 
simple, 677 
when positive, 684 
Eigenvalues of matrix, 379-383 
algebraic multiplicity, 380 
geometric multiplicity, 380 
real, 382 
simple, 380 

Eigenvectors of matrix, 379-383 
generalized, 426 
linear independence of, 381 
normalized, 380 
orthogonality of, 382 


Elastic bar 

longitudinal vibrations of, 673, 
699 

transverse vibrations of, 672 
Elastic column 
buckling of, 685 
Elastic membrane 

vibrations of, 644, 710-712 
Elastic string: 

boundary value problems for, 
631-646 

derivation of wave equation, 
661-663 

free at one end, 641 
general problem, 638-640 
infinite length, 642-644 
justification of solution, 
637-638, 644 

natural frequencies of, 635 
natural modes of, 635 
propagation of discontinuities 
in initial data, 638 
wavelength of, 635 
Electrical networks, 18, 201-202, 
319, 356, 361-362 
Elliptic integral, 520 
Environmental carrying 
capacity, 81 
Epidemics, 91-92 
Equidimensional equation, 268 
Equilibrium solution, 3, 6, 13, 80, 
122, 486 

see also Critical point 
Error: 

for Adams-Bashforth 
formula, 465 

for Adams-Moulton formula, 
466 

effect of step size, 449, 
470-472 

for Euler method, 449-452 
for Fourier series, 591-592, 

599 

global truncation, 448, 450, 
410-412 

for improved Euler method, 
455,458 

local truncation, 448 


mean square, 717 
for modified Euler method, 
459 

round-off, 448, 453, 470-472 
for Runge-Kutta method, 460 
Escape velocity, 58-59 
Euler, Leonhard, 102, 610, 611, 
631,646 

Euler equation, 165, 174, 
268-652 

change of independent 
variable, 165, 271 
Euler formula for exp(if), 158, 
231 

Euler method, 101-111, 

443-453, 481 
convergence of, 111 
global truncation error, 450, 
452 

local truncation error, 

449-451 

Euler, Leonhard, 27 
Euler-Fourier formulas, 

587-588 

Euler-Mascheroni constant, 295 
Even function, 603 
Even periodic extension, 606 
Exact equations, 94-98 
necessary and sufficient 

condition for existence of 
solutions, 28, 96 
for second order equations, 
156 

Exchange of stability, 94, 127 
Existence and uniqueness 
theorems, 23 
for first order equations, 
68-72,111-121 
proof of, 112-118 
for nth order linear equations, 
220 

for second order linear 
equations, 146 
for series solution of second 
order linear equations, 
262,289 

for systems of first order 
equations, 358 
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Exponential growth, 78 
Exponential matrix, 416-417, 
421 

Exponential order 
functions of, 309 
Exponents at the singularity, 
280, 285 

F 

Falling object problem, 2—4, 
13-15 

Fehlberg, Erwin, 463 
Feigenbaum, Mitchell, 132 
Feigenbaum number, 132 
Ferrari, Ludovico, 228 
First order ordinary differential 
equations: 

applications of, 50-68, 78-94 
Bernoulli, 77 
direction field, 3, 5 
exact, 94-98 

existence and uniqueness 

theorems, 68-72, 111-121 
proof of, 112-118 
general solution of, 12, 32, 36, 
73 

graphical construction of 
integral curves, 74 
homogeneous, 27, 49-50 
implicit solutions, 74 
integrating factor for, 32, 36, 
98-99 

interval of definition, 37, 45, 
47, 72-73 
linear, 27, 31-41 
nonlinear, 68-77 
numerical solution of, see 
Numerical methods 
separable, 42-50 
series solution of, 266 
systems of, see Systems 
Fitzhugh, Richard, 566 
Fitzhugh-Nagumo equations, 
566 

Fourier, Joseph, 584, 611 
Fourier coefficients, 718 


Fourier series, 584-611 

acceleration of convergence, 
602 

choice of series, 606-607 
convergence of, 596, 602 
convergence of partial sums, 
591,598 

cosine series, 604, 680 
error, 591-592, 599 
Euler-Fourier formulas, 
587-588 

Gibbs phenomenon, 599, 606 
integration of, 587 
orthogonality of sines and 
cosines, 586-587 
Parseval equation, 601, 610, 
645, 724 

periodicity of sines and 
cosines, 585-586 
sawtooth wave, 605, 610 
sine series, 605, 679 
specialized kinds, 610 
square wave, 597, 609 
triangular wave, 588, 591, 609 
Fredholm, Erik Ivar, 690 
Fredholm alternative theorem, 
690 

Frequency, natural 

of simple harmonic motion, 
195 

of vibrating string, 635 
Frobenius, Ferdinand Georg, 
279, 364, 485 
Frobenius 

method of, 279 

Fuchs, Immanuel Lazarus, 263, 
485 

Fundamental matrix, 413-422, 
426 

Fundamental modes of 
vibration, 408 

Fundamental set of solutions, 
150,151,221,387,388 
Fundamental solutions 

of heat conduction equation, 
615,625 


of Laplace’s equation, 649, 

652 

of wave equation, 634, 639 
Fundamental theorem of 
algebra, 226 

G 

Galois, Evariste, 228 
Gamma function, 311 
Gauss, Carl Friedrich, 226, 368 
Gaussian elimination, 368 
Gear, C. William, 468 
General solution of linear 
equations: 

first order, 12, 32, 36, 73 
nth order, 221 
second order, 141, 150, 161, 
168, 175 

systems of first order 
equations, 387 
Generalized function, 340 
Gibbs, Josiah Willard, 599 
Gibbs phenomenon, 599, 606 
Global asymptotic stability, 516 
Gompertz, Benjamin, 89 
Gompertz equation, 89 
Graphical construction of 
integral curves, 74 
Gravity, 2, 58 
Green, George, 699 
Green’s function, 699-702 

H 

Half-life, 17, 61 
Harvesting a renewable 
resource, 90-91 
Heat conduction equation, 19, 
612, 658 

bar with insulated ends, 
623-627 

boundary conditions, 613, 621, 
624, 659 

derivation of, 657-660 
with external heat source, 630 
fundamental solutions of, 615, 
625 

nonhomogeneous boundary 
conditions, 621-623 
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nonhomogeneous source 
term, 660, 692-695 
in polar coordinates, 620 
smoothing of discontinuities 
in initial conditions, 621 
solution of fundamental 
problem, 613-616 
steady-state solution, 622 
transient solution, 622 
Heaviside, Oliver, 308 
Heaviside function, 323 
von Helmholtz, Hermann, 715 
Helmholtz equation, 715 
Hereditary systems, 346 
Hermite, Charles, 260, 364 
Hermite equation, 260, 278, 671 
Hermite polynomials, 260 
Hermitian matrix, 260, 382, 397, 
417,418,423,681,699 
Heun, Karl, 454 
History of differential 
equations, 26-29 
Hodgkin, Alan L., 532 
Hodgkin-Huxley equations, 
532,566 

Homogeneous algebraic 
equations, 374 

Homogeneous boundary value 
problems, 578, 667 
eigenfunctions, 581, 667 
eigenvalues, 581, 667 
singular Sturm-Liouville, 
702-709 

Sturm-Liouville, 673-687 
Homogeneous differential 
equations, 

with constant coefficients, 28, 
137-145, 157-174, 
226-234 

systems, 390-432 
definition of, 138, 359 
general theory of, 145-157, 
220-223, 385-390 
Homogeneous first order 

differential equations, 27, 
49-50 

Hooke, Robert, 192 
Hooke’s law, 192 


Hopf, Eberhard, 564 
Hopf bifurcation, 564 
Huxley, Andrew E, 532 
Huygens, Christian, 352 
Hypergeometric equation, 291 

I 

Identity matrix, 367 
Implicit numerical method, 447, 
466, 478 

Implicit solutions, 74 
Improper integrals, 305-307 
comparison theorem for, 307 
Improper node, 425, 490, 513 
Improved Euler method, 
454-459 

Impulse functions, 339-345 
Impulse of force, 339 
Impulse response, 350 
Indicial equation, 280, 285, 289 
Inductance, 201 
Initial conditions, 12, 75, 121, 

138. 220. 358 

propagation of discontinuities 
for wave equation, 638 
smoothing of discontinuities 
for heat conduction 
equation, 621 

Initial value problem, 12, 138, 

220. 358 

Laplace transform of, 312-320 
initial value problem, 44 
Inner product: 
for functions, 586, 675 
for vectors, 367 
Instability: 

of critical point, 83, 122, 124, 
494,499,501,547 
of numerical method. 473-478 
of periodic orbit, 557 
see also Diverging solutions 
Integral curves, 12, 74 
Integral equation, 112 
Volterra, 351 

Laplace transform of, 351 
Integral transform, 307 
Integrating factor, 28, 32, 36, 
98-99, 156 


Integro-differential equation, 
352 

Laplace transform of, 352 
Interval of convergence, 244 
Inverse Laplace transform, 316 
as linear operator, 318 
uniqueness of, 316 
Inverse matrix, 368-370 
Invertible matrix, 368 
Irregular singular point, 274, 
291-292 
Iteration 

of difference equation, 122 
method of, 113 
see also Successive 
approximations 

J 

Jacobi, Carl Gustav Jacob, 511 
Jacobian matrix, 511, 568 
Jordan, Camille, 364, 427 
Jordan form of matrix, 426^128, 
430,431 

Jump discontinuity, 306, 597, 638 

K 

Kernel, 

of convolution, 191 
of integral transform, 308 
Kirchhoff, Gustav, 201 
Kirchhoff’ s laws, 201, 361 
Kronecker, Leopold, 678 
Kronecker delta, 678 
Kutta, M. Wilhelm, 460 

L 

Lagrange, Joseph-Louis, 28, 186, 
584,611,631 

Lagrange’s identity, 674, 705 
Laguerre, Edmond Nicolas, 283 
Laguerre equation, 283, 671 
Laplace, Pierre-Simon de, 28, 
308, 646 

Laplace transform, 305-353 
of convolution, 345-348 
definition of, 308 
of derivatives, 312-314 
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of Dirac delta function, 
340-341 

existence of, 308 
of initial value problems, 
312-320 

of integral equations, 351 
of integro-differential 
equations, 352 
inverse of, 316 
as linear operator, 310 
of periodic functions, 330 
of rectified sine wave, 331 
of sawtooth wave, 331 
of square wave, 330 
for systems of equations, 
438-439 
table of, 318 

translation formula, 325, 328 
of unit step function, 325 
Laplace’s equation, 28, 646-656, 
713 

boundary conditions, 647 
in cylindrical coordinates, 656, 
714 

Dirichlet problem, 647 
for circle, 651-654 
for rectangle, 648-650, 
653-654 
for sector, 654 
semi-infinite strip, 654 
for semicircle, 654 
fundamental solutions of, 649, 
652 

mixed problem, 655 
Neumann problem, 647 
for circle, 655 
for rectangle, 655 
in polar coordinates, 651 
in spherical coordinates, 715 
Legendre, Adrien Marie, 153, 
266 

Legendre equation of order 
alpha, 156,157,250, 264, 
266-268, 274, 278, 282, 322, 
703,709,716 

Legendre polynomials, 267, 709, 
723 


Leibniz, Gottfried Wilhelm, 26, 
67, 77, 352, 609 
Length, of a vector, 367 
L'Hospital, Marquis de, 67 
Liapunov, Alexandr M., 544 
Liapunov function, 547 
Liapunov theorems, 546 
Liapunov’s second method, 
543-554 

Libby, Willard F., 61 
Lienard, Alfred-Marie, 520 
Lienard equation, 520, 552 
Limit cycle, 557 
Linear dependence and 
independence, 
of functions, 221, 226 
of vector functions, 378 
of vectors, 377-378 
Linear operator, 238, 310, 318 
Linear ordinary differential 
equations: 

adjoint equation, 156 
change of independent 
variable, 164-166,271, 
303 

characteristic equation, 140, 
226,316 

complex roots, 157, 228 
complex roots, repeated, 
230 

real and equal roots, 166, 
230 

real and unequal roots, 140, 
227 

complementary solution, 176 
definition of, 21, 137 
Euler equation, 165, 174, 268 
exact, 156 

existence and uniqueness 
theorems, 68, 146, 220, 
262,289,359 
first order, 27, 31^41 
fundamental set of solutions, 
150, 151,221,387,388 
general solution of, 12, 28, 32, 
36,73,141,150, 161,168, 
175,221,387 


homogeneous equation with 
constant coefficients, 28, 
137-145, 157-174, 
226-234 

integrating factor, 28, 32, 36, 
156 

interval of definition, 37, 69 
nonhomogeneous equation, 
138,174-191,223, 
234-242,432-441 
ordinary point, 250, 262, 266, 
277 

particular solution, 176, 224 
reduction of order, 170-171, 
224 

self-adjoint, 157 
series solution of, see Series 
solution 

singular point, 250, 262, 
268-277 

systems of, see Systems 
undetermined coefficients, 16, 
176-185,234-238 
variation of parameters, 28, 
41,185-191,239-242 
Linearization, 

of a nonlinear equation, 22 
of a nonlinear system, 511 
Liouville, Joseph, 113, 674 
Lipschitz, Rudolf, 120 
Lipschitz condition, 120 
Locally linear systems, 508-520 
Logarithmic decrement, 203 
Logistic difference equation, 
124-129 

Logistic differential equation, 
79-83,124,521,539 
Logistic growth, 79-83, 86-88 
Lorenz, Edward N., 567 
Lorenz equations, 566-574 
Lotka, Alfred J., 534 
Lotka-Volterra equations, 20, 
533-543 

M 

Magnitude, of a vector, 367 
Malthus, Thomas, 78 
Mascheroni, Lorenzo, 295 
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Mathematical model, 2, 50-52 
analysis of, 51 

comparison with experiment, 
51 

construction of, 7, 15, 51 
Matrices, 364-385 
addition of, 365 
adjoint, 364 
augmented, 369, 374 
conjugate, 364 
diagonalizable, 417-419 
eigenvalues of, 379-383, 580 
eigenvectors of, 379-383, 580 
equality of, 365 
exponential, 416^117, 421 
fundamental, 413-422, 426 
Gaussian elimination, 368 
Hermitian, 260, 382, 397, 417, 
418,423,681,699 
identity, 367 
inverse, 368-370 
invertible, 368 

Jordan form of, 426-428, 430, 
431 

multiplication by a number, 
365 

multiplication of, 365-366 
noninvertible, 368 
nonsingular, 368 
row reduction of, 368 
self-adjoint, 382 
similar, 418 
singular, 368 
subtraction of, 365 
transpose, 364 
zero, 365 

Matrix functions, 370-371 
Mean convergence, 719 
Mean square error, 717 
Millikan, Robert A., 66 
Mixing problems, 52-53, 56-58 
Mode 

natural (of vibrating string), 
635 

Modified Euler formula, 459 
Moulton, Forest Ray, 465 


Multiplicity of eigenvalue 
algebraic, 380 
geometric, 380 
Multistep method, 464-470 

N 

Nagumo, Jin-Ichi, 566 
Negative definite function, 546 
Negative semidefinite function, 
546 

Neumann, Karl Gottfried, 647 
Neumann problem, 647 
for circle, 655 
for rectangle, 655 
Newton, Isaac, 67 
Newton’s law: 

of cooling, 8, 17, 62, 63 
of motion, 2, 192, 360, 661 
Newton, Isaac, 26 
Node, 395, 405, 487, 509, 513 
see also Improper node. 
Proper node 

Nonhomogeneous algebraic 
equations, 374 

Nonhomogeneous boundary 
value problems, 578, 
687-702 

Fredholm alternative, 690 
solution 

by eigenfunction 

expansion, 687-699 
by Green’s function, 
699-702 

Nonhomogeneous linear 

differential equations, 138, 
174-191,223,234-242, 
349-350, 359, 432^141 
Noninvertible matrix, 368 
Nonlinear ordinary differential 
equations: 

autonomous systems, 497-574 
definition of, 21, 138 
existence and uniqueness 
theorems, 69, 112, 358 
first order, 68-75 
methods of solving, 42-50, 
94-101 

linearization, 22, 511 


periodic solutions of, 533-543, 
554-566 

y"=f{t,y'\ 134 
y"=f(y,y'), 134 
Nonsingular matrix, 368 
Normalization condition, 678 
Nullcline, 528 

Numerical dependence, All 
Numerical methods, 443^183 
Adams-Bashforth formula, 
465 

Adams-Moulton formula, 466 
adaptive, 451, 457, 463 
backward differentiation 
formulas, 468-469 
backward Euler formula, 
446-448 

comparison of, 469 
convergence of, 448 
effect of step size, 449, 
470-472 

errors in, see Error 
Euler, 101-111, 443-453, 481 
Heun, 454 

improved Euler, 454-459 
modified Euler, 459 
multistep, 464-470 
one-step, 464 

predictor-corrector, 466, 483 
Runge-Kutta, 459-464, 481 
stability of, 473^178 
for stiff equations, 475-476 
for systems of first order 
equations, 480-483 
vertical asymptotes, 472-473 
Numerical stability, 473—478 

O 

Odd function, 603 
Odd periodic extension, 607 
One-step method, 464 
Orbital stability, 557 
Order of differential equation, 
20 

Ordinary differential equation 
definition of, 19 
Ordinary point, 250, 262, 266 
at infinity, 277 
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Orthogonality 

of Bessel functions, 304, 707, 

708 

of Chebyshev polynomials, 

709 

of eigenfunctions of 
Sturm-Liouville 
problems, 677, 706 
of functions, 586 
of Legendre polynomials, 267, 
709, 723 

of sine and cosine functions, 
586-587 
of vectors, 367 
Orthonormal set, 678, 716 
Overdamped motion, 199 

P 

Parseval, Marc- Antoine, 601 
Parseval equation, 601, 610, 645, 
724 

Partial differential equation: 
definition of, 19 
see also Heat conduction 
equation, Laplace’s 
equation, Wave equation 
Partial fraction expansion, 315, 
318,319,323 

Particular solution, 176, 224 
Pendulum equation: 
generalized nonlinear 
undamped, 552 
linear undamped, 21 
nonlinear damped, 499-501, 
510,512-516,518 
nonlinear undamped, 21, 25, 
26,507,517-518,544-545, 
548-549, 552 
period, 519 
Period 

of nonlinear undamped 
pendulum, 519 
of simple harmonic motion, 
195 

Periodic boundary conditions, 
683, 721 


Periodic forcing terms, 600 
Periodic functions, 585 
derivative of, 594 
fundamental period, 585 
integral of, 594 
Laplace transform of, 330 
linear combination of, 585 
product of, 585 
sine and cosine, 586 
Periodic solutions of 

autonomous systems, 
533-543, 554-566 
Periodicity of sine and cosine 
functions, 585-586 
Phase line, 80 
Phase plane, 205, 390, 486 
Phase plot, 205, 217 
Phase portrait, 391, 486 
Phase, of simple harmonic 
motion, 196 

Picard, Charles-Emile, 113 
Picard 

method of, 113 

Piecewise continuous function, 
306, 596 

Pitchfork bifurcation, 93 
Poincare, Henri, 485, 558 
Poincare-Bendixson Theorem, 
558 

Poiseuille, Jean Louis Marie, 583 
Poiseuille flow, 583 
Population dynamics, 78-88 
Positive definite function, 546 
Positive semidefinite function, 
546 

Potential equation, 646-656 
see also Laplace's equation 
Power series, properties of, 
243-247 

Predator-prey equations, 20, 
533-543 

Predictor-corrector method, 
466, 483 

Proper node, 488, 513 

Q 

Quasi frequency, 198 
Quasi period, 198 


R 

Rossler equations, 574 
Rossler, Otto E., 574 
Radioactive decay, 17, 61 
Radiocarbon dating, 61 
Radius of convergence, 244, 263 
Ramp loading, 334 
Rate function, 5 
Rate of growth (decline), 5, 78 
intrinsic, 79 
Rayleigh, Lord, 564 
Rayleigh equation, 564 
Rectified sine wave, 331 
Recurrence relation, 252, 279, 
280, 285 

Reduction of order, 170-171, 
224 

Reduction to systems of 
equations, 356 

Region of asymptotic stability, 
503 

see also Basin of attraction 
Regular singular point, 268-277 
at infinity, 277 
Resistance, electric, 201 
see also Damping force 
Resonance, 209, 215, 337-339 
Riccati, Jacopo Francesco, 133 
Riccati equation, 133 
Rodrigues, Olinde, 267 
Rodrigues’ formula, 267 
Rosenzweig-MacArthur 
equations, 542, 565 
Round-off error, 448, 453, 
470-472 

Row reduction, 368 
Runge, Carl David, 460 
Runge-Kutta method, 459-464, 
481 

S 

Saddle point, 393, 405, 488, 513 
Saddle-node bifurcation, 93 
Saturation level, 81 
Sawtooth wave, 331, 605, 610 
Scalar product 
see Inner product 
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Schaefer model for fish 
population, 90 
Schrodinger, Erwin, 340 
Self-adjoint: 

boundary value problem, 
681-682, 706 
equation, 157 
matrix, 382 

Separable equations, 27, 42-50 
Separated boundary conditions, 
674 

Separation constant, 614, 624, 
633,648,667,711 
Separation of variables, 611 
further remarks, 709-716 
for heat conduction equation, 
614, 624 

for Laplace’s equation, 648 
in polar coordinates, 651 
for wave equation, 633 
in polar coordinates, 714 
Separatrix, 503, 515, 527 
Series of eigenfunctions, 

680-681, 687, 688, 706-707, 
719 

Series solution: 

existence theorem for, 262, 
289 

first order equations, 266 
indicial equation, 280, 285, 289 
near an ordinary point, 
250-268 

near a regular singular point, 
278-292 

recurrence relation, 252, 279, 
280,285 

when roots of indicial 
equation: 

are equal, 282, 288, 290, 
293-296 

differ by integer, 282, 288, 
290, 296-301 

Shift of index of summation, 
247-248 

Similar matrices, 418 
Similarity transformation, 418 
Simple eigenvalues, 380 
Simple harmonic motion, 195 


Simply connected region, 95 
Simpson, Thomas, 461 
Simpson’s rule, 461 
Sine series, 605, 679 
Singular matrix, 368 
Singular point, 250, 262, 268-277 
irregular, 274, 291-292 
regular, 268-277 
at infinity, 277 

Sink 

nodal, 487 
spiral, 491 

Slope field, see Direction field 
Solution of ordinary differential 
equation, 22 

general solution of linear 
equations, 12, 32, 36, 73, 
141,150, 161,168,175, 

221 

of systems, 387 
implicit, 74 

of systems of equations, 358 
Source 
nodal, 487 
spiral, 491 

Spiral point, 403, 404, 491, 508, 
513 

Spring-mass system, 191-200, 
202-217,319,412 
three springs, two masses, 355, 
360,405-408,412 
two springs, two masses, 232 
Square integrable function, 719 
Square wave, 330, 597, 609 
Stability: 

asymptotic, 83, 122, 124, 494, 
501,547,557 
definition of, 499 
see also Converging 
solutions 

basin of attraction, 503, 515, 
527,549-551 

of critical point, 494, 498^199, 
508, 547 

exchange of, 94, 127 
global asymptotic stability, 

516 

Liapunov theorems, 546 


for linear systems, 494, 508 
for locally linear systems, 512 
of numerical method, 473-478 
orbital, 557 

region of asymptotic stability, 
503 

semistable, 88, 557 
stable, 494, 501,547,557 
definition of, 498 
unstable, 83, 122, 124, 494, 501, 
547, 557 

definition of, 499 
see also Diverging solutions 
Stairstep diagram, 126 
Star point, 488 

Steady state solution, 63, 208, 
622 

Stefan, Jozef, 62 

Stefan-Boltzmann radiation law, 
62 

Step functions, 323 
Stiff equations, 475-476 
Stokes, George Gabriel, 65 
Stokes’ law, 65 
Strange attractor, 571 
Strutt, John William, see 
Rayleigh, Lord 
Sturm, Charles-Frangois. 674 
Sturm-Liouville boundary 
value problems, 673-687 
eigenfunctions orthogonal, 

677 

eigenvalues 
real, 676 
simple, 677 

nonhomogeneous, 688-692 
self-adjointness of, 682, 706 
singular, 702-709 
continuous spectrum, 706 
Successive approximations 
method of, 113, 421 
Superposition principle, 147, 
360,386,640,710 
Sylvester, James, 364 
Systems: 

of differential equations, 19 
autonomous, 498 
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existence and uniqueness 
theorem, 358 
initial conditions, 358 
linear, 358 
nonlinear, 358 
numerical solution of, 
480^183 

reduction to, 356 
solution of, 358 
of first order linear equations, 
385-441,485-496 
definition of, 358 
diagonalization, 419-420, 
432-434 

Euler, 399, 410, 429 
existence and uniqueness 
theorem, 359 
fundamental matrix, 
413^122, 426 
fundamental set of 
solutions, 387, 388 
general solution of, 387 
homogeneous, 359, 385-390 
homogeneous with 

constant coefficients, 
390-432 

homogeneous with 

constant coefficients, 
complex eigenvalues, 
401-413, 490-493 
homogeneous with 

constant coefficients, 
real and unequal 
eigenvalues, 390-401, 
486^188 

homogeneous with 

constant coefficients, 
repeated eigenvalues, 
422-432, 488-490 
nonhomogeneous, 358, 
432-441 

nonhomogeneous, Laplace 
transform of, 438^139 


superposition of solutions, 
386 

undetermined coefficients, 
434-435 

variation of parameters, 
436-438 

of linear algebraic equations, 
373-377, 578 

T 

Tangent line method, see Euler 
method 

Tautochrone, 352-353 
Taylor, Brook, 246 
Taylor series, 158, 246 

for functions of two variables, 
458,510 

Telegraph equation, 662, 671, 
699 

Thermal diffusivity, 612, 659 
Threshold models, 84-88 
Torricelli, Evangelista, 60 
Torricelli’s principle, 60 
Trajectories, 358, 391, 486, 493, 
504-506 

almost linear systems, 512 
linear systems, 486—493 
locally linear systems, 513 
Transcritical bifurcation, 94 
Transfer function, 349 
Transfer matrix, 438 
Transient solution, 63, 208, 622 
Translation of a function, 325 
Transpose of a matrix, 364 
Triangular wave, 588, 591, 609 
Truncation error 

global, 448, 450, 470-472 
local, 448 

U 

Undetermined coefficients, 16, 
176-185, 234-238 
for systems of equations, 
434-435 

Uniqueness theorems 

see Existence and uniqueness 
theorems 


Unit step function, 323 
Laplace transform of, 325 

V 

van der Pol, Balthasar, 559 
van der Pol equation, 559-562 
Variation of parameters, 28, 41, 
185-191,239-242 
for systems of equations, 
436-438 

Vectors, 365, 594 
inner product, 367 
length, 367 

linear dependence and 
independence of, 

377-378 
magnitude, 367 
multiplication of, 366 
orthogonality, 367 
Verhulst, P. E, 79 
Verhulst equation, 79 
Vibrations: 

of elastic bar, 672-673, 699 
of elastic membrane, 644, 
710-712 

of elastic string, 631-646, 
661-663 

natural modes of, 635 
of spring-mass system, 
191-200,202-217,319, 

412 

of three spring, two mass 

system, 355, 360, 405-408, 

413 

of two spring, two mass 
system, 232 
Volterra, Vito, 534 
Volterra integral equation, 351 
Laplace transform of, 351 

W 

Wave equation, 19, 631, 698, 
710-712 

boundary value problems for, 
631-646 

derivation of, 661-663 
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dispersive waves, 645 
fundamental solutions of, 634, 
639 

general problem, 638-640 
in infinite medium, 642-644 
justification of solution, 
637-638, 644 


in polar coordinates, 710, 714 
see also Elastic string 
Wave velocity, 632, 642, 662 
Wavelength 

of vibrating string, 635 
Wronski, Josef Maria Hoene-, 
149 


Wronskian, 149, 221 
Abel’s formula for, 153, 225 
for systems of equations, 386 
Abel’s formula, 388 

Y 

Yield, maximum sustainable, 90 


